Python: Twelve Labs Video AI Integration #2009
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation and Context
This change adds video intelligence capabilities to the Agent Framework, enabling agents to understand and interact with video content.
Why is this change required?
The framework currently lacks native support for video understanding. Users who want to build agents that can analyze video content must implement custom solutions or use external tools.
What problem does it solve?
It solves the problem of video analysis by providing a complete integration with Twelve Labs' Pegasus APIs, which use multimodal foundation models to understand both visual and audio content in videos.
What scenario does it contribute to?
Description
This PR adds a new Python package
agent-framework-twelvelabsthat integrates Twelve Labs' Pegasus video intelligence APIs into the Agent Framework.What is Twelve Labs?
Twelve Labs provides video understanding APIs powered by multimodal foundation models. Unlike traditional video processing that relies solely on transcription, Twelve Labs' Pegasus model analyzes both visual and audio content to understand what's happening in videos.
Implementation approach:
The package follows established framework patterns (similar to OpenAI and Anthropic integrations) and provides:
Key features:
Integration options:
TwelveLabsToolsVideoProcessingAgentTwelveLabsClientfor custom workflowsThe implementation is fully modular and doesn't modify the core framework.
Contribution Checklist