GetStream
Build real-time vision agents with any model or provider
Top 59.9% on SourcePulse
GetStream/Vision-Agents offers a framework for rapidly developing real-time video AI applications. It enables developers to integrate diverse object detection models (e.g., YOLO) and LLMs (OpenAI, Gemini, Claude) with ultra-low latency, leveraging Stream's edge network. The project targets developers building sophisticated video analysis tools for applications like sports coaching, surveillance, and interactive gaming.
How It Works
The core Agent class orchestrates LLM interactions with specialized processors. These processors execute auxiliary AI models (like YOLO for pose estimation) and manage tasks such as API calls, audio/video manipulation, and state tracking. This modular design facilitates flexible integration of various AI capabilities, feeding real-time video and audio into LLMs for analysis, optimized for low latency via Stream's edge infrastructure.
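The orchestration described above can be illustrated with a minimal Python sketch. It does not use the Vision-Agents API: `SketchAgent`, `Processor`, `Frame`, `FakePoseProcessor`, and `FrameCounter` are hypothetical names invented for this example, and the pose processor returns a fixed value instead of running a real YOLO model. The sketch only shows the general pattern of processors producing per-frame state that is then folded into the text handed to an LLM.

```python
from dataclasses import dataclass, field
from typing import Any, Protocol


@dataclass
class Frame:
    """Stand-in for one decoded video frame (hypothetical type)."""
    index: int
    pixels: bytes = b""


class Processor(Protocol):
    """Hypothetical interface: inspect a frame and return state for the LLM."""
    name: str

    def process(self, frame: Frame) -> dict[str, Any]: ...


class FakePoseProcessor:
    """Placeholder for an auxiliary model such as a YOLO pose estimator."""
    name = "pose"

    def process(self, frame: Frame) -> dict[str, Any]:
        # A real processor would run model inference on frame.pixels here.
        return {"keypoints_detected": 17, "frame": frame.index}


class FrameCounter:
    """Example of state tracking across frames."""
    name = "counter"

    def __init__(self) -> None:
        self.seen = 0

    def process(self, frame: Frame) -> dict[str, Any]:
        self.seen += 1
        return {"frames_seen": self.seen}


@dataclass
class SketchAgent:
    """Runs every processor on a frame and assembles LLM context."""
    processors: list[Processor] = field(default_factory=list)

    def build_context(self, frame: Frame) -> dict[str, dict[str, Any]]:
        # One entry per processor, keyed by the processor's name.
        return {p.name: p.process(frame) for p in self.processors}

    def prompt_for(self, frame: Frame, question: str) -> str:
        # The question comes first, followed by one line per processor.
        context = self.build_context(frame)
        lines = [f"{name}: {state}" for name, state in context.items()]
        return question + "\n" + "\n".join(lines)


if __name__ == "__main__":
    agent = SketchAgent([FakePoseProcessor(), FrameCounter()])
    print(agent.prompt_for(Frame(index=0), "How is the player's stance?"))
```

Keeping each processor behind one small interface is what makes the design modular in this sketch: adding a new capability means adding one class, and the agent loop stays unchanged.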
Quick Start & Requirements
yolo11n-pose.pt).
Highlighted Details
Maintenance & Community
Developed by Stream, the project highlights key figures in AI research. A roadmap indicates ongoing development, with planned additions including broader model support (Roboflow, Qwen3, Moondream vision) and enhanced WebRTC capabilities. Community interaction channels are not explicitly detailed.
Licensing & Compatibility
The specific open-source license is not stated in the provided README, a critical omission for adoption assessment. Compatibility is emphasized with Stream Chat and various LLM/video providers.
Limitations & Caveats
The project appears to be in active development, with several features listed as "Coming Soon" and acknowledged limitations in its underlying WebRTC library. Reliance on Stream's proprietary edge network for optimal performance may present vendor-specific integration challenges. The absence of explicit licensing information poses a significant adoption barrier.