Survey of video understanding via LLMs
This repository is a curated list of recent research papers, code repositories, and datasets on using Large Language Models (LLMs) for video understanding (Vid-LLMs). It targets researchers and practitioners in computer vision and natural language processing and offers a structured overview of the rapidly evolving Vid-LLM landscape.
How It Works
The project categorizes Vid-LLMs by architectural approach, such as "Video Analyzer × LLM" or "Video Embedder × LLM," and by the functional role the LLM plays: summarizer, manager, text decoder, or regressor. It also outlines pre-training and instruction-tuning strategies, including adapter-based fine-tuning methods, and provides a taxonomy of the tasks, datasets, and benchmarks relevant to Vid-LLMs.
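One way to picture this two-axis categorization is as a small data structure. The sketch below is illustrative only and is not part of the repository: the class names, the `group_by_category` helper, and the placeholder entries are all hypothetical, and the enums list only the categories named above, which the summary presents as examples rather than a complete set.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Architecture(Enum):
    # Only the two approaches named in the summary; the list may define more.
    VIDEO_ANALYZER_X_LLM = "Video Analyzer × LLM"
    VIDEO_EMBEDDER_X_LLM = "Video Embedder × LLM"


class LLMRole(Enum):
    # Functional roles of the LLM as named in the summary.
    SUMMARIZER = "summarizer"
    MANAGER = "manager"
    TEXT_DECODER = "text decoder"
    REGRESSOR = "regressor"


@dataclass(frozen=True)
class Entry:
    """A single list item (hypothetical schema, not the repository's format)."""
    title: str
    architecture: Architecture
    role: LLMRole


def group_by_category(entries):
    """Group entry titles by their (architecture, role) pair."""
    groups = defaultdict(list)
    for entry in entries:
        groups[(entry.architecture, entry.role)].append(entry.title)
    return dict(groups)


# Placeholder entries; the titles and pairings are made up for illustration.
EXAMPLE_ENTRIES = [
    Entry("Placeholder Paper A", Architecture.VIDEO_ANALYZER_X_LLM, LLMRole.SUMMARIZER),
    Entry("Placeholder Paper B", Architecture.VIDEO_ANALYZER_X_LLM, LLMRole.MANAGER),
    Entry("Placeholder Paper C", Architecture.VIDEO_EMBEDDER_X_LLM, LLMRole.TEXT_DECODER),
    Entry("Placeholder Paper D", Architecture.VIDEO_EMBEDDER_X_LLM, LLMRole.TEXT_DECODER),
]

if __name__ == "__main__":
    for (arch, role), titles in group_by_category(EXAMPLE_ENTRIES).items():
        print(f"{arch.value} / LLM as {role.value}: {', '.join(titles)}")
```

Keeping architecture and role as separate fields, rather than one combined label, lets a reader filter the list along either axis independently.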
Quick Start & Requirements
This repository is a curated list of resources, not a runnable software package. It links to external papers and code repositories, each with its own setup requirements.
Maintenance & Community
The project is actively maintained by a large team of contributors from multiple universities, including the University of Rochester and Southern University of Science and Technology. Contributions are welcomed via pull requests.
Licensing & Compatibility
The repository does not state a license of its own. It links to numerous external research papers and code repositories, each under its own license, so users must check the license of each linked project for usage terms and compatibility.
Limitations & Caveats
As a curated list, this repository provides neither executable code nor pre-trained models. Users must go to the individual linked projects for implementation details, dependencies, and usage restrictions. Because the field moves quickly, the list is updated frequently, so users should check the linked resources for their latest versions.