natural-language-youtube-search  by haltakov

Tool to search YouTube videos using natural language

created 4 years ago
928 stars

Top 40.2% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

This project enables searching within YouTube videos using natural language queries, leveraging OpenAI's CLIP model. It is designed for researchers and users interested in content discovery and analysis within video media. The primary benefit is the ability to locate specific moments in videos based on descriptive text rather than relying on manual scrubbing or inaccurate transcriptions.

How It Works

The system extracts frames from a YouTube video at a specified interval. Each frame is then encoded into a vector representation using OpenAI's CLIP model. A natural language search query is similarly encoded by CLIP. The project identifies frames whose embeddings are most similar to the query embedding, effectively matching visual content with textual descriptions.

Quick Start & Requirements

  • Install: pip install -r requirements.txt
  • Prerequisites: Python 3.7+, OpenAI API key (for CLIP encoding).
  • Demo: Available on Google Colab and Hugging Face Spaces.

Highlighted Details

  • Utilizes OpenAI's CLIP for joint text-image embedding.
  • Demonstrates search capabilities on a San Francisco car driving video.
  • Offers a parallel project for searching Unsplash images.

Maintenance & Community

No specific information on contributors, sponsorships, or community channels is provided in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is not addressed.

Limitations & Caveats

The project requires an OpenAI API key, which may incur costs. The effectiveness of the search is dependent on the quality of CLIP's embeddings and the frame extraction interval.

Health Check
Last commit

3 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
3 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.