AI-powered search tool for content-based image and text similarity
Top 98.8% on sourcepulse
CLIPPyX offers system-wide search capabilities for text and images, leveraging AI for content-based and visual similarity. It targets users needing to efficiently locate files based on their visual content, embedded text, or descriptive captions, enhancing productivity for digital asset management.
How It Works
CLIPPyX utilizes OpenAI's CLIP model to generate image embeddings, storing them in a vector database for efficient similarity searches. It also employs OCR to extract text from images, embedding this text with a separate model for semantic text search. A Flask server handles search queries from various UIs, querying both image and text embeddings to return relevant results.
Quick Start & Requirements
pip install -e .
after cloning the repository.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is described as a development server, advising against production deployment without a dedicated WSGI server. The "Deep Scan" feature may impact performance on large directories. License information is not provided, which may affect commercial adoption.
2 months ago
1 day