DeepImageSearch  by TechyNilesh

AI-powered image search and retrieval SDK

Created 4 years ago
476 stars

Top 64.0% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

DeepImageSearch is a Python library designed for building advanced AI-powered image search systems. It enables fast and accurate text-to-image, image-to-image, and hybrid search, targeting engineers and researchers seeking robust image pattern identification with seamless Python integration and GPU acceleration.

How It Works

The library leverages multimodal embeddings (CLIP, SigLIP, EVA-CLIP) coupled with scalable vector stores like FAISS, ChromaDB, or Qdrant for efficient indexing and retrieval. It supports hybrid search combining text and image queries, LLM-powered auto-captioning, and integrates directly into agentic RAG pipelines via an MCP server and LangChain tool.

Quick Start & Requirements

Installation is straightforward via pip (pip install DeepImageSearch) or directly from GitHub. Python 3.10+ is required. GPU acceleration (CUDA, MPS) is auto-detected and recommended for performance; ensure faiss-gpu is installed if using a GPU. Optional extras ([llm], [chroma], [qdrant], [postgres], [mcp], [langchain]) enable specific functionalities. Demo scripts are available in the Demo/ folder.

Highlighted Details

  • Supports text-to-image, image-to-image, and weighted hybrid search modes.
  • Utilizes various multimodal embedding models including CLIP, SigLIP, EVA-CLIP, and legacy timm models.
  • Offers flexible vector storage with FAISS (default), ChromaDB, and Qdrant, alongside JSON or PostgreSQL for metadata.
  • Features LLM-powered image captioning and direct integration for agentic workflows (MCP server, LangChain tool).

Maintenance & Community

The project is maintained by Nilesh Verma, with a GitHub repository available for contributions, bug reports, and suggestions. Community channels like Discord or Slack are not explicitly mentioned.

Licensing & Compatibility

The license type is not specified in the provided README, which is a critical omission for adoption decisions. Compatibility is confirmed for Python 3.10+ and modern agentic frameworks.

Limitations & Caveats

The absence of explicit licensing information poses a significant adoption blocker. While GPU support is highlighted, performance on CPU-only systems may be limited. The project appears actively developed, with "latest v3" suggesting potential for ongoing API changes.

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
3
Issues (30d)
0
Star History
6 stars in the last 30 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Chenlin Meng Chenlin Meng(Cofounder of Pika), and
9 more.

clip-retrieval by rom1504

0.1%
3k
CLIP retrieval system for semantic search
Created 4 years ago
Updated 2 weeks ago
Feedback? Help us improve.