marqo  by marqo-ai

End-to-end vector search engine for text and images

created 3 years ago
4,910 stars

Top 10.3% on sourcepulse

GitHubView on GitHub
Project Summary

Marqo is an end-to-end vector search engine designed to simplify the integration of semantic search into applications. It handles text and image embedding generation, storage, and retrieval through a unified API, eliminating the need for users to manage separate ML models or vector databases. This makes it suitable for developers looking to quickly implement advanced search capabilities.

How It Works

Marqo bundles embedding generation with vector search, offering a "documents in, documents out" approach. It leverages state-of-the-art embedding models from Huggingface, PyTorch, and OpenAI, supporting both CPU and GPU. Data is stored in in-memory HNSW indexes for high-speed retrieval. The system handles preprocessing, embedding, and inference, allowing for flexible search behavior modification without model retraining. It also supports multimodal search, enabling combined text and image indexing and querying.

Quick Start & Requirements

  • Installation: Requires Docker. Run Marqo via docker run --name marqo -it -p 8882:8882 marqoai/marqo:latest. Install the client with pip install marqo.
  • Prerequisites: Docker with at least 8GB memory and 50GB storage.
  • Documentation: https://docs.marqo.ai/
  • Demos: https://www.marqo.ai/blog/

Highlighted Details

  • Supports text, image, and multimodal search with configurable embedding models.
  • Offers weighted queries for complex search scenarios and negations.
  • Scalable to hundreds of millions of documents with horizontal sharding.
  • Integrates with popular AI frameworks like Haystack, Griptape, and Langchain.

Maintenance & Community

Marqo is a community-driven project with active development. Support and discussion are available via their Discourse forum and Slack community.

Licensing & Compatibility

The project appears to be open-source, but a specific license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking would require license clarification.

Limitations & Caveats

Marqo requires Docker, which may be a barrier for some environments. The README warns against running other applications on Marqo's Vespa cluster due to automatic configuration changes.

Health Check
Last commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
22
Issues (30d)
1
Star History
96 stars in the last 90 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Mike McNeil Mike McNeil(Author of Sails.js; Cofounder of Fleet), and
10 more.

meilisearch by meilisearch

0.2%
53k
Search engine API for integrating AI-powered hybrid search
created 7 years ago
updated 1 day ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Alex Cheema Alex Cheema(Cofounder of EXO Labs), and
3 more.

Perplexica by ItzCrazyKns

0.3%
23k
AI-powered search engine alternative
created 1 year ago
updated 1 day ago
Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Zhiqiang Xie Zhiqiang Xie(Author of SGLang), and
7 more.

milvus by milvus-io

0.4%
36k
Cloud-native vector database for scalable ANN search
created 5 years ago
updated 1 day ago
Feedback? Help us improve.