SDK for late-interaction multi-modal models
Top 44.7% on sourcepulse
Byaldi is a Python library designed to simplify the use of late-interaction multi-modal retrieval models, specifically those compatible with the ColPali framework. It targets developers and researchers looking to quickly integrate advanced multi-modal search capabilities into their applications, offering a familiar API for rapid prototyping and development.
How It Works
Byaldi acts as a lightweight wrapper around the ColPali repository, abstracting away the complexities of loading and indexing multi-modal models. It leverages ColPali's underlying engine, which supports multi-billion parameter models like ColQwen2 checkpoints, enabling efficient retrieval across various data types. The library's design prioritizes ease of use, mirroring RAGatouille's approach to minimize code required for setting up a retrieval pipeline.
Quick Start & Requirements
pip install --upgrade byaldi
pip install flash-attn
Highlighted Details
vidore/colqwen2-v1.0
).Maintenance & Community
This is a pre-release version of Byaldi, and users are encouraged to report issues. The project aims to evolve with the multi-modal ecosystem, with plans to support additional backends like VisRAG and features such as HNSW indexing and quantization.
Licensing & Compatibility
The README does not explicitly state the license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
This is a pre-release version with potential quirks and unrefined features. Performance on CPU or MPS for encoding is expected to be poor. The project is actively under development, and future updates may introduce breaking changes.
6 months ago
1 week