Framework for text embeddings, retrieval, and reranking
Top 2.7% on sourcepulse
This framework provides an easy method to compute state-of-the-art text embeddings and perform semantic search, similarity, and reranking. It is designed for researchers and developers working with Natural Language Processing (NLP) tasks, offering access to over 10,000 pre-trained models and the ability to train custom models.
How It Works
The library leverages Siamese and Siamese-like network structures (Sentence-BERT) to generate fixed-size embeddings for sentences or paragraphs. This approach allows for efficient similarity calculations using cosine similarity on these embeddings. For improved relevance in search scenarios, it also incorporates Cross-Encoder models that directly compare pairs of texts, yielding higher accuracy but at a greater computational cost.
Quick Start & Requirements
pip install -U sentence-transformers
or conda install -c conda-forge sentence-transformers
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README states the repository contains "experimental software" and is published for "additional background details on the respective publication," suggesting it may not be intended for production use without further evaluation.
3 days ago
Inactive