fiftyone-docs-search by voxel51

LLM-powered search for FiftyOne documentation

Created 2 years ago
374 stars

Top 75.8% on SourcePulse

Project Summary

This repository provides a semantic search interface for the FiftyOne documentation, letting users query it in natural language. It targets developers and researchers working with Voxel51's FiftyOne computer vision library and offers a more intuitive way to navigate the documentation than traditional keyword search.

How It Works

The FiftyOne documentation is split into segments using LangChain splitters combined with custom Markdown parsing. Each segment is embedded with OpenAI's text-embedding-ada-002 model, and the resulting vectors are stored in and queried from the Qdrant vector search database. Splitting along the Markdown structure is intended to keep each segment coherent, which should improve the relevance of search results.
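The split, embed, and rank pipeline described above can be illustrated with a small self-contained sketch. Everything here is an assumption made for illustration and is not the project's code: `split_markdown_by_heading` is a hypothetical stand-in for the LangChain splitters and custom Markdown parsing, `toy_embed` is a word-count stand-in for text-embedding-ada-002, and a plain Python list stands in for the Qdrant collection.

```python
import math
import re


def split_markdown_by_heading(text):
    """Split Markdown into (heading, body) chunks at each '#'-style heading."""
    chunks = []
    heading, body = "", []
    for line in text.splitlines():
        if re.match(r"^#{1,6}\s", line):
            # A new heading closes the chunk collected so far, if it has content.
            if heading or any(s.strip() for s in body):
                chunks.append((heading, "\n".join(body).strip()))
            heading, body = line.lstrip("#").strip(), []
        else:
            body.append(line)
    if heading or any(s.strip() for s in body):
        chunks.append((heading, "\n".join(body).strip()))
    return chunks


def toy_embed(text, vocabulary):
    """Hypothetical stand-in for an embedding model: counts of vocabulary words."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [float(words.count(term)) for term in vocabulary]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query, chunks, vocabulary, top_k=1):
    """Return the top_k (score, heading) pairs, highest similarity first."""
    query_vec = toy_embed(query, vocabulary)
    scored = [
        (cosine_similarity(query_vec, toy_embed(f"{h} {b}", vocabulary)), h)
        for h, b in chunks
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Made-up sample text, not taken from the FiftyOne documentation.
DOC = """# Loading datasets
Load a dataset from disk or from the zoo.

# Exporting datasets
Export samples and labels to disk in common formats.
"""
VOCAB = ["load", "zoo", "export", "labels", "formats", "disk"]

if __name__ == "__main__":
    doc_chunks = split_markdown_by_heading(DOC)
    for score, heading in search("export labels", doc_chunks, VOCAB):
        print(f"{score:.3f}  {heading}")
```

In a real deployment the embedding call and the nearest-neighbor lookup would be handled by the OpenAI API and Qdrant respectively; the sketch only shows why heading-aligned chunks and vector similarity together yield a ranked list of documentation sections.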

Quick Start & Requirements

  • Install in editable mode from the root of a cloned copy of the repository: pip install -e .
  • Requires an OpenAI API key, set as an environment variable (export OPENAI_API_KEY=XXXXXXXX).
  • Requires a running Qdrant server (docker run -d -p 6333:6333 qdrant/qdrant).
  • FiftyOne documentation: https://docs.voxel51.com/
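Before running a first query, it can help to confirm that both prerequisites are in place. The following is a minimal preflight sketch using only the Python standard library; `check_prerequisites` is a hypothetical helper, not part of the fiftyone-docs-search package, and it assumes Qdrant is reachable on localhost at port 6333, as in the docker command above.

```python
import os
import socket


def check_prerequisites(env=None, host="localhost", port=6333, timeout=1.0):
    """Return a list of human-readable problems; an empty list means ready.

    Hypothetical helper for illustration. It only checks that the API key
    variable is non-empty and that a TCP connection to the Qdrant port
    succeeds; it does not validate the key or query Qdrant.
    """
    env = os.environ if env is None else env
    problems = []
    if not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    try:
        with socket.create_connection((host, port), timeout=timeout):
            pass
    except OSError:
        problems.append(f"cannot reach Qdrant at {host}:{port}")
    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    print("ready" if not issues else "\n".join(issues))
```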

Highlighted Details

  • Command-line interface (fiftyone-docs-search query <query>) with options for result count, opening URLs, scores, and document types.
  • Python API for programmatic access.
  • Versioning aligns with FiftyOne documentation versions.
  • Option to build the index locally from a cloned FiftyOne repository.

Maintenance & Community

  • Project is associated with Voxel51, the creators of the FiftyOne library.
  • Community engagement is encouraged via the FiftyOne Slack community.

Licensing & Compatibility

  • The README does not state a license for the fiftyone-docs-search repository, so terms for commercial use or closed-source linking are unspecified. The FiftyOne library itself is open source.

Limitations & Caveats

The project depends on the OpenAI API, so an API key is required and usage may incur costs. Because the README specifies no license for the fiftyone-docs-search package, its suitability for commercial use is unclear.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 1 star in the last 30 days
