DocsGPT  by arc53

Open-source RAG assistant for reliable, hallucination-free answers from knowledge sources

Created 2 years ago
17,607 stars

Top 2.7% on SourcePulse

GitHubView on GitHub
Project Summary

DocsGPT is an open-source Retrieval Augmented Generation (RAG) tool designed to provide reliable, hallucination-free answers from diverse knowledge sources. It targets developers and users needing to integrate AI-powered Q&A into applications, offering features like broad document support, web scraping, and flexible LLM integration.

How It Works

DocsGPT employs a RAG architecture to ground LLM responses in user-provided data, mitigating hallucinations. It supports a wide array of document formats (PDF, DOCX, images, etc.) and data sources (URLs, sitemaps, GitHub). The system offers built-in tooling and agentic capabilities, allowing LLMs to interact with external APIs and services for enhanced functionality.

Quick Start & Requirements

  • Install via git clone and running provided setup.sh (macOS/Linux) or setup.ps1 (Windows) scripts.
  • Requires Docker.
  • Setup involves choosing an inference option (public API, local, cloud API).
  • Access the UI at http://localhost:5173/.
  • Stop with docker compose down.
  • Detailed documentation available: Quickstart, Documentation

Highlighted Details

  • Supports over 15 document formats and various web/data sources.
  • Offers integrations like React chat widgets, Discord/Telegram bots.
  • Compatible with major LLMs (OpenAI, Google, Anthropic) and local models (Ollama, llama_cpp).
  • Features secure and scalable deployment options, including Kubernetes support.
  • Provides an API key system for managing integrations.

Maintenance & Community

  • Active development with a public roadmap.
  • Community support via Discord: discord.gg/docsgptai
  • Offers commercial support and a "Lighthouse Program" for technical leaders.

Licensing & Compatibility

  • Licensed under MIT.
  • Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

The project is actively under development, with several features slated for release in Spring 2025, including Anthropic Tool compatibility and new agent scheduling capabilities.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
72
Issues (30d)
11
Star History
133 stars in the last 30 days

Explore Similar Projects

Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Dirk Englund Dirk Englund(MIT EECS Professor and Cofounder of Axiomatic AI), and
25 more.

firecrawl by firecrawl

1.8%
74k
API service for turning websites into LLM-ready data
Created 1 year ago
Updated 1 day ago
Feedback? Help us improve.