DocsGPT  by arc53

Open-source RAG assistant for reliable, hallucination-free answers from knowledge sources

created 2 years ago
16,866 stars

Top 2.8% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

DocsGPT is an open-source Retrieval Augmented Generation (RAG) tool designed to provide reliable, hallucination-free answers from diverse knowledge sources. It targets developers and users needing to integrate AI-powered Q&A into applications, offering features like broad document support, web scraping, and flexible LLM integration.

How It Works

DocsGPT employs a RAG architecture to ground LLM responses in user-provided data, mitigating hallucinations. It supports a wide array of document formats (PDF, DOCX, images, etc.) and data sources (URLs, sitemaps, GitHub). The system offers built-in tooling and agentic capabilities, allowing LLMs to interact with external APIs and services for enhanced functionality.

Quick Start & Requirements

  • Install via git clone and running provided setup.sh (macOS/Linux) or setup.ps1 (Windows) scripts.
  • Requires Docker.
  • Setup involves choosing an inference option (public API, local, cloud API).
  • Access the UI at http://localhost:5173/.
  • Stop with docker compose down.
  • Detailed documentation available: Quickstart, Documentation

Highlighted Details

  • Supports over 15 document formats and various web/data sources.
  • Offers integrations like React chat widgets, Discord/Telegram bots.
  • Compatible with major LLMs (OpenAI, Google, Anthropic) and local models (Ollama, llama_cpp).
  • Features secure and scalable deployment options, including Kubernetes support.
  • Provides an API key system for managing integrations.

Maintenance & Community

  • Active development with a public roadmap.
  • Community support via Discord: discord.gg/docsgptai
  • Offers commercial support and a "Lighthouse Program" for technical leaders.

Licensing & Compatibility

  • Licensed under MIT.
  • Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

The project is actively under development, with several features slated for release in Spring 2025, including Anthropic Tool compatibility and new agent scheduling capabilities.

Health Check
Last commit

5 days ago

Responsiveness

1 day

Pull Requests (30d)
23
Issues (30d)
8
Star History
1,369 stars in the last 90 days

Explore Similar Projects

Starred by Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), Tim J. Baek Tim J. Baek(Founder of Open WebUI), and
1 more.

harbor by av

0.3%
2k
CLI tool for local LLM stack orchestration
created 1 year ago
updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jeremy Howard Jeremy Howard(Cofounder of fast.ai), and
3 more.

cohere-toolkit by cohere-ai

0.2%
3k
RAG toolkit for LLM application development and deployment
created 1 year ago
updated 1 week ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Alex Cheema Alex Cheema(Cofounder of EXO Labs), and
3 more.

Perplexica by ItzCrazyKns

0.3%
23k
AI-powered search engine alternative
created 1 year ago
updated 1 day ago
Feedback? Help us improve.