yacy_expert by yacy

Search portal using LLMs and RAG for comprehensive question answering

Created 15 years ago

685 stars

Top 49.7% on SourcePulse

Project Summary

YaCy Expert aims to create a question-answering search engine by combining Large Language Models (LLMs) with Retrieval Augmented Generation (RAG). It targets users with large text corpora, such as those acquired through YaCy web crawling, enabling them to build domain-specific "expert" chatbots. The project leverages LLMs for semantic understanding and RAG for context delivery, moving beyond traditional keyword-based search.

How It Works

The system architecture consists of a web interface acting as a wrapper for two backend services: an LLM (inference engine) and a RAG system (knowledge base). The LLM is designed as a drop-in replacement for the OpenAI chat API, utilizing llama.cpp. The RAG system is embedded within the YaCy Expert web interface's backend and uses Faiss for efficient vector similarity search on user-provided data dumps. This approach allows for semantically relevant context retrieval from large text corpora to augment LLM responses.

Quick Start & Requirements

Installation: Requires Python, a server for the LLM (native or Docker), and one or more YaCy index dump files.
Hardware: Recommends significant RAM (64GB+) for LLM and vector index. A proof-of-concept for Raspberry Pi is in development.
Data Indexing: Involves exporting YaCy data (.jsonlist), placing it in the knowledge directory, and running python3 knowledge_indexing.py . (approx. 1 hour per 10,000 entries). Custom BERT models can be specified via .ini files.
LLM Backend: Can be configured via OpenAI API or by self-hosting an LLM using llama.cpp (Docker recommended).
Running: Start the YaCy Expert backend server and access the chat web page.
Docs: YaCy Expert README

Highlighted Details

Leverages llama.cpp for OpenAI API compatibility, enabling self-hosted LLMs.
Utilizes Faiss for efficient vector similarity search, forming the RAG knowledge base.
Designed to process YaCy data dumps (WARC, ZIM) for domain-specific knowledge.
Inspired by "Expert Systems" AI concepts, separating inference and knowledge base.

Maintenance & Community

Project is hosted on GitHub under the yacy organization.
No specific community links (Discord/Slack) or roadmap details are provided in the README.

Licensing & Compatibility

The README does not explicitly state a license. The parent yacy project is typically licensed under GPLv2.

Limitations & Caveats

Indexing large datasets (millions of entries) can take days or weeks.
The project is described as a "stub" for installation, implying it may require significant setup effort.
Performance is highly dependent on hardware, particularly RAM and GPU availability for the LLM and vector index.

Health Check

Last Commit

11 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

wikipedia-semantic-search by upstash

Semantic search engine and RAG chatbot using Wikipedia data

Created 1 year ago

Updated 1 month ago

awsdocsgpt by antimetal

AI-powered search and chat for AWS documentation

Created 2 years ago

Updated 2 years ago

Starred by

Mckay Wrigley

Mckay Wrigley(Founder of Takeoff AI).

wait-but-why-gpt by mckaywrigley

AI-powered search and chat for a blog

Created 2 years ago

Updated 2 years ago

Starred by

Bryan Helmig

Bryan Helmig(Cofounder of Zapier).

content-chatbot by mpaepper

Website content chatbot/Q&A agent

Created 2 years ago

Updated 1 year ago

Chat_with_Datawhale_langchain by logan-zou

RAG for personal knowledge base Q&A

Created 1 year ago

Updated 1 year ago

DataChad by gustavz

App for Q&A using LLMs and vector DBs

Created 2 years ago

Updated 1 year ago

ChatPDF by shibing624

RAG for local LLM, enables chat with PDF/docs

Created 2 years ago

Updated 9 months ago

DeepSeek-RAG-Chatbot by SaiAkhil066

Local RAG chatbot for private document Q\&A using advanced retrieval

Created 11 months ago

Updated 4 months ago

Starred by

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI).

RasaGPT by paulpierre

Headless LLM chatbot platform built on Rasa and Langchain

Created 2 years ago

Updated 2 months ago

Starred by

Daniel Gross

Daniel Gross(Cofounder of Safe Superintelligence),

Matei Zaharia

Matei Zaharia(Cofounder of Databricks), and

11 more.

ColBERT by stanford-futuredata

Neural search for fast, accurate retrieval over large text collections

Created 5 years ago

Updated 2 months ago

chatgpt-retrieval by techleadhd

Simple script to query your files using ChatGPT

Created 2 years ago

Updated 1 year ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Anton Troynikov

Anton Troynikov(Cofounder of Chroma), and

4 more.

kotaemon by Cinnamon

Open-source RAG UI for chatting with documents, targeting both end-users and developers

Created 1 year ago

Updated 6 months ago

Feedback? Help us improve.