RAG app for interacting with ArXiv research papers
Top 58.8% on sourcepulse
ArXivChatGuru enables users to interact with research papers from ArXiv by leveraging LangChain and Redis. This tool is designed for researchers and developers interested in understanding Retrieval Augmented Generation (RAG) systems, offering a practical demonstration of vector databases and semantic caching in a scientific context.
How It Works
The application retrieves papers from ArXiv based on a user-provided topic. These papers are then segmented into smaller chunks, and embeddings are generated for each chunk. Redis serves as a vector database, storing these embeddings for efficient similarity search. When a user asks a question, the system retrieves the most relevant document chunks from Redis and uses them with an OpenAI model to generate an answer, showcasing the RAG process.
Quick Start & Requirements
.env
with OPENAI_API_KEY
, then run poetry install --no-root
followed by poetry run streamlit run app.py
. Alternatively, use docker compose up
after setting up .env
.Highlighted Details
Maintenance & Community
This project is from redis-developer
, indicating potential backing or focus from the Redis community. Contributions and feedback are welcomed.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project is explicitly stated as a learning tool, not a production-ready application, and is not designed for scalability. Chunking is described as "arbitrary."
3 months ago
1 day