Q&A interface for Airflow and Astronomer documentation
This project provides an end-to-end reference implementation for a Retrieval Augmented Generation (RAG) question-answering system, specifically tailored for Apache Airflow and Astronomer documentation. It targets developers and users seeking to build or understand LLM-powered Q&A applications, offering a comprehensive example that includes data ingestion, prompt orchestration, and feedback loops.
How It Works
The system employs a RAG architecture to ground answers in source material and improve factual accuracy. Data from various sources (Airflow docs, the Astronomer blog, GitHub PRs, Stack Overflow) is ingested, chunked using LangChain, embedded via OpenAI's models, and stored in Weaviate. Prompt orchestration involves generating multiple prompt variations, retrieving relevant documents from Weaviate, reranking them with the Cohere Reranker, and finally using GPT-4o for answer generation. Feedback loops are integrated: user ratings and LLM-based quality assessments refine the system by re-ingesting high-quality Q&A pairs as new data sources.
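The ingest-retrieve-rerank flow above can be sketched in miniature. This is an illustrative stand-in, not the project's actual code: a bag-of-words vector replaces OpenAI embeddings, an in-memory list replaces Weaviate, and keyword overlap replaces the Cohere Reranker. All function names (`chunk`, `embed`, `retrieve`) are hypothetical.

```python
from collections import Counter
import math

def chunk(text: str, size: int = 80) -> list[str]:
    """Split a document into fixed-size character chunks (stand-in for LangChain splitters)."""
    return [text[i:i + size] for i in range(0, len(text), size)]

def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words term-frequency vector (stand-in for OpenAI embeddings)."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, store: list[tuple[str, Counter]], k: int = 3) -> list[str]:
    """Vector search: return the k stored chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

# Ingestion: chunk, embed, and store documents (in-memory stand-in for Weaviate).
docs = [
    "Airflow DAGs define workflows as code with explicit dependencies.",
    "Astronomer provides a managed platform for running Apache Airflow.",
    "Weaviate is a vector database used to store document embeddings.",
]
store = [(c, embed(c)) for d in docs for c in chunk(d)]

# Query time: retrieve candidates, then rerank (here: raw keyword overlap,
# standing in for the Cohere Reranker).
query = "How does Airflow define workflows?"
candidates = retrieve(query, store, k=2)
reranked = sorted(
    candidates,
    key=lambda c: len(set(query.lower().split()) & set(c.lower().split())),
    reverse=True,
)
# The top reranked chunk would be passed to GPT-4o as grounding context.
print(reranked[0])
```

In the real system each stand-in is swapped for the production service, but the shape of the pipeline (chunk, embed, store, retrieve, rerank, generate) is the same.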
Quick Start & Requirements
A local development environment can be started with the helper script (scripts/local_dev.py).

Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats