Question answering system for PDF files
Top 55.5% on sourcepulse
This project provides a question-answering system for PDF files, specifically targeting board game rulebooks, built upon GPT-3. It's designed for avid board game fans and offers a proof-of-concept solution for quickly finding answers within complex documents.
How It Works
The system leverages a combination of In-Context Retrieval-Augmented Language Models (RALM) and Hypothetical Document Embeddings (HyDE). HyDE generates a hypothetical answer to a query, which is then embedded and used for retrieval. RALM uses these retrieved documents to augment the language model's context, enabling more accurate and relevant answers based on the PDF content.
Quick Start & Requirements
pip install -r ask-my-pdf/requirements.txt
cd ask-my-pdf/src
then execute run.sh
or run.bat
.STORAGE_SALT
, OPENAI_KEY
).Highlighted Details
Maintenance & Community
The project is maintained by mobarski, who encourages following on Twitter for updates. It is presented as a proof of concept.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
This is a proof-of-concept system and may contain bugs or unfinished features. The accuracy of answers is dependent on the quality of the PDF and the OpenAI model's performance, with potential for hallucinations.
1 year ago
Inactive