AI companion for reading papers with a "grumpy professor" persona
Top 27.6% on sourcepulse
This project provides an AI-powered desktop application designed to enhance academic paper reading efficiency for researchers. It offers features like PDF processing, AI translation, RAG-based Q&A, and voice interaction, all delivered through a unique "mad professor" persona for a more engaging experience.
How It Works
The application employs a multi-stage pipeline: PDF ingestion and parsing (via magic-pdf
), content translation, structuring, and embedding for RAG. A PyQt6 frontend provides a split-pane interface for viewing documents and interacting with the AI. The core AI functionality leverages LLMs for Q&A and includes speech recognition (Whisper) and TTS for voice interaction, with RAG enhancing retrieval accuracy.
Quick Start & Requirements
conda
to create an environment, then pip install
dependencies.faiss-gpu
(via conda
), numpy<=2.1.1
.config.py
.download_models.py
script handles model downloads.Highlighted Details
Maintenance & Community
The project explicitly thanks the MinerU and RealtimeSTT projects. Customization of persona and voice requires manual code modification.
Licensing & Compatibility
Licensed under the Apache License. This license is permissive and generally compatible with commercial use and closed-source linking.
Limitations & Caveats
The application is primarily designed for structured academic PDFs; unstructured documents may cause errors. Concurrent audio input device switching and AI voice feedback loops (if not using headphones) are noted issues.
3 months ago
Inactive