Discover and explore top open-source AI tools and projects—updated daily.
LYiHubAI companion for reading papers with a "grumpy professor" persona
Top 26.7% on SourcePulse
This project provides an AI-powered desktop application designed to enhance academic paper reading efficiency for researchers. It offers features like PDF processing, AI translation, RAG-based Q&A, and voice interaction, all delivered through a unique "mad professor" persona for a more engaging experience.
How It Works
The application employs a multi-stage pipeline: PDF ingestion and parsing (via magic-pdf), content translation, structuring, and embedding for RAG. A PyQt6 frontend provides a split-pane interface for viewing documents and interacting with the AI. The core AI functionality leverages LLMs for Q&A and includes speech recognition (Whisper) and TTS for voice interaction, with RAG enhancing retrieval accuracy.
Quick Start & Requirements
conda to create an environment, then pip install dependencies.faiss-gpu (via conda), numpy<=2.1.1.config.py.download_models.py script handles model downloads.Highlighted Details
Maintenance & Community
The project explicitly thanks the MinerU and RealtimeSTT projects. Customization of persona and voice requires manual code modification.
Licensing & Compatibility
Licensed under the Apache License. This license is permissive and generally compatible with commercial use and closed-source linking.
Limitations & Caveats
The application is primarily designed for structured academic PDFs; unstructured documents may cause errors. Concurrent audio input device switching and AI voice feedback loops (if not using headphones) are noted issues.
6 months ago
Inactive
docling-project