Discover and explore top open-source AI tools and projects—updated daily.
AI assistant for STEM with voice/text interaction
Top 95.5% on SourcePulse
ADA (Advanced Design Assistant) is an AI assistant specializing in STEM fields, offering voice and text interaction for concise, accurate information and task assistance. It provides both a cloud-dependent online version leveraging Google Gemini and ElevenLabs for enhanced performance, and a local version reliant on user hardware and Ollama. The project is ideal for STEM professionals and researchers seeking an interactive AI assistant.
How It Works
ADA utilizes a modular architecture with distinct local and online components. The online version integrates Google Gemini for advanced natural language understanding and response generation, coupled with ElevenLabs for high-quality, low-latency Text-to-Speech (TTS). The local version relies on Ollama to serve models like Gemma, with performance directly tied to the user's hardware. Both versions employ RealtimeSTT for speech-to-text transcription and support function calling for task execution, such as accessing system information, managing timers, or performing web searches.
Quick Start & Requirements
pip install -r requirements.txt
(assuming a requirements.txt
file exists, otherwise list key packages like ollama
, websockets
, pyaudio
, RealtimeSTT
, RealtimeTTS
, torch
, google-generativeai
, opencv-python
, pillow
, mss
, psutil
, GPUtil
, elevenlabs
, python-dotenv
, python-weather
, googlemaps
)..env
file. FFmpeg is recommended for audio processing.Highlighted Details
ada_local
and ada_online
.multimodal_live_api.py
) supporting camera or screen sharing with audio.Maintenance & Community
No specific details on contributors, sponsorships, or community channels (like Discord/Slack) are provided in the README.
Licensing & Compatibility
The README does not explicitly state a license. Users should verify licensing for all dependencies, especially for commercial or closed-source use.
Limitations & Caveats
The local version's performance is heavily dependent on user hardware. The README strongly recommends the online version for better quality and speed. Some tools like to_do_list.py
are noted as not currently integrated as callable tools. The camera.py
implementation is described as returning a string rather than maintaining an open feed.
5 months ago
Inactive