AI assistant using Gemini 1.5 Pro for multimodal memory
Top 86.0% on sourcepulse
Insight is a personal AI assistant that leverages Gemini 1.5 Pro to answer questions based on visual and auditory input, with memory capabilities. It is designed for users seeking an AI companion that can process real-time sensory data and recall past interactions.
How It Works
The system integrates Gemini 1.5 Pro for advanced reasoning and question answering. It processes input from a webcam and microphone, enabling it to understand and respond to queries related to the user's environment and conversations. Memory is managed to retain context across interactions.
Quick Start & Requirements
pip install -r requirements.txt
.pvporcupine
, google-generativeai
, SpeechRecognition
, firebase-admin
, google-cloud-texttospeech
, picamera2
.config.py
(based on config.example.py
).python main.py
.Highlighted Details
Maintenance & Community
The project is maintained by @advaitpaliwal. Further community or roadmap information is not detailed in the README.
Licensing & Compatibility
The project is licensed under "[License Name]". The specific license type and its implications for commercial use or closed-source linking are not fully detailed.
Limitations & Caveats
The project has significant hardware dependencies, requiring a Raspberry Pi and specific peripherals. The licensing is not clearly specified, which may impact commercial adoption.
1 year ago
Inactive