be-more-agent by brenpoly

Customizable offline AI agent for embedded systems

Created 5 months ago

950 stars

Top 38.0% on SourcePulse

Project Summary

This project provides a framework for building a fully local, offline-first conversational AI agent on a Raspberry Pi. It targets hobbyists and engineers seeking a private, customizable, and low-latency AI assistant without cloud dependencies or API fees. The core benefit is enabling a personal AI companion that runs entirely on edge hardware.

How It Works

The agent chains several open-source components that all run on the device: Ollama serves local models such as Gemma (language) and Moondream (vision), Whisper.cpp handles speech-to-text, OpenWakeWord detects custom wake words, and Piper TTS generates low-latency neural voices. It also features reactive GUI faces, hardware-aware audio processing, and optional web search via DuckDuckGo for real-time information; web search is the one feature that needs a network connection. Because inference happens locally, conversations stay on the device and there are no recurring API costs.
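The flow described above (wake word, then speech-to-text, then the language model, then speech synthesis) can be sketched as a single conversational turn. This is a minimal illustration, not the project's actual code: the class `VoicePipeline`, its field names, and `handle_turn` are hypothetical, and each stage is passed in as a plain callable so that any real backend could be wired in behind it.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class VoicePipeline:
    """One wake-word-triggered turn. All names here are hypothetical stand-ins."""

    detect_wake_word: Callable[[bytes], bool]  # could wrap OpenWakeWord
    transcribe: Callable[[bytes], str]         # could wrap Whisper.cpp
    generate_reply: Callable[[str], str]       # could wrap a model served by Ollama
    synthesize: Callable[[str], bytes]         # could wrap Piper TTS

    def handle_turn(self, wake_audio: bytes, command_audio: bytes) -> Optional[bytes]:
        """Return synthesized reply audio, or None if the turn should be ignored."""
        # Stay idle until the wake word is heard.
        if not self.detect_wake_word(wake_audio):
            return None
        # Convert the spoken command to text; skip empty transcriptions.
        text = self.transcribe(command_audio).strip()
        if not text:
            return None
        # Ask the language model for a reply, then turn it into audio.
        reply = self.generate_reply(text)
        return self.synthesize(reply)
```

Keeping the stages as injected callables means each one can be replaced by a stub for testing on a laptop before any Raspberry Pi hardware is involved.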

Quick Start & Requirements

Primary Install/Run: Clone the repository, run ./setup.sh (installs dependencies, creates a Python virtual environment, and installs Piper TTS), activate the environment with source venv/bin/activate, then start the agent with python agent.py.
Non-default Prerequisites: Raspberry Pi 5 (recommended) or Pi 4 (4GB RAM minimum), USB microphone, speaker, LCD screen (DSI or HDMI), and Raspberry Pi Camera Module, running an up-to-date Raspberry Pi OS with git installed. Ollama is installed with its install script: curl -fsSL https://ollama.com/install.sh | sh
Links: Ollama Install Script (https://ollama.com/install.sh)
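Once Ollama is installed, a quick way to confirm that a local model responds is to call its HTTP API on the default port 11434. The sketch below uses only the Python standard library. The helper names `build_generate_request` and `ask` are hypothetical, the model tag passed in is whatever model has been pulled locally, and the source does not say whether the agent itself talks to Ollama this way or through a client library.

```python
import json
import urllib.request

# Ollama's default local endpoint for one-shot text generation.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model: str, prompt: str,
                           url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(model: str, prompt: str, timeout: float = 60.0) -> str:
    """Send the request to a running Ollama server and return the reply text."""
    request = build_generate_request(model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

Calling `ask` with the tag of a locally pulled model and a short prompt should return a text reply if the Ollama service is running; a connection error usually means the service has not started.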

Highlighted Details

100% Local Intelligence: Operates entirely offline using Ollama, Whisper.cpp, OpenWakeWord, and Piper TTS, ensuring data privacy and no API fees.
Customizable Character: Easily swap face image sequences and sound effect .wav files to create unique agent personalities.
Vision Capable: Integrates with the Moondream vision model for image description capabilities.
Hardware-Aware Audio: Automatically detects the microphone's audio settings and resamples the input, so an unsupported sample rate does not trigger ALSA errors.
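To illustrate the resampling idea behind the hardware-aware audio item, here is a minimal sketch that converts a block of samples from a microphone's native rate to 16 kHz, the rate Whisper models expect. It uses plain linear interpolation with no anti-aliasing filter, so it is a teaching example only; the function name `resample_linear` is hypothetical, and the project may well use a library resampler instead.

```python
def resample_linear(samples: list[float], src_rate: int, dst_rate: int) -> list[float]:
    """Resample mono audio by linear interpolation (no anti-aliasing filter)."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples or src_rate == dst_rate:
        return list(samples)

    out_len = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(out_len):
        pos = i * step                 # fractional position in the source
        lo = min(int(pos), last)       # sample at or before that position
        hi = min(lo + 1, last)         # next sample, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# One second of audio captured at 48 kHz becomes 16000 samples at 16 kHz.
one_second = [0.0] * 48000
assert len(resample_linear(one_second, 48000, 16000)) == 16000
```

For real speech input, a resampler with a low-pass filter is preferable when downsampling, because plain interpolation lets high frequencies alias into the speech band.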

Licensing & Compatibility

Licensed under the MIT License. This project is a fan creation for educational and hobbyist purposes, not affiliated with or endorsed by Cartoon Network. Users are responsible for the assets they integrate.

Limitations & Caveats

Requires specific Raspberry Pi hardware and peripherals. ALSA errors may be printed when the script exits; the project describes these as normal, a side effect of the audio stream being interrupted. Speech plays back too fast or too slow if the configured sample rate does not match the voice model's sample rate. Using a custom wake word requires training a new .onnx model.
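The audio speed caveat comes down to simple arithmetic: if audio generated at one sample rate is played back at another, the speed changes by the ratio of the two rates, and the pitch shifts with it. The sketch below works that out. The function names are hypothetical, and the rates in the example (22050 Hz and 16000 Hz) are illustrative values rather than settings taken from the project.

```python
import math


def playback_speed(model_rate_hz: int, playback_rate_hz: int) -> float:
    """Speed factor heard when model-rate audio is played at playback_rate_hz."""
    return playback_rate_hz / model_rate_hz


def pitch_shift_semitones(speed: float) -> float:
    """Pitch change in semitones that accompanies a given speed factor."""
    return 12.0 * math.log2(speed)


# A voice generated at 22050 Hz but played at 16000 Hz sounds slow and low.
slow = playback_speed(22050, 16000)
print(f"speed x{slow:.2f}, pitch {pitch_shift_semitones(slow):+.1f} semitones")

# The same voice played at 44100 Hz runs at double speed, one octave up.
fast = playback_speed(22050, 44100)
print(f"speed x{fast:.2f}, pitch {pitch_shift_semitones(fast):+.1f} semitones")
```

A speed factor of exactly 1.0 means the playback rate matches the voice model, which is the configuration to aim for.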
