sepia-docs by SEPIA-Framework

Self-hosted, extendable, personal, intelligent assistant framework

Created 7 years ago

251 stars

Top 99.8% on SourcePulse

Project Summary

SEPIA is a self-hosted, extendable, personal, intelligent assistant framework. It provides a modular, open-source solution for building custom digital voice assistants, targeting users who want control over their data and functionality. The framework offers a comprehensive suite of tools, including speech recognition, wake-word detection, TTS, NLU, and dialog management, enabling the creation of sophisticated voice-controlled applications.

How It Works

SEPIA operates on a client-server architecture. The SEPIA Client handles user interactions (voice, text, touch) and manages the dialog flow, while the Assist-Server acts as the "brain," processing natural language understanding, integrating smart services, and managing user accounts. A separate SEPIA STT Server provides real-time speech-to-text capabilities, supporting various open-source ASR models for flexibility and privacy. This modular design allows components to run on diverse hardware, including low-power devices like the Raspberry Pi.

Quick Start & Requirements

Server Installation: Requires Java JDK 8 or 11. Download the SEPIA-Home bundle, extract, and run the setup script (.bat for Windows, .sh for Linux/Mac).
Client Access: Use the web client (https://sepia-framework.github.io/app/) or the Android app, connecting to your SEPIA server by specifying the hostname.
Raspberry Pi Installation: Specific instructions and an automated script are available.
API Keys: Some services require API keys for functionality (e.g., navigation). Instructions for obtaining them are provided.
Resources: Optimized to run on Raspberry Pi, suggesting low resource requirements for basic operation.

Highlighted Details

Supports a wide range of smart services out-of-the-box, including news, music, timers, smart home integration (openHAB), navigation, weather, and more.
Includes a Control HUB for server management, headless clients, and smart home configuration, along with an integrated code editor for custom services and widgets.
Offers a Java SDK for developing custom services and supports custom commands in German and English, with basic support for other languages.
The SEPIA STT Server supports custom, dynamic ASR models using tools like Kaldi, Vosk, or Zamia speech.

Maintenance & Community

Users are encouraged to post questions and bug reports in the issues section.
Links to Wiki, Blog, Twitter, and Mastodon feeds are provided for news and detailed descriptions.
Discussions can be initiated via provided links.

Licensing & Compatibility

The framework is open-source. Specific license details (e.g., MIT, Apache) are not explicitly stated in the provided text, but the emphasis on self-hosting and customization suggests a permissive approach.
Components are compatible with Linux, Windows, and Mac, including Raspberry Pi.

Limitations & Caveats

Some services are optimized for German, potentially leading to mixed-language results for certain queries (e.g., news, soccer results).
Users running public servers are responsible for security and data privacy policies due to handling potentially sensitive personal information.

Health Check

Last Commit

3 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

PI-Assistant by Lucky-183

AI-driven voice assistant framework for smart homes

Created 2 years ago

Updated 3 weeks ago

CAAL by CoreWorxLab

Local voice assistant with extensible tool capabilities

Created 2 months ago

Updated 1 week ago

folotoy-server-self-hosting by FoloToy

Self-hosted server for customized toy interaction

Created 2 years ago

Updated 3 weeks ago

chatterbox-tts-api by travisvn

OpenAI-compatible TTS API with voice cloning

Created 9 months ago

Updated 2 months ago

aoai-realtime-audio-sdk by Azure-Samples

Azure OpenAI SDK for real-time audio processing with GPT-4o

Created 1 year ago

Updated 4 months ago

rhasspy by rhasspy

Offline private voice assistant

Created 6 years ago

Updated 10 months ago

bolna by bolna-ai

Voice AI agents platform for building conversational apps

Created 1 year ago

Updated 1 day ago

vosk-server by alphacep

Offline speech recognition server

Created 6 years ago

Updated 7 months ago

xiaozhi-android-client by TOM88812

Cross-platform Flutter app for AI voice/text chat

Created 1 year ago

Updated 2 weeks ago

mirotalksfu by miroslavpejic85

WebRTC SFU for scalable real-time video conferences

Created 4 years ago

Updated 1 day ago

Starred by

Jason Huggins

Jason Huggins(Creator of Selenium),

Junyang Lin

Junyang Lin(Core Maintainer at Alibaba Qwen), and

3 more.

01 by openinterpreter

Open-source voice interface for desktop, mobile, and ESP32 chips

Created 2 years ago

Updated 1 year ago

Starred by

Chaoyu Yang

Chaoyu Yang(Founder of Bento),

Nir Gazit

Nir Gazit(Cofounder of Traceloop), and

4 more.

pipecat by pipecat-ai

Open-source framework for building real-time voice and multimodal conversational AI agents

Created 2 years ago

Updated 20 hours ago

Feedback? Help us improve.