Self-hosted voice chat with LLMs
Sage provides a self-hosted, offline voice chat experience with large language models, aimed at users who want privacy and control over their AI interactions. It runs with low latency on consumer hardware, chaining speech-to-text transcription and LLM responses into a single conversational pipeline.
How It Works
Sage uses state-of-the-art open-source speech processing models for transcription. For text generation, it supports self-hosted LLMs via Ollama or integrates with third-party providers such as Deepseek, OpenAI, Anthropic, and Together.ai. Configuration is managed through a .env file that specifies API keys and desired models, making it easy to switch between LLM backends.
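As a rough sketch, a .env along these lines would select a backend and supply credentials; the variable names below are illustrative assumptions, not keys taken from the project:

```
# Illustrative sketch only; Sage's actual variable names may differ.
LLM_PROVIDER=ollama      # one of: ollama, deepseek, openai, anthropic, together
LLM_MODEL=llama3         # model served by the chosen provider
OPENAI_API_KEY=...       # API keys only needed for third-party providers
ANTHROPIC_API_KEY=...
```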
Quick Start & Requirements
Run bun docker-build, then bun docker-run. The UI is served at http://localhost:3000. Requires Docker.
For a native build, run setup-unix.sh or setup-win.bat. The first run on macOS takes roughly 20 minutes while CoreML models compile.
The model files kokoro-v0_19.onnx, voices.json, and ggml-large-v3-turbo.bin must be downloaded for Docker; the native setup handles these downloads automatically.
Maintenance & Community
The project is actively developed by farshed. Further community engagement channels are not specified in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial or closed-source use is not specified.
Limitations & Caveats
Speech inference under Docker is roughly 4-5x slower than a native build. CUDA support is listed as future work, so GPU acceleration is not yet available. The native setup requires a substantial list of development tools.