sage by farshed

Self-hosted voice chat with LLMs

created 6 months ago
434 stars

Top 69.6% on sourcepulse

Project Summary

Sage provides a self-hosted voice chat experience with large language models, aimed at users who want privacy and control over their AI interactions. It targets low latency on consumer hardware and combines speech-to-text transcription with LLM responses. The pipeline runs fully offline when paired with a self-hosted LLM.

How It Works

Sage uses open-source speech processing models for transcription. For text generation, it supports self-hosted LLMs via Ollama, or third-party providers such as Deepseek, OpenAI, Anthropic, and Together.ai. Configuration lives in a .env file that specifies API keys and the desired models, so switching LLM backends only requires editing that file.
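To illustrate how .env-driven backend switching can work, here is a minimal TypeScript sketch. It is not Sage's actual code: the variable names LLM_PROVIDER, LLM_MODEL, and LLM_API_KEY and the function loadLlmConfig are hypothetical, chosen only for this example. The real keys are defined in the project's README.

```typescript
// Hypothetical config loader. The env variable names are assumptions
// made for illustration and are not taken from Sage itself.
type Provider = "ollama" | "openai" | "anthropic" | "deepseek" | "together";

interface LlmConfig {
  provider: Provider;
  model: string;
  apiKey?: string; // a local Ollama backend needs no API key
}

const PROVIDERS: readonly Provider[] = [
  "ollama",
  "openai",
  "anthropic",
  "deepseek",
  "together",
];

function loadLlmConfig(env: Record<string, string | undefined>): LlmConfig {
  // Default to the self-hosted backend when no provider is named.
  const provider = (env.LLM_PROVIDER ?? "ollama").toLowerCase();
  if (!PROVIDERS.includes(provider as Provider)) {
    throw new Error(`Unknown LLM provider: ${provider}`);
  }

  const model = env.LLM_MODEL;
  if (!model) {
    throw new Error("LLM_MODEL must be set");
  }

  // Hosted providers need a key; fail early rather than at request time.
  const apiKey = env.LLM_API_KEY;
  if (provider !== "ollama" && !apiKey) {
    throw new Error(`LLM_API_KEY is required for provider "${provider}"`);
  }

  return { provider: provider as Provider, model, apiKey };
}
```

Bun loads .env files into process.env automatically, so a call such as loadLlmConfig(process.env) would pick up the file's values without extra tooling. Validating at startup means a missing key shows up immediately instead of on the first voice request.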

Quick Start & Requirements

  • Docker: bun docker-build then bun docker-run. UI at http://localhost:3000. Requires Docker.
  • Native: Install Bun, Rust, OpenSSL, LLVM, Clang, CMake. Run setup-unix.sh or setup-win.bat. First macOS run (~20 mins) compiles CoreML models.
  • Models: Requires downloading kokoro-v0_19.onnx, voices.json, and ggml-large-v3-turbo.bin for Docker. Native setup handles downloads.
  • Docs: https://github.com/farshed/sage
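Because the Docker path expects the three model files to be downloaded manually, a small preflight check can catch a missing file before the container is built. The following TypeScript sketch assumes a hypothetical models directory. The file names come from the list above, but the function missingModelFiles and the directory location are illustrative and not part of Sage.

```typescript
import { existsSync } from "node:fs";
import { join } from "node:path";

// File names as listed in the requirements above.
const REQUIRED_MODEL_FILES = [
  "kokoro-v0_19.onnx",
  "voices.json",
  "ggml-large-v3-turbo.bin",
];

// Returns the required files that are absent from modelDir.
// The `exists` parameter defaults to a real filesystem check and can be
// swapped for a stub when testing.
function missingModelFiles(
  modelDir: string,
  exists: (path: string) => boolean = existsSync,
): string[] {
  return REQUIRED_MODEL_FILES.filter((name) => !exists(join(modelDir, name)));
}
```

Calling missingModelFiles("./models") and printing the result before running bun docker-build would show which downloads are still outstanding. The "./models" path is only a placeholder for wherever the project expects these files.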

Highlighted Details

  • Self-hosted, offline pipeline for privacy.
  • Modular design for easy LLM provider switching.
  • Low latency operation on consumer hardware.
  • Supports multiple LLM providers and Ollama.

Maintenance & Community

The project is actively developed by farshed. Further community engagement channels are not specified in the README.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial or closed-source use is not specified.

Limitations & Caveats

Speech inference under Docker is 4-5x slower than in native builds. CUDA support is listed as future work, so CUDA-based GPU acceleration is not currently available. The native setup requires a substantial toolchain: Bun, Rust, OpenSSL, LLVM, Clang, and CMake.

Health Check

  • Last commit: 5 months ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

8 stars in the last 90 days

Explore Similar Projects

RealChar by Shaunwei

Real-time AI character/companion creation and interaction codebase

  • Top 0.1% on sourcepulse
  • 6k stars
  • Created 2 years ago, updated 1 year ago
  • Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Andre Zayarni (Cofounder of Qdrant), and 2 more.

SillyTavern by SillyTavern

LLM frontend for power users

  • Top 3.2% on sourcepulse
  • 17k stars
  • Created 2 years ago, updated 3 days ago
  • Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Pietro Schirano (Founder of MagicPath), and 1 more.