sage by farshed

Self-hosted voice chat with LLMs

Created 7 months ago
461 stars

Top 65.7% on SourcePulse

Project Summary

Sage provides a self-hosted voice chat experience with large language models, aimed at users who want privacy and control over their AI interactions. It runs with low latency on consumer hardware, can work offline when paired with a locally hosted model, and connects speech-to-text directly to LLM responses.

How It Works

Sage uses open-source speech processing models for transcription. For text generation, it supports self-hosted LLMs via Ollama and third-party providers such as DeepSeek, OpenAI, Anthropic, and Together.ai. Configuration lives in a .env file that specifies API keys and the desired models, so switching LLM backends only requires editing that file.
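As a rough illustration of how a .env file can drive backend selection, here is a minimal TypeScript sketch. It is not Sage's actual code: the variable names LLM_PROVIDER, LLM_MODEL, and LLM_API_KEY are hypothetical, as are the helper names parseEnv and resolveLlmConfig, and the model name in the usage lines is only an example. Check the repository for the keys Sage actually reads.

```typescript
// Hypothetical sketch: the env variable names below are assumptions,
// not the keys Sage actually uses.
type Provider = "ollama" | "openai" | "anthropic" | "deepseek" | "together";

interface LlmConfig {
  provider: Provider;
  model: string;
  apiKey?: string; // omitted for a local Ollama server
}

const PROVIDERS: readonly Provider[] = [
  "ollama", "openai", "anthropic", "deepseek", "together",
];

// Parse KEY=VALUE lines, skipping blank lines and # comments.
function parseEnv(text: string): Record<string, string> {
  const out: Record<string, string> = {};
  for (const raw of text.split(/\r?\n/)) {
    const line = raw.trim();
    if (line === "" || line.startsWith("#")) continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "=", or no "=" at all
    const key = line.slice(0, eq).trim();
    // Strip one pair of matching surrounding quotes, if present.
    const value = line.slice(eq + 1).trim().replace(/^(["'])(.*)\1$/, "$2");
    out[key] = value;
  }
  return out;
}

// Turn parsed variables into a validated backend configuration.
function resolveLlmConfig(env: Record<string, string | undefined>): LlmConfig {
  const name = (env["LLM_PROVIDER"] ?? "ollama").toLowerCase();
  const provider = PROVIDERS.find((p) => p === name);
  if (provider === undefined) {
    throw new Error(`Unknown LLM provider: ${name}`);
  }
  const model = env["LLM_MODEL"];
  if (!model) {
    throw new Error("LLM_MODEL must be set");
  }
  if (provider === "ollama") {
    return { provider, model }; // local backend, no API key needed
  }
  const apiKey = env["LLM_API_KEY"];
  if (!apiKey) {
    throw new Error(`LLM_API_KEY is required for ${provider}`);
  }
  return { provider, model, apiKey };
}

// Usage: switching backends means changing only the .env contents.
const exampleEnv = parseEnv("LLM_PROVIDER=ollama\nLLM_MODEL=llama3\n");
const exampleConfig = resolveLlmConfig(exampleEnv);
```

Keeping the parsing and validation in pure functions means a hosted provider can replace the local one without touching the rest of the pipeline, and a missing key fails at startup rather than mid-conversation.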

Quick Start & Requirements

  • Docker: Run bun docker-build, then bun docker-run. The UI is served at http://localhost:3000. Requires Docker.
  • Native: Install Bun, Rust, OpenSSL, LLVM, Clang, and CMake, then run setup-unix.sh or setup-win.bat. The first run on macOS takes about 20 minutes because it compiles CoreML models.
  • Models: The Docker route requires downloading kokoro-v0_19.onnx, voices.json, and ggml-large-v3-turbo.bin yourself. The native setup scripts handle these downloads.
  • Docs: https://github.com/farshed/sage

Highlighted Details

  • Self-hosted, offline pipeline for privacy.
  • Modular design for easy LLM provider switching.
  • Low latency operation on consumer hardware.
  • Supports multiple LLM providers and Ollama.

Maintenance & Community

The project is developed by farshed, though the health check below shows no commits in the last six months. The README does not specify any community engagement channels.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial or closed-source use is not specified.

Limitations & Caveats

Speech inference runs 4-5x slower in Docker than in native builds. CUDA support is listed as future work, which suggests that CUDA-based GPU acceleration is not yet available. The native setup requires a substantial list of development tools (Bun, Rust, OpenSSL, LLVM, Clang, CMake).

Health Check

  • Last commit: 6 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 2 stars in the last 30 days
