LingEcho-App  by code-100-precent

An intelligent voice interaction platform for AI

Created 2 months ago
252 stars

Top 99.6% on SourcePulse

GitHubView on GitHub
Project Summary

LingEcho is an enterprise-grade intelligent voice interaction platform providing a comprehensive AI voice solution. It integrates advanced speech recognition (ASR), text-to-speech (TTS), large language models (LLM), and real-time communication technologies. The platform targets businesses seeking to implement sophisticated AI-driven voice capabilities, offering features like real-time calls, voice cloning, and workflow automation to enhance customer engagement and operational efficiency.

How It Works

The platform is architected with a Go backend (using Gin) and a React frontend. Core functionalities are powered by integrating specialized AI services: WebRTC for real-time calls, SileroVAD for voice activity detection, ModelScope for voiceprint recognition, Whisper for ASR, and edge-tts for TTS. It supports LLMs for intelligent responses and features a visual workflow designer for automating complex business processes, alongside a knowledge base management system for intelligent data retrieval.

Quick Start & Requirements

  • Primary install: Docker Compose is the recommended method for setup (docker-compose up -d).
  • Prerequisites: Go >= 1.24.0, Node.js >= 18.0.0, npm/pnpm >= 8.0.0, Git. Python >= 3.10 is required for optional services (VAD, Voiceprint, ASR-TTS). Docker & Docker Compose are recommended.
  • Links: Live Demo: https://lingecho.com, API Documentation: http://localhost:7072/api/docs.

Highlighted Details

  • AI Character Real-time Calls leveraging WebRTC for low-latency interaction.
  • Custom Voice Cloning and Training capabilities for personalized AI assistant voices.
  • Visual Workflow Automation designer supporting multiple trigger types for complex business process automation.
  • Knowledge Base Management system with document storage, retrieval, and AI analysis features.
  • Standalone ASR (Whisper) and TTS (edge-tts) services for flexible integration.

Maintenance & Community

The project is maintained by a core team of two full-stack engineers: chenting (Project Manager, Architecture) and wangyueran (Frontend Development). Direct contact is available via email at 19511899044@163.com. No community channels like Discord or Slack are explicitly mentioned in the README.

Licensing & Compatibility

The project is licensed under the MIT License. This license is permissive and generally allows for commercial use, modification, and distribution, including within closed-source applications.

Limitations & Caveats

The provided README does not explicitly detail limitations, alpha status, or known bugs. The setup for optional Python-based services (VAD, Voiceprint, ASR-TTS) requires separate configuration and management, potentially adding complexity to the deployment.

Health Check
Last Commit

4 days ago

Responsiveness

Inactive

Pull Requests (30d)
7
Issues (30d)
1
Star History
17 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
12 more.

rasa by RasaHQ

0.1%
21k
AI framework for automating text and voice conversations
Created 9 years ago
Updated 3 weeks ago
Feedback? Help us improve.