swift by ai-ng

Voice assistant demo powered by Groq, Cartesia, and Vercel

Created 1 year ago

586 stars

Top 55.4% on SourcePulse

1 Expert Loves This Project

Travis Fischer

Founder of Agentic

Project Summary

Swift is a fast AI voice assistant demo that chains three hosted models: one for transcription, one for response generation, and one for speech synthesis. It targets developers and power users who want a responsive, voice-first application.

How It Works

Swift uses Groq for fast inference of two models: OpenAI's Whisper for speech-to-text and Meta's Llama 3 for text generation. Cartesia's Sonic model synthesizes the spoken reply, which is streamed directly to the user. Voice Activity Detection (VAD) monitors the audio input and triggers a callback for each detected speech segment. The application is built with Next.js and TypeScript and deployed on Vercel.
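The flow described above (speech segment, transcript, reply, streamed audio) can be sketched as a small pipeline. This is a minimal illustration, not the project's actual code: the `Stages` interface, `runTurn`, and the stub stages are hypothetical names, and the stubs stand in for the Groq and Cartesia calls so the example runs without network access.

```typescript
// Hypothetical stage signatures. In the real project these would call
// Groq (Whisper, Llama 3) and Cartesia (Sonic).
interface Stages {
  transcribe: (audio: Float32Array) => Promise<string>;
  generate: (prompt: string) => Promise<string>;
  synthesize: (text: string) => AsyncIterable<Uint8Array>;
}

// Runs one voice turn: a speech segment goes in, audio chunks stream out.
async function* runTurn(
  segment: Float32Array,
  stages: Stages,
): AsyncGenerator<Uint8Array> {
  const transcript = await stages.transcribe(segment);
  if (transcript.trim() === "") return; // nothing intelligible was said
  const reply = await stages.generate(transcript);
  // Forward chunks as they arrive so playback can start before synthesis ends.
  yield* stages.synthesize(reply);
}

// Stub stages (assumptions for illustration only).
const stubStages: Stages = {
  transcribe: async (audio) => (audio.length > 0 ? "hello" : ""),
  generate: async (prompt) => `You said: ${prompt}`,
  synthesize: async function* (text) {
    // Fake audio: one chunk per word, sized by the word's length.
    for (const word of text.split(" ")) {
      yield new Uint8Array(word.length);
    }
  },
};

// Collects the byte length of every streamed chunk for one turn.
async function chunkSizes(segment: Float32Array): Promise<number[]> {
  const sizes: number[] = [];
  for await (const chunk of runTurn(segment, stubStages)) {
    sizes.push(chunk.length);
  }
  return sizes;
}
```

Modeling synthesis as an `AsyncIterable` keeps the consumer free to play each chunk on arrival, which is the property the streamed design relies on for low perceived latency.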

Quick Start & Requirements

Install dependencies: pnpm install
Start development server: pnpm dev
Requires API keys for Groq and Cartesia.
Project is a Next.js application.
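
Because both API keys are required, a startup check can fail early with a readable message. The sketch below is illustrative only: the variable names `GROQ_API_KEY` and `CARTESIA_API_KEY` are assumptions and should be confirmed against the project's README or example env file, and `missingKeys` and `assertKeys` are hypothetical helpers.

```typescript
// Assumed variable names; confirm against the project's own documentation.
const REQUIRED_KEYS = ["GROQ_API_KEY", "CARTESIA_API_KEY"] as const;

type Env = Record<string, string | undefined>;

// Returns the names of required keys that are unset or blank.
function missingKeys(env: Env): string[] {
  return REQUIRED_KEYS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Throws before any request is made, instead of failing on the first API call.
function assertKeys(env: Env): void {
  const missing = missingKeys(env);
  if (missing.length > 0) {
    throw new Error(`Missing API keys: ${missing.join(", ")}`);
  }
}
```

In a Node.js or Next.js server context this would typically be called as `assertKeys(process.env)`.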

Highlighted Details

Utilizes Groq for low-latency LLM inference.
Employs Cartesia Sonic for fast, streamed speech synthesis.
Integrates VAD for efficient speech segment detection.
Built with Next.js and TypeScript, deployed on Vercel.
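
To illustrate how speech segment detection with a callback can work, here is a simple energy-threshold sketch. The summary does not say which VAD the project uses, so this swaps in a basic RMS threshold as a stand-in and should not be read as the project's method. `VadOptions`, `rms`, and `detectSegments` are hypothetical names.

```typescript
interface VadOptions {
  frameSize: number;      // samples per analysis frame
  threshold: number;      // RMS level at or above which a frame counts as speech
  hangoverFrames: number; // quiet frames tolerated before a segment is closed
}

// Root-mean-square energy of one frame.
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) sum += frame[i] * frame[i];
  return Math.sqrt(sum / frame.length);
}

// Scans the samples frame by frame and calls onSpeechEnd once per segment.
function detectSegments(
  samples: Float32Array,
  options: VadOptions,
  onSpeechEnd: (segment: Float32Array) => void,
): void {
  const { frameSize, threshold, hangoverFrames } = options;
  let start = -1;        // sample index where the open segment began, -1 if idle
  let lastSpeechEnd = 0; // sample index just past the last speech frame
  let quiet = 0;         // consecutive quiet frames inside an open segment

  for (let i = 0; i < samples.length; i += frameSize) {
    const end = Math.min(i + frameSize, samples.length);
    const isSpeech = rms(samples.subarray(i, end)) >= threshold;
    if (isSpeech) {
      if (start < 0) start = i;
      lastSpeechEnd = end;
      quiet = 0;
    } else if (start >= 0) {
      quiet += 1;
      if (quiet > hangoverFrames) {
        onSpeechEnd(samples.slice(start, lastSpeechEnd));
        start = -1;
        quiet = 0;
      }
    }
  }
  // Flush a segment that is still open when the input ends.
  if (start >= 0) onSpeechEnd(samples.slice(start, lastSpeechEnd));
}
```

The hangover count keeps brief pauses inside a sentence from splitting it into several segments, at the cost of a short delay before the callback fires.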

Maintenance & Community

The project thanks Groq and Cartesia for providing API access. The README gives no further community or maintenance details.

Licensing & Compatibility

The README does not specify a license, so it is unclear whether commercial or closed-source use is permitted.

Limitations & Caveats

The project is presented as a demo; the README does not address production-readiness, scalability, or long-term maintenance. Using the required Groq and Cartesia API keys may incur usage costs.

Health Check

Last Commit: 1 month ago
Responsiveness: Inactive

Star History

4 stars in the last 30 days