AI audio playground using Cloudflare AI Workers
Audioflare is an AI-powered audio processing playground designed for developers and researchers interested in leveraging Cloudflare's AI Workers. It offers a unified interface to transcribe, summarize, analyze sentiment, and translate audio files, demonstrating a practical workflow for multi-step AI tasks within the Cloudflare ecosystem.
How It Works
The project chains a series of Cloudflare AI Workers to process audio. It begins with speech-to-text transcription using OpenAI's Whisper model, followed by text summarization via Meta's Llama-2 model. Sentiment analysis is performed on the text using Hugging Face's DistilBERT, and translation into nine languages is handled by Meta's m2m100 model. Cloudflare's AI Gateway sits in front of these worker calls and provides observability: analytics, logging, caching, and rate limiting.
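The four steps above can be sketched as a single function. This is a minimal illustration, not the project's actual code: the `AiRunner` interface, the `processAudio` function, and the `PipelineResult` type are hypothetical names, and the model identifiers and input/output field names are assumptions based on Cloudflare's public model catalog that should be checked against the current documentation.

```typescript
// Hypothetical minimal interface. Cloudflare's AI binding exposes a similar
// run(model, input) call, but this type is an assumption for illustration.
interface AiRunner {
  run(model: string, input: Record<string, unknown>): Promise<any>;
}

// Hypothetical result shape for the four-step pipeline.
interface PipelineResult {
  transcript: string;
  summary: string;
  sentiment: { label: string; score: number }[];
  translation: string;
}

async function processAudio(
  ai: AiRunner,
  audio: Uint8Array,
  targetLang: string
): Promise<PipelineResult> {
  // Step 1: speech-to-text. Every later step depends on this transcript.
  const stt = await ai.run("@cf/openai/whisper", { audio: Array.from(audio) });
  const transcript: string = stt.text;

  // Step 2: summarize the transcript with a Llama-2 chat model.
  const sum = await ai.run("@cf/meta/llama-2-7b-chat-int8", {
    prompt: `Summarize the following transcript:\n\n${transcript}`,
  });
  const summary: string = sum.response;

  // Step 3: sentiment classification of the transcript.
  const sentiment = await ai.run("@cf/huggingface/distilbert-sst-2-int8", {
    text: transcript,
  });

  // Step 4: translate the transcript (source language assumed to be English).
  const tr = await ai.run("@cf/meta/m2m100-1.2b", {
    text: transcript,
    source_lang: "english",
    target_lang: targetLang,
  });

  return { transcript, summary, sentiment, translation: tr.translated_text };
}
```

Steps 2 through 4 only need the transcript, so they could also run concurrently with `Promise.all`; they are kept sequential here to mirror the order described above.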
Quick Start & Requirements
Install: bun install
Prerequisites: Wrangler CLI (bun add wrangler --dev, wrangler login)
Setup: create a .env file with the required Cloudflare API keys.
Run: bun dev
Highlighted Details
Maintenance & Community
Audioflare is a side project by Sean Oliver. Contributions are welcome via pull requests and issues.
Licensing & Compatibility
Distributed under the MIT License. This license permits commercial use and integration with closed-source projects.
Limitations & Caveats
Audio transcription is limited to the first 30 seconds of any uploaded file. The LLM summarization model may struggle with lengthy prompts. Cloudflare's AI models are noted as being in 'beta'.
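One possible way to work around the lengthy-prompt caveat is to clip the transcript to a character budget before summarizing it. The sketch below is not part of the project: `truncateForPrompt` and its `maxChars` parameter are hypothetical, and a character count is only a rough stand-in for the model's real token limit.

```typescript
// Hypothetical helper: clip text to at most maxChars characters, cutting at
// the last space inside the budget so a word is not split in half.
function truncateForPrompt(text: string, maxChars: number): string {
  if (text.length <= maxChars) return text;
  const cut = text.slice(0, maxChars);
  const lastSpace = cut.lastIndexOf(" ");
  // Fall back to a hard cut when the clipped text contains no space.
  return (lastSpace > 0 ? cut.slice(0, lastSpace) : cut).trimEnd();
}
```

A caller would apply this to the transcript before building the summarization prompt, choosing `maxChars` conservatively for the model in use.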