audioflare by seanoliver

AI audio playground using Cloudflare AI Workers

created 1 year ago
442 stars

Top 68.9% on sourcepulse

Project Summary

Audioflare is an AI-powered audio processing playground for developers and researchers exploring Cloudflare's AI Workers. It offers a unified interface that transcribes an audio file, then summarizes, sentiment-scores, and translates the resulting text, demonstrating a practical multi-step AI workflow within the Cloudflare ecosystem.

How It Works

The project orchestrates a series of Cloudflare AI Workers to process audio. It begins with speech-to-text transcription using OpenAI's Whisper model (run as a Cloudflare worker), then summarizes the transcript with Meta's Llama 2 model, scores its sentiment with Hugging Face's DistilBERT, and translates it into nine languages with Meta's m2m100 model. Cloudflare's AI Gateway sits in front of these worker calls, providing analytics, logging, caching, and rate limiting.
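To make the chain concrete, here is a minimal TypeScript sketch of the same four steps against Cloudflare's Workers AI REST API. It is not code from the repository: the model IDs and payload shapes follow Cloudflare's published catalog as best understood, ACCOUNT_ID and API_TOKEN are placeholders, and error handling is kept to a bare minimum.

```typescript
// Hypothetical sketch of the audioflare pipeline via the Workers AI REST API.
// ACCOUNT_ID and API_TOKEN are placeholders; model IDs follow Cloudflare's catalog.
const ACCOUNT_ID = "<cloudflare-account-id>";
const API_TOKEN = "<cloudflare-api-token>";
const BASE = `https://api.cloudflare.com/client/v4/accounts/${ACCOUNT_ID}/ai/run`;
const AUTH = { Authorization: `Bearer ${API_TOKEN}` };

async function run<T>(model: string, body: BodyInit, isJson = true): Promise<T> {
  const res = await fetch(`${BASE}/${model}`, {
    method: "POST",
    headers: isJson ? { ...AUTH, "Content-Type": "application/json" } : AUTH,
    body,
  });
  const data = await res.json();
  if (!data.success) throw new Error(JSON.stringify(data.errors));
  return data.result as T;
}

async function processAudio(audio: ArrayBuffer) {
  // 1. Speech-to-text: Whisper takes the raw audio bytes as the request body.
  const { text } = await run<{ text: string }>("@cf/openai/whisper", audio, false);

  // 2. Summarize the transcript with Llama 2.
  const { response: summary } = await run<{ response: string }>(
    "@cf/meta/llama-2-7b-chat-int8",
    JSON.stringify({
      messages: [
        { role: "system", content: "Summarize the user's text in one sentence." },
        { role: "user", content: text },
      ],
    }),
  );

  // 3. Sentiment analysis with DistilBERT (returns label/score pairs).
  const sentiment = await run<{ label: string; score: number }[]>(
    "@cf/huggingface/distilbert-sst-2-int8",
    JSON.stringify({ text }),
  );

  // 4. Translation with m2m100; audioflare repeats this for each target language.
  const { translated_text } = await run<{ translated_text: string }>(
    "@cf/meta/m2m100-1.2b",
    JSON.stringify({ text: summary, source_lang: "en", target_lang: "fr" }),
  );

  return { text, summary, sentiment, translated_text };
}
```

The same calls could also be made from inside a Cloudflare Worker via the Workers AI binding rather than the REST API; the REST form is used here only to keep the sketch self-contained.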

Quick Start & Requirements

Highlighted Details

  • Integrates Cloudflare's Speech to Text, LLM, Text Classification, and Translation AI workers.
  • Demonstrates AI Gateway for observability, caching, and rate limiting (see the sketch after this list).
  • Supports drag-and-drop for local audio files and includes sample files.
  • Calculates and displays processing time for each AI task.

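As a rough illustration of the AI Gateway and processing-time bullets above: pointing a Workers AI request at an AI Gateway base URL, instead of the direct API endpoint, is what gives Cloudflare a hook for analytics, logging, caching, and rate limiting, and wrapping the call in a timer yields a per-task duration like the one the UI displays. The gateway URL shape follows Cloudflare's documented pattern, but GATEWAY_ID and the timedRun helper are illustrative, not code from the repo.

```typescript
// Illustrative only: send a Workers AI request through an AI Gateway so the call
// appears in the gateway's analytics/logs and can be cached or rate limited.
// ACCOUNT_ID and GATEWAY_ID are placeholders for values from the Cloudflare dashboard.
const ACCOUNT_ID = "<cloudflare-account-id>";
const GATEWAY_ID = "<ai-gateway-id>";
const GATEWAY_BASE = `https://gateway.ai.cloudflare.com/v1/${ACCOUNT_ID}/${GATEWAY_ID}/workers-ai`;

async function timedRun(model: string, body: BodyInit, headers: HeadersInit) {
  const started = performance.now();
  const res = await fetch(`${GATEWAY_BASE}/${model}`, { method: "POST", headers, body });
  const result = await res.json();
  // Wall-clock duration, analogous to the per-task processing time shown in the UI.
  const elapsedMs = Math.round(performance.now() - started);
  return { result, elapsedMs };
}
```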
Maintenance & Community

Audioflare is a side project by Sean Oliver. Contributions are welcome via pull requests and issues.

Licensing & Compatibility

Distributed under the MIT License. This license permits commercial use and integration with closed-source projects.

Limitations & Caveats

Audio transcription is limited to the first 30 seconds of an uploaded file. The Llama 2 summarization model may struggle with lengthy prompts, and Cloudflare notes that its AI models are still in beta.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0
Star History
7 stars in the last 90 days

