use-whisper by chengsokdara

React hook for OpenAI Whisper API with speech recorder

Created 2 years ago

784 stars

Top 44.8% on SourcePulse

Project Summary

This React hook simplifies integrating OpenAI's Whisper API into web applications, offering built-in speech recording, real-time transcription, and silence removal. It targets React developers seeking to add voice input and transcription capabilities to their projects, providing a convenient abstraction over complex audio processing and API interactions.

How It Works

The hook leverages browser Web Audio APIs for recording and integrates libraries like recordrtc for cross-browser compatibility and lamejs for MP3 encoding. For silence removal, it utilizes @ffmpeg/ffmpeg. The core functionality involves capturing audio, optionally processing it (silence removal), and sending it to the OpenAI Whisper API for transcription. It supports both direct API calls and custom server integration for enhanced security and control.

Quick Start & Requirements

Install: npm i @chengsokdara/use-whisper or yarn add @chengsokdara/use-whisper
Prerequisites: OpenAI API key.
Demo: https://user-images.githubusercontent.com/2707253/224465747-0b1ee159-21dd-4cd0-af9d-6fc9b882d716.mp4
Docs: https://github.com/chengsokdara/use-whisper

Highlighted Details

Real-time streaming transcription with configurable timeSlice.
Silence removal feature to reduce API costs.
nonStop recording option with stopTimeout.
Customizable Whisper API parameters (language, response_format, temperature, prompt).
Option to handle transcription via a custom server (onTranscribe callback).

Maintenance & Community

The project is actively developed by chengsokdara, with a roadmap including React Native support. Contact information for development services is provided.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is primarily focused on web applications; React Native support is under development. The lack of an explicit license may pose a risk for commercial adoption.

use-whisper by chengsokdara

Explore Similar Projects

whisper-at by YuanGongND

LiveWhisper by Nikorasu

whispercpp by aarnphm

Whisperboard by Saik0s

Scriberr by rishikanthc

whisper-playground by saharmor

whisper-standalone-win by Purfview

speaches by speaches-ai

whisper_streaming by ufal

whisper_real_time by davabase

WhisperLive by collabora

ecoute by SevaSk