hertz-dev  by Standard-Intelligence

Open-source base model for full-duplex conversational audio

Created 10 months ago
1,757 stars

Top 24.4% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

Hertz-dev is an open-source base model for full-duplex conversational audio, enabling real-time, two-way voice communication. It targets researchers and developers building interactive voice applications, offering a foundational model for advanced audio interaction.

How It Works

The project provides a base model for full-duplex conversational audio, allowing for simultaneous speaking and listening. It includes scripts for offline inference, a client-server architecture for live interaction, and a browser-based client using Streamlit and WebRTC for easier accessibility.

Quick Start & Requirements

  • Install: pip install -r requirements.txt (after installing PyTorch with CUDA 12.1 support: pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121). For WebRTC client: pip install -r requirements_webrtc.txt.
  • Prerequisites: Python 3.10, CUDA 12.1 (recommended). Ubuntu users may need libportaudio.
  • Models: Automatically downloaded to ./ckpt/.
  • Docs: Blog post: https://si.inc/hertz-dev/

Highlighted Details

  • Enables full-duplex conversational audio.
  • Offers offline inference, client-server, and browser-based (Streamlit + WebRTC) interaction modes.
  • Tested primarily on Ubuntu (server) and macOS (client).

Maintenance & Community

No specific community channels or contributor information is detailed in the README.

Licensing & Compatibility

The license is not specified in the provided README.

Limitations & Caveats

Inference is only confirmed to work reliably on Python 3.10 and CUDA 12.1; other versions are less tested. The client-server and WebRTC components are experimental. Remote hosting of the Streamlit client requires HTTPS and potential STUN server configuration for WebRTC connections.

Health Check
Last Commit

8 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
8 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.