Fun demo app for generating David Attenborough-style narrations
This project provides a Python application that captures video frames from a webcam, has an AI model describe what it sees, and then uses text-to-speech to narrate the user's life in the style of David Attenborough. It is aimed at users interested in building AI-powered applications and experimenting with novel AI interactions.
How It Works
The application chains two AI services. Frames saved by capture.py are sent to a vision-capable OpenAI language model, which interprets the scene and writes the narration text directly, so no separate object-detection model (such as YOLO) is needed. The generated text is then converted to speech with ElevenLabs' text-to-speech API, using a configured voice ID to mimic the documentary narration style.
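As a rough illustration of that pipeline (a sketch, not the repository's actual code), the example below base64-encodes a saved frame, asks an OpenAI vision-capable model for a narration, and sends the result to ElevenLabs' text-to-speech REST endpoint. The model name, prompt, frame path, and output filename are assumptions made for this example.

import base64
import os
import requests
from openai import OpenAI  # official OpenAI Python SDK (>=1.0)

# Assumption: capture.py has already saved a webcam frame here.
FRAME_PATH = "frames/frame.jpg"

# 1. Encode the latest webcam frame for the vision model.
with open(FRAME_PATH, "rb") as f:
    frame_b64 = base64.b64encode(f.read()).decode("utf-8")

# 2. Ask a vision-capable OpenAI model to write the narration.
client = OpenAI()  # reads OPENAI_API_KEY from the environment
chat = client.chat.completions.create(
    model="gpt-4o",  # assumption; the repo may pin a different vision model
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": "Narrate this image as if you were a nature documentary presenter."},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{frame_b64}"}},
        ],
    }],
)
narration = chat.choices[0].message.content

# 3. Convert the narration to speech via ElevenLabs' TTS endpoint.
voice_id = os.environ["ELEVENLABS_VOICE_ID"]
tts = requests.post(
    f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
    headers={"xi-api-key": os.environ["ELEVENLABS_API_KEY"]},
    json={"text": narration},
    timeout=60,
)
tts.raise_for_status()
with open("narration.mp3", "wb") as f:
    f.write(tts.content)

Running the sketch end to end requires the OPENAI_API_KEY, ELEVENLABS_API_KEY, and ELEVENLABS_VOICE_ID environment variables described in the quick start below.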
Quick Start & Requirements
pip install -r requirements.txt
export OPENAI_API_KEY=<token>
export ELEVENLABS_API_KEY=<token>
export ELEVENLABS_VOICE_ID=<voice-id>
python capture.py
python narrator.py
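capture.py and narrator.py are typically run in separate terminals: one keeps sampling the webcam while the other narrates the most recent frame. A minimal sketch of what such a capture loop could look like, assuming OpenCV and a frames/frame.jpg output path (both assumptions, not confirmed by this summary):

import os
import time
import cv2  # opencv-python

os.makedirs("frames", exist_ok=True)
cap = cv2.VideoCapture(0)  # default webcam
try:
    while True:
        ok, frame = cap.read()
        if ok:
            # Overwrite the latest frame so the narrator always reads a fresh image.
            cv2.imwrite("frames/frame.jpg", frame)
        time.sleep(2)  # capture interval (assumption)
finally:
    cap.release()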
Maintenance & Community
The project is maintained by cbh123. Further community or maintenance details are not provided in the README.
Licensing & Compatibility
The README does not specify a license. Compatibility for commercial use or closed-source linking is not mentioned.
Limitations & Caveats
The project requires API keys for two third-party services (OpenAI and ElevenLabs), both of which may incur usage costs. Narration quality depends on the chosen ElevenLabs voice and on how accurately the vision model interprets the webcam frames.