skills  by NoizAI

Human-like AI voice generation and bot communication skills

Created 3 weeks ago

New!

402 stars

Top 72.2% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

NoizAI/skills provides a framework for enhancing AI bots with natural, "human"-like conversational abilities through a modular "skills" system. It targets developers building interactive AI agents, offering features like local-first execution and fine-grained control over speech synthesis to create more engaging and personalized bot interactions.

How It Works

The project centers on a npx-installable "skills" management system designed for local-first, secure execution. This approach keeps sensitive data on the user's machine. Core to its functionality are character-style controls, allowing developers to tune parameters such as fillers, emotional nuances, and speaking presets to imbue bot output with a companion-like feel. It also supports production-ready voice generation, including timeline-aligned rendering and reference-audio voice cloning, for high-quality speech synthesis.

Quick Start & Requirements

  • Primary Install/Run: Use npx skills add to install skills. Examples include npx skills add NoizAI/skills --list --full-depth to list all, or npx skills add . --list --full-depth for local development.
  • Prerequisites: Node.js is required (implied by npx). An optional Noiz API key from developers.noiz.ai/api-keys is recommended for enhanced features like faster processing, emotion control, and voice cloning. Alternatively, the local Kokoro backend can be used.
  • Links: API keys: https://developers.noiz.ai/api-keys. Contribution guidelines: CONTRIBUTING.md.

Highlighted Details

  • Secure and Local-First: Skills run on your own machine, keeping sensitive text and assets localized.
  • Character-Style Controls: Tune fillers, emotion, and speaking presets for companion-like output.
  • Production-Ready Voice: Offers quick TTS generation, timeline-aligned rendering, precise duration control, and reference-audio voice cloning.
  • One-Command Delivery: Generate speech and send it as a native voice message to Feishu, Telegram, or Discord with zero extra code.
  • Available Skills: Includes TTS, chat-with-anyone, characteristic-voice, video-translation, and daily-news-caster.

Maintenance & Community

  • Community: A Discord server is available for feedback and discussion.
  • Contribution: Detailed guidelines for skill authoring, directory conventions, and pull requests are provided in CONTRIBUTING.md.

Licensing & Compatibility

  • License: Not explicitly stated in the provided README.
  • Compatibility: No specific compatibility notes for commercial use or closed-source linking are provided.

Limitations & Caveats

Full feature access, including faster processing, advanced emotion control, and voice cloning, is recommended via a Noiz API key, suggesting potential limitations or reduced functionality without one. The project appears to be actively developed, with no explicit mention of alpha/beta status or known critical bugs.

Health Check
Last Commit

6 days ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
1
Star History
404 stars in the last 24 days

Explore Similar Projects

Feedback? Help us improve.