WeClone by xming521

Digital twin one-stop solution

Created 1 year ago

16,183 stars

Top 3.0% on SourcePulse

2 Experts Love This Project

chiphuyen

Author of "AI Engineering", "Designing Machine Learning Systems"

hiyouga

Author of LLaMA-Factory

Project Summary

WeClone offers a comprehensive solution for creating digital personas by fine-tuning large language models (LLMs) with personal chat history. It targets users who want to imbue AI with their unique communication style or preserve digital legacies, enabling personalized chatbots and voice cloning.

How It Works

The project leverages the LLaMA Factory framework for fine-tuning LLMs, specifically using LoRA or QLoRA techniques for efficient parameter adaptation. Chat data, extracted via PyWxDump and preprocessed to remove PII and filter content, is used to train models like Qwen2.5-7B-Instruct. A separate module, WeClone-audio, handles voice cloning using smaller models and WeChat voice messages. The fine-tuned models can then be deployed via an API service and integrated with various chatbot platforms like AstrBot.

Quick Start & Requirements

Install dependencies using uv venv .venv --python=3.10 and uv pip install --group main -e ..
Requires Python 3.10+, PyTorch with CUDA support.
Recommended: 16GB+ VRAM for 7B models with LoRA/QLoRA.
Data extraction requires PyWxDump.
Official Docs: https://github.com/xming521/WeClone

Highlighted Details

Full pipeline for digital persona creation: data export, preprocessing, model training, and deployment.
Supports WeChat chat history fine-tuning for LLMs and voice cloning via WeClone-audio.
Integrates with multiple chatbot platforms (WeChat, Telegram, QQ, etc.) via an API service.
Offers data preprocessing to remove PII and filter content using a customizable blocked words list.

Maintenance & Community

Project is under active development with recent refactoring in v0.2.0.
Open to Issues and Pull Requests; discussions for new features are encouraged via Issues.
Development dependencies include pytest, pyright, and ruff.

Licensing & Compatibility

The repository does not explicitly state a license.
The extensive disclaimer warns against illegal use, privacy theft, and requires deletion of code within 24 hours of download, implying a non-commercial, personal-use-only restriction.

Limitations & Caveats

The project is in a rapid iteration phase, and current results are not final.
Windows environment is not strictly tested; WSL is recommended.
Fine-tuning effectiveness heavily depends on model size and data quantity/quality.
Tool calling capabilities are disabled after fine-tuning; system prompts must be manually configured in integrated bots.

Health Check

Last Commit

5 days ago

Responsiveness

1 day

Pull Requests (30d)

2

Issues (30d)

4

Star History

309 stars in the last 30 days

Explore Similar Projects

Soul-of-Waifu by jofizcd

AI companion app for interacting with characters

Created 2 years ago

Updated 4 months ago

xtts2-ui by BoltzmannEntropy

UI for text-based voice cloning using a 10-second audio sample

Created 2 years ago

Updated 1 year ago

muvtuber by cdfmlr

AI VTuber for Bilibili, with support for other platforms in development

Created 2 years ago

Updated 1 year ago

sesame_csm_openai by phildougherty

OpenAI-compatible TTS API for voice cloning

Created 10 months ago

Updated 3 months ago

unity-AI-Chat-Toolkit by zhangliwei7758

Unity toolkit for AI chat functionality

Created 2 years ago

Updated 6 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

ChatdollKit by uezo

3D virtual assistant SDK for voice-enabled chatbots using 3D models

Created 5 years ago

Updated 1 month ago

VideoChat by Henry-23

Digital human for real-time voice interaction

Created 1 year ago

Updated 3 weeks ago

Starred by

Georgi Gerganov

Georgi Gerganov(Author of llama.cpp, whisper.cpp).

ChatterUI by Vali-98

Mobile app frontend for LLMs

Created 2 years ago

Updated 1 month ago

Master-AI-BOT by yesbhautik

Telegram bot for GPT-4 Turbo access with unique chat modes

Created 2 years ago

Updated 1 year ago

bilibot by linyiLYi

Local chatbot fine-tuned with user comments

Created 1 year ago

Updated 1 year ago

Linly-Talker by Kedreamix

Digital avatar conversational system

Created 2 years ago

Updated 10 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory).

AstrBot by AstrBotDevs

LLM chatbot/framework for multiple platforms

Created 3 years ago

Updated 10 hours ago

Feedback? Help us improve.