ancient_text_generation_LLM  by JianXiao2021

LLM for modern Chinese to classical Chinese translation

Created 1 year ago
277 stars

Top 93.7% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides a large language model capable of translating modern Chinese sentences into classical Chinese (Wenyan). It is built upon the Xunzi base model and fine-tuned using LoRA on a parallel corpus of classical and modern Chinese texts, targeting researchers and developers working with historical Chinese linguistics or text generation.

How It Works

The model leverages LoRA (Low-Rank Adaptation) for efficient fine-tuning of a large base model. This approach allows for significant adaptation with a smaller number of trainable parameters compared to full fine-tuning, reducing computational cost and memory requirements. The training utilizes a parallel corpus of modern and classical Chinese texts to teach the model the stylistic and grammatical nuances of Wenyan.

Quick Start & Requirements

Highlighted Details

  • LoRA fine-tuning workflow provided.
  • Training process visualization via SwanLab.
  • Option to resume training from checkpoints.
  • Model merging and pushing to Hugging Face supported.
  • Local inference demo included.

Maintenance & Community

  • No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.

Licensing & Compatibility

  • The README does not explicitly state a license.

Limitations & Caveats

The project requires manual setup of the base model and data preparation. Training visualization relies on an external service (SwanLab), and the absence of a specified license may impact commercial use or redistribution.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.