AI-Writer  by BlinkDL

AI writing tool for generating Chinese web novels

Created 4 years ago
3,329 stars

Top 14.5% on SourcePulse

GitHubView on GitHub
Project Summary

AI-Writer is a Chinese novel generation tool that leverages the RWKV language model, offering an alternative to GPT-style models with potentially faster inference. It targets users interested in generating fantasy and romance web novels, providing GPU acceleration for NVIDIA and AMD/Intel cards, as well as CPU-only execution.

How It Works

The project utilizes the RWKV (Receptance Weighted Key Value) model, an RNN architecture that aims to combine the parallelizable training of Transformers with the efficient inference of RNNs. This approach allows for a fixed computational cost per token, regardless of sequence length, and a context length of up to 512 tokens. The model predicts the next character based on a probability distribution derived from the preceding context.

Quick Start & Requirements

Highlighted Details

  • Supports NVIDIA and AMD/Intel GPU acceleration.
  • RWKV model architecture offers efficient inference compared to traditional Transformers.
  • Special sampling methods are used to improve generation quality for smaller models.
  • Includes a web interface for easier use.

Maintenance & Community

  • The project README indicates that the current models and information are outdated, recommending the newer RWKV-Runner project.
  • QQ groups are available for support and discussion (553456870, 325154699).
  • Telegram group: https://t.me/ai_writer

Licensing & Compatibility

  • The README does not explicitly state a license. The associated RWKV-LM repository uses the Apache 2.0 license. Compatibility for commercial use is not specified.

Limitations & Caveats

The project explicitly states that training data is derived solely from web novels, leading to a lack of common sense in generated text, and that outputs are for entertainment purposes only. The README also notes that the current models are outdated.

Health Check
Last Commit

4 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
38 stars in the last 30 days

Explore Similar Projects

Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Tri Dao Tri Dao(Chief Scientist at Together AI), and
1 more.

hnet by goombalab

1.5%
722
Hierarchical sequence modeling with dynamic chunking
Created 2 months ago
Updated 1 month ago
Feedback? Help us improve.