Open-source chatbot powered by the RWKV RNN language model
Top 5.4% on sourcepulse
ChatRWKV provides a ChatGPT-like conversational AI experience powered by the RWKV (Receptance Weighted Key Value) language model. It targets developers and researchers seeking an open-source alternative to Transformer-based models, offering comparable quality and scalability with improved speed and reduced VRAM usage due to its RNN architecture.
How It Works
ChatRWKV leverages the RWKV architecture, a novel RNN that achieves Transformer-level performance and scalability. Unlike traditional RNNs, RWKV processes sequences in a time-mixing manner, allowing it to capture long-range dependencies effectively. This RNN design inherently supports stateful inference, enabling efficient sequential processing and significant VRAM savings compared to Transformers, which require recomputing attention over the entire sequence.
Quick Start & Requirements
pip install rwkv
ninja
are required. Building CUDA kernels involves setting environment variables (RWKV_CUDA_ON=1
, PATH
, LD_LIBRARY_PATH
) and potentially reinstalling CUDA with VC++ extensions for Windows.Highlighted Details
cuda fp16
, INT8) for optimized speed and VRAM usage.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 months ago
1 day