Dataset maker for ChatGLM finetuning
Top 54.7% on sourcepulse
This tool assists users in creating custom datasets for fine-tuning ChatGLM models, enabling personalized AI responses. It's designed for users who want to generate or curate question-answer pairs for model training, particularly for creating specialized conversational agents.
How It Works
The tool provides a web interface built with Streamlit. Users input questions and can either manually write answers or leverage GPT's API to generate responses. The generated question-answer pairs are then saved in a JSON format compatible with several popular ChatGLM fine-tuning projects.
Quick Start & Requirements
pip install openai==0.28.0 streamlit
streamlit run dataset.py --server.port 2323
Highlighted Details
Maintenance & Community
No specific information on maintainers, community channels, or roadmap is provided in the README.
Licensing & Compatibility
The repository's license is not specified in the README. Compatibility for commercial use or closed-source linking is undetermined.
Limitations & Caveats
The project explicitly requires an older version of the openai
package (0.28.0
) due to API changes, which may pose compatibility issues with newer projects. The README does not specify the project's license.
1 year ago
1 week