TCM LLM for traditional Chinese medicine (TCM) knowledge and consultation
Top 72.9% on sourcepulse
ShenNong-TCM is a large language model specifically designed for Traditional Chinese Medicine (TCM). It aims to enhance LLMs' knowledge and capabilities in TCM, enabling them to provide more informed responses to medical inquiries and support the inheritance of TCM knowledge. The project is targeted at researchers and developers interested in applying LLMs to specialized medical domains.
How It Works
ShenNong-TCM is built upon the LLaMA base model and fine-tuned using the ShenNong_TCM_Dataset. This dataset was created using an "entity-centric self-instruct" method, focusing on core TCM entities and various intent scenarios. The method leverages ChatGPT to generate over 110,000 instruction-following data points related to TCM, derived from an open-source TCM knowledge graph. This approach allows for efficient and targeted fine-tuning, resulting in improved TCM-specific performance.
Quick Start & Requirements
python src/entity_centric_self_instruct.py your_KG_triples.txt your_output_file.jsonl
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project acknowledges that the TCM knowledge within the models is limited by the quality and quantity of the training data and is subject to ongoing improvement. The models are not intended for medical diagnosis or advice.
1 year ago
Inactive