2020MEAI
LLM for Traditional Chinese Medicine clinical support
This project addresses the need for intelligent assistance in Traditional Chinese Medicine (TCM) clinical practice, specifically for syndrome diagnosis and prescription recommendation, by leveraging large language models. It aims to accelerate advancements in TCM knowledge Q&A and clinical decision support for practitioners and researchers.
How It Works
The project introduces TCMLLM-PR, a large language model fine-tuned for TCM prescription recommendation. It utilizes instruction fine-tuning on the ChatGLM base model, trained on a custom dataset comprising 68,000 data entries (10 million tokens). This dataset integrates diverse sources including real-world clinical cases, classic medical texts, and textbooks, enabling the model to generate relevant prescriptions based on patient symptoms.
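As a rough illustration of what an instruction fine-tuning record for this task might look like, the sketch below serializes symptom-to-prescription pairs as JSON lines. The field names `instruction`, `input`, and `output`, the instruction wording, and the placeholder strings are all assumptions made for illustration; the project's actual data schema is not described here.

```python
import json

# Hypothetical instruction text; the project's real prompt wording is not given here.
INSTRUCTION = "Recommend a TCM prescription for the symptoms described."


def make_record(symptoms: str, prescription: str) -> dict:
    """Build one instruction-tuning example (field names are assumed)."""
    return {
        "instruction": INSTRUCTION,
        "input": symptoms.strip(),
        "output": prescription.strip(),
    }


def to_jsonl(pairs) -> str:
    """Serialize (symptoms, prescription) pairs, one JSON object per line."""
    # ensure_ascii=False keeps Chinese text readable in the output file.
    return "\n".join(
        json.dumps(make_record(s, p), ensure_ascii=False) for s, p in pairs
    )


# Placeholder strings stand in for real clinical text.
example = to_jsonl([("<patient symptoms>", "<prescription>")])
print(example)
```

Keeping one JSON object per line makes it easy to stream a dataset of this size (tens of thousands of entries) without loading it all at once.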
Quick Start & Requirements
To get started, first download the original ChatGLM-6B model code and parameters and set up its dependency environment. Then download the TCMLLM model parameters and extract them into the ChatGLM-6B/ptuning/ directory. Copy the data examples from the project's data/ folder into ptuning/, along with the TCMLLM_output_demo.py script. Before running batch tests, set the file and data paths inside that script.
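The file-placement part of these steps can be sketched as a small helper. This is only an illustration under stated assumptions: both repositories are already checked out locally, TCMLLM_output_demo.py sits at the root of the TCMLLM checkout, and the function name `stage_files` is hypothetical. It does not download model parameters or edit the paths inside the script.

```python
import shutil
from pathlib import Path


def stage_files(tcmllm_repo: Path, chatglm_repo: Path) -> list[Path]:
    """Copy the demo script and data examples into ChatGLM-6B's ptuning/ directory.

    Both arguments are assumed to be local checkouts; nothing is downloaded.
    """
    ptuning = chatglm_repo / "ptuning"
    if not ptuning.is_dir():
        raise FileNotFoundError(f"expected directory not found: {ptuning}")

    copied = []
    # Data examples from the project's data/ folder.
    for src in sorted((tcmllm_repo / "data").iterdir()):
        if src.is_file():
            copied.append(Path(shutil.copy2(src, ptuning / src.name)))
    # The batch-testing script (assumed to live at the repo root);
    # its internal file and data paths still need editing by hand.
    script = tcmllm_repo / "TCMLLM_output_demo.py"
    copied.append(Path(shutil.copy2(script, ptuning / script.name)))
    return copied
```

A call might look like `stage_files(Path("TCMLLM"), Path("ChatGLM-6B"))`, where the `TCMLLM` directory name is again an assumption about how the repository was cloned.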
Highlighted Details
Maintenance & Community
This project was developed by the Medical Intelligence Team at Beijing Jiaotong University. Key contributors include Tian Haoyu, Dong Xin, Xu Kuan, Hua Rui, Zhao Chenxi, Wang Hongyan, Ye Mingwei, and Hu Minjie, with project leadership from Yang Kuo and Zhou Xuezhong. The project acknowledges contributions from cooperating units that provided medical data. No specific community channels (e.g., Discord, Slack) are listed.
Licensing & Compatibility
The project states that its resources are for academic research only and that commercial use is strictly prohibited. Users must also comply with the licenses of its dependencies (ChatGLM-6B, LLaMA, etc.). Because of the non-commercial restriction and the largely model-generated training data, the project is not suitable for commercial applications or for integration into closed-source systems.
Limitations & Caveats
The dataset is largely model-generated and should not be used for actual medical diagnosis. The accuracy of model outputs is not guaranteed due to inherent LLM limitations. The project disclaims all legal liability for any use or consequences arising from the model's outputs or resources. Commercial use is strictly forbidden.
1 year ago
Inactive