Qwen3-Medical-SFT by Zeyi-Lin

LLM fine-tuning for specialized medical chat

Created 8 months ago
261 stars

Top 97.5% on SourcePulse

Project Summary

This repository offers fine-tuned versions of the Qwen3-1.7B language model, adapted for medical-domain chat with an "R1 inference style". It targets developers and researchers seeking specialized medical LLMs, providing a dataset and scripts for both full-parameter and LoRA fine-tuning so that models adept at answering medical queries can be reproduced.

How It Works

The project fine-tunes the Qwen3-1.7B base model using either full-parameter updates or the memory-efficient LoRA technique. It trains on the delicate_medical_r1_data dataset and ships a dedicated script for each methodology (train.py and train_lora.py). The objective is a distinct "R1 inference style" tailored to medical conversations, as illustrated by the example dialogue in the README.
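
For orientation, here is a minimal sketch of what the LoRA path could look like with Transformers and PEFT. The README does not reproduce the training code, so the hyperparameters, target modules, dataset file, and field names below are illustrative assumptions, not the repository's exact values.

```python
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

MODEL_NAME = "Qwen/Qwen3-1.7B"
MAX_LENGTH = 2048  # lowering this reduces VRAM usage, as the README suggests

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

# Wrap the base model with low-rank adapters; only these small matrices train.
model = get_peft_model(
    model,
    LoraConfig(
        r=8,
        lora_alpha=32,
        lora_dropout=0.05,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
        task_type="CAUSAL_LM",
    ),
)
model.print_trainable_parameters()  # typically well under 1% of all weights

def tokenize(example):
    # Simplified concatenation; the real script likely applies Qwen's chat
    # template. Field names are assumptions about data.py's output schema.
    text = example["instruction"] + "\n" + example["output"]
    return tokenizer(text, truncation=True, max_length=MAX_LENGTH)

dataset = load_dataset("json", data_files="train.jsonl")["train"]
dataset = dataset.map(tokenize, remove_columns=dataset.column_names)

Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="output/qwen3-medical-lora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=2,
        learning_rate=1e-4,
    ),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```

The full-parameter path in train.py would presumably drop the get_peft_model wrapping and update all weights directly, which explains the higher VRAM requirement noted below.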

Quick Start & Requirements

  • Installation: pip install -r requirements.txt
  • Prerequisites: Python environment.
  • Hardware: Full-parameter fine-tuning requires 32GB of VRAM; LoRA fine-tuning requires 28GB. VRAM usage can be lowered by switching to the Qwen3-0.6B model or reducing MAX_LENGTH.
  • Data Preparation: Run python data.py for automated dataset download, preprocessing, and validation split (see the first sketch after this list).
  • Training: Run python train.py (full parameter) or python train_lora.py (LoRA).
  • Inference: Run python inference.py (full parameter) or python inference_lora.py (LoRA); see the second sketch after this list.
  • Dependencies: SwanLab for logging, HuggingFace Transformers, and the PEFT library.
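
To make the data-preparation step concrete, here is a hedged sketch of the kind of preprocessing data.py might perform. The raw file name, the field names (question, think, answer), the fixed seed, and the 90/10 split are all assumptions made for illustration.

```python
import json
import random

random.seed(42)  # assumed: fixed seed for a reproducible split

# Raw records, e.g. as downloaded from the dataset hub (file name assumed).
with open("raw_dataset.jsonl", encoding="utf-8") as f:
    records = [json.loads(line) for line in f]

samples = []
for r in records:
    # Fold the reasoning trace into the response, mimicking an R1-style
    # "<think>...</think>" answer format (assumed, not confirmed by the README).
    samples.append({
        "instruction": r["question"],
        "output": f"<think>{r['think']}</think>\n{r['answer']}",
    })

# Shuffle and carve out a validation split (90/10 ratio assumed).
random.shuffle(samples)
cut = int(len(samples) * 0.9)
for path, subset in [("train.jsonl", samples[:cut]), ("val.jsonl", samples[cut:])]:
    with open(path, "w", encoding="utf-8") as f:
        for s in subset:
            f.write(json.dumps(s, ensure_ascii=False) + "\n")
```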
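
And a minimal inference sketch for the LoRA variant, assuming inference_lora.py loads the base model plus the trained adapter via PEFT; the adapter path reuses the output directory from the training sketch above, and the prompt is a placeholder.

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE = "Qwen/Qwen3-1.7B"
tokenizer = AutoTokenizer.from_pretrained(BASE)
base_model = AutoModelForCausalLM.from_pretrained(BASE)

# Attach the LoRA adapter produced by training (path assumed, see above).
model = PeftModel.from_pretrained(base_model, "output/qwen3-medical-lora")

messages = [{"role": "user",
             "content": "What are common symptoms of iron-deficiency anemia?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
output_ids = model.generate(input_ids, max_new_tokens=512)

# Strip the prompt tokens and print only the newly generated answer,
# which should carry the model's "R1 inference style" reasoning trace.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:],
                       skip_special_tokens=True))
```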

Highlighted Details

  • Supports both full-parameter and LoRA fine-tuning methods.
  • Internal testing indicates full-parameter fine-tuning outperforms LoRA.
  • Model is specifically tuned for an "R1 inference style" in medical contexts.
  • Features automated data preparation and logging via SwanLab.

Maintenance & Community

  • The README provides no specific information regarding maintainers, community channels (e.g., Discord, Slack), or project roadmaps.
  • The README does mention integration with the SwanLab, HuggingFace Transformers, and PEFT libraries.

Licensing & Compatibility

  • The README does not specify the license for the code or the fine-tuned model, leaving its terms of use and compatibility for commercial applications unclear.

Limitations & Caveats

  • Fine-tuning the 1.7B-parameter model demands significant VRAM (28-32GB).
  • The absence of explicit licensing information poses a potential adoption blocker for commercial use.
  • Performance claims, including full-parameter fine-tuning outperforming LoRA, are based solely on internal testing.
Health Check

  • Last Commit: 7 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 16 stars in the last 30 days
