Discover and explore top open-source AI tools and projects—updated daily.
Bolin97LLM for proactive medical dialogue with imperfect information
Top 50.5% on SourcePulse
Summary
MedArk addresses the challenge of Large Language Models (LLMs) struggling with imperfect initial information in multi-turn dialogues, particularly in medical consultations. It enables LLMs to proactively ask clarifying questions and retrieve relevant knowledge, thereby improving the reliability and accuracy of responses. This project is targeted at researchers and developers seeking to enhance LLM collaboration in scenarios where initial user input is incomplete or ambiguous, offering a framework to mitigate hallucinations and foster more effective human-AI interaction.
How It Works
The core innovation is the "Ask and Retrieve Knowledge framework (Ark)," which empowers LLMs to self-reason and decide whether to "ask" for more information or "tell" an answer at each dialogue turn. This decision-making process is enhanced by retrieving knowledge pertinent to the user's input. By generating action paths for dialogues and training the MedArk model on this data, the framework equips LLMs with the ability to proactively seek missing information, mitigate medical hallucinations through active knowledge retrieval, and make informed decisions about subsequent actions.
Quick Start & Requirements
hashlib, TypedDict, torch, transformers, chroma-db, gradio. A Go environment is required for construction-sft/second_pass_gen.ipynb.tagged.pkl from dialogues, building a ChromaDB vector store, and generating SFT/DPO training data via multiple Python scripts and Jupyter notebooks. This is followed by SFT and DPO training stages.deploy/depl_web_ui.py.Highlighted Details
Maintenance & Community
No specific community channels (e.g., Discord, Slack), active contributor information beyond the listed authors, or roadmap details are provided in the README. The project appears primarily linked to its associated academic publication.
Licensing & Compatibility
The README does not specify a software license. Users should verify compatibility for commercial use or integration with closed-source projects.
Limitations & Caveats
The current code is a prototype intended solely for demonstration purposes and is not production-ready. The usage instructions are described as not user-friendly, and significant manual effort is required for data preparation and model training pipelines. A Go environment is a specific, non-Python dependency for a key data generation step.
7 months ago
Inactive
microsoft