SamuelSchmidgall
Multimodal agent benchmark for AI in simulated clinical diagnosis
Top 99.6% on SourcePulse
AgentClinic provides a multimodal agent benchmark for evaluating AI performance in simulated clinical environments. It addresses the need for standardized assessment of AI agents in complex medical diagnosis by simulating doctor-patient interactions. This benefits AI researchers and developers by offering a framework to test diagnostic accuracy, safety, and robustness of AI models in healthcare.
How It Works
The project simulates clinical environments using language and vision agents, enabling multimodal AI evaluation. It employs LLMs to act as doctors, patients, or measurement/moderator agents within these simulated scenarios. This approach allows for the assessment of diagnostic reasoning, the impact of simulated biases on decision-making, and the overall effectiveness of AI in clinical contexts.
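The role-based setup described above can be illustrated with a minimal Python sketch. It is not the project's actual code: the `Agent` class, the `run_case` function, the `scripted` helper, and the `DIAGNOSIS:` / `REQUEST TEST:` message prefixes are all hypothetical, and scripted replies stand in for real LLM calls so the example runs offline.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical stand-in for an LLM call: takes a prompt, returns text.
LLM = Callable[[str], str]


@dataclass
class Agent:
    """One simulated participant (doctor, patient, measurement, moderator)."""
    role: str
    llm: LLM
    history: List[str] = field(default_factory=list)

    def respond(self, message: str) -> str:
        # Keep a running transcript so each reply can depend on prior turns.
        self.history.append(message)
        prompt = f"[{self.role}]\n" + "\n".join(self.history)
        reply = self.llm(prompt)
        self.history.append(reply)
        return reply


def run_case(doctor: Agent, patient: Agent, measurement: Agent,
             moderator: Agent, correct_diagnosis: str,
             max_turns: int = 5) -> bool:
    """Run one simulated consultation; return True if judged correct."""
    message = patient.respond("Please describe your symptoms.")
    for _ in range(max_turns):
        doctor_reply = doctor.respond(message)
        if doctor_reply.startswith("DIAGNOSIS:"):
            # Assumed convention: the moderator answers "yes" or "no".
            verdict = moderator.respond(
                f"Doctor: {doctor_reply}\nCorrect: {correct_diagnosis}")
            return verdict.strip().lower() == "yes"
        if doctor_reply.startswith("REQUEST TEST:"):
            # Test requests go to the measurement agent, not the patient.
            message = measurement.respond(doctor_reply)
        else:
            message = patient.respond(doctor_reply)
    return False  # No diagnosis within the turn budget.


def scripted(replies: List[str]) -> LLM:
    """Build a fake LLM that returns the given replies in order."""
    it = iter(replies)
    return lambda prompt: next(it)


def demo() -> bool:
    doctor = Agent("doctor", scripted([
        "How long have you had the fever?",
        "REQUEST TEST: temperature",
        "DIAGNOSIS: influenza",
    ]))
    patient = Agent("patient", scripted([
        "I have a fever and a cough.",
        "About three days.",
    ]))
    measurement = Agent("measurement", scripted(["Temperature: 39.1 C"]))
    moderator = Agent("moderator", scripted(["yes"]))
    return run_case(doctor, patient, measurement, moderator, "influenza")


if __name__ == "__main__":
    print("Diagnosis judged correct:", demo())
```

In this sketch the doctor agent never sees the ground-truth diagnosis; only the moderator does, which is one plausible way to keep scoring separate from the dialogue. Replacing `scripted` with a function that calls a real model would turn the same loop into a live evaluation.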
Quick Start & Requirements
pip install -r requirements.txt
HF_mistralai/Mixtral-8x7B-v0.1).
Highlighted Details
Maintenance & Community
No specific details on maintenance, community channels (like Discord/Slack), or active contributors are provided in the README.
Licensing & Compatibility
The license type is not explicitly stated in the provided README.
Limitations & Caveats
The README notes that running evaluations, particularly with local HuggingFace models, "Can be quite slow." The MIMIC-IV dataset requires a separate approval process from PhysioNet, adding an adoption hurdle for that specific dataset.
10 months ago
Inactive