Research paper for tool-augmented LLM access to biomedical information
Top 72.3% on sourcepulse
GeneGPT is a tool-augmented LLM designed to improve the accuracy and reliability of biomedical information retrieval by enabling LLMs to interact with NCBI Web APIs. It targets researchers and professionals in the life sciences who require precise answers to complex biological questions, offering a significant improvement over standard LLMs by reducing hallucinations and providing verifiable, tool-backed responses.
How It Works
GeneGPT employs in-context learning to teach LLMs how to utilize external tools, specifically NCBI Web APIs. A novel decoding algorithm identifies when an API call is necessary, constructs the appropriate query, executes it, and integrates the results into its response. This approach leverages the LLM's natural language understanding while grounding its answers in real-time, domain-specific data, outperforming existing biomedical LLMs and general-purpose models on specialized tasks.
Quick Start & Requirements
pip install -r requirements.txt
config.py
.python main.py <documentation_flags>
(e.g., python main.py 111111
to use all demonstrations and documentations).python evaluate.py ${RESULT_DIRECTORY}
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 months ago
1 day