Open platform for LLM tool learning (ICLR'24 spotlight)
Top 9.8% on sourcepulse
ToolBench is an open platform for training, serving, and evaluating Large Language Models (LLMs) for tool learning. It provides a comprehensive dataset of over 16,000 real-world APIs and a framework for fine-tuning LLMs to master these APIs, enabling them to perform complex tasks involving single and multi-tool interactions. The project targets researchers and developers looking to enhance LLM capabilities in practical API usage.
How It Works
ToolBench leverages a large-scale dataset automatically generated using ChatGPT with enhanced function call capabilities. It employs a novel Depth-First Search based Decision Tree (DFSDT) method for data annotation, which improves efficiency and handles complex instructions requiring planning and reasoning. The platform includes an API retrieval mechanism to equip LLMs with open-domain tool-using abilities and offers a robust evaluation suite, ToolEval, which correlates highly with human judgment.
Quick Start & Requirements
pip install -r requirements.txt
(Python >= 3.9)data.zip
from Google Drive.Highlighted Details
Maintenance & Community
The project is actively maintained with regular updates, including new versions of ToolEval and models like ToolLLaMA-2. A Discord server is available for community engagement.
Licensing & Compatibility
Distributed under the Apache License 2.0. Intended solely for research and educational purposes.
Limitations & Caveats
Customized API usage is currently limited to closed-domain settings, with open-domain support planned. The project requires substantial computational resources for fine-tuning.
2 months ago
1 day