Evaluation framework for function-calling LLM, NexusRaven-13B
Top 86.7% on sourcepulse
NexusRaven-13B is an open-source LLM specifically designed for function calling, aiming to surpass existing state-of-the-art models in this domain. It is targeted at developers and researchers needing robust and efficient API interaction capabilities from LLMs, offering significant performance gains and commercial viability.
How It Works
NexusRaven-13B is trained for function calling, accepting Python function signatures and docstrings to generate appropriate API calls. It is designed to generalize to unseen tools and is compatible with frameworks like LangChain. The model's output often includes a "reflection" step, which the authors recommend bypassing by using a specific stop criterion (["\nReflection:"]
) to prioritize the "Initial Call" for efficiency and direct execution.
Quick Start & Requirements
pip install transformers accelerate
transformers
library. GPU recommended for inference.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The model may generate reflections that are not always helpful; using a stop criterion is recommended. It performs best with a retriever when dealing with many functions, as a large number can saturate the context window. The model can be prone to generating incorrect calls, necessitating guardrails.
1 year ago
1+ week