Language agent fine-tuning research paper
Top 93.9% on sourcepulse
This repository provides the code, data, and prompts for FireAct, a framework for fine-tuning language agents. It enables agents to interact with tools and execute tasks, targeting researchers and developers working on agent-based AI systems.
How It Works
FireAct leverages a ReAct-style approach, defining tools and tasks within dedicated directories. Data generation and experimentation are driven by generation.py
, which orchestrates agent interactions with tools and models. This allows for systematic collection of trajectories for fine-tuning, aiming to improve agent performance on complex tasks.
Quick Start & Requirements
pip install -r requirements.txt
.OPENAI_API_KEY
), SERP API key (as SERPAPI_API_KEY
), Python 3.9+.conda create -n fireact python=3.9
, conda activate fireact
).python generation.py --task hotpotqa --backend gpt-4 --promptpath default --evaluate --random --task_split val --temperature 0 --task_end_index 5
(Note: High --task_end_index
values are costly).finetune/llama_lora/
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
Data generation using GPT-4 and SERP API can be costly. The project does not explicitly state its license, which may impact commercial use. Community support channels are not readily available.
1 year ago
Inactive