SkillOpt by microsoft

Text-space optimization for LLM agent skills

Created 2 months ago

13,819 stars

Top 3.8% on SourcePulse

1 Expert Loves This Project

ebursztein

Cybersecurity Lead at Google DeepMind

Project Summary

Summary

SkillOpt optimizes reusable natural-language skills for frozen LLM agents. It enables self-evolving agent capabilities by training skills via trajectory-driven edits and validation gates, without modifying core model weights. This approach benefits researchers and engineers seeking to enhance LLM agent performance and adaptability.

How It Works

SkillOpt treats agent skill training akin to neural network training, employing concepts like epochs, batch sizes, and learning rates. Its core innovation lies in "trajectory-driven edits" and "validation-gated updates" to refine skills. This method allows for iterative improvement and the generation of deployable best_skill.md artifacts without altering the underlying LLM's weights, promoting efficient skill acquisition.

Quick Start & Requirements

Installation requires Python 3.10+ and a standard pip install -e . after cloning the repository.
Configuration involves setting up API credentials for LLM providers (Azure OpenAI, OpenAI, Anthropic, Qwen) via a .env file.
Data preparation is crucial: users must provide data in a split_dir containing train/, val/, and test/ subdirectories, each with a JSON file formatted according to benchmark specifications. Benchmark datasets are not included.
An optional ALFWorld benchmark installation and WebUI dashboard are available.
A demo video is linked.

Highlighted Details

Supports training and evaluation across six benchmarks: SearchQA, ALFWorld, DocVQA, LiveMathematicianBench, Spreadsheet

Health Check

Last Commit

15 hours ago

Responsiveness

Inactive

Pull Requests (30d)

48

Issues (30d)

23

Star History

4,380 stars in the last 30 days

Explore Similar Projects

awesome-in-context-rl by dunnolab

Advancing reinforcement learning through in-context learning paradigms

Created 1 year ago

Updated 10 months ago

SkillX by zjunlp

Automating skill knowledge base construction for LLM agents

Created 6 months ago

Updated 3 weeks ago

Ctx2Skill by S1s-Z

Autonomous skill discovery for LLM context learning

Created 3 months ago

Updated 3 weeks ago

skillport by gotalab

AI agent skill management and serving toolkit

Created 7 months ago

Updated 1 month ago

opencode-agent-skills by joshuadavidthomas

Agent skills management for OpenCode

Created 8 months ago

Updated 3 days ago

MemSkill by ViktorAxelsen

Self-evolving agents with learned and evolving memory skills

Created 5 months ago

Updated 2 months ago

AutoSkill by ECNU-ICALK

Experience-driven lifelong learning for agent skill evolution

Created 5 months ago

Updated 2 months ago

SkVM by SJTU-IPADS

Compile and run LLM agent skills across heterogeneous models and harnesses

Created 3 months ago

Updated 2 weeks ago

awesome-hermes-skills by ZeroPointRepo

Curated skills for a self-improving AI agent

Created 3 months ago

Updated 3 weeks ago

Starred by

Wing Lian

Wing Lian(Founder of Axolotl AI).

SkillRL by aiming-lab

Recursive skill-augmented reinforcement learning for evolving LLM agents

Created 5 months ago

Updated 2 months ago

Starred by

Philipp Moritz

Philipp Moritz(Cofounder of Anyscale).

MetaClaw by aiming-lab

Agent learning and evolution through conversation

Created 4 months ago

Updated 1 month ago

Starred by

Pawel Garbacki

Pawel Garbacki(Cofounder of Fireworks AI),

Dan Guido

Dan Guido(Cofounder of Trail of Bits), and

4 more.

agent-lightning by microsoft

Train any AI agent with rollouts and feedback

Created 1 year ago

Updated 1 week ago

Feedback? Help us improve.