sentient-agi
AI agent skill discovery and self-improvement framework
EvoSkill is an open-source framework designed to automatically discover and synthesize reusable agent skills, significantly enhancing AI agent performance on long-horizon tasks, particularly in coding. It targets researchers and engineers seeking to move beyond manual prompt engineering by providing a self-improving system that iterates on agent configurations.
How It Works
EvoSkill employs an evolutionary loop to refine agent performance. It begins with a base agent attempting benchmark questions, then analyzes failures to propose targeted skill or prompt improvements. A generator creates these changes, and an evaluator scores the new program variants on a validation set. The framework maintains a "frontier" of the top-N performing programs, tracked via git branches for full reproducibility, ensuring that only the best configurations survive to the next iteration.
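The loop described above can be sketched in a few lines of Python. This is a minimal illustration of a top-N frontier loop, not EvoSkill's actual code: the names Program, evolve, toy_propose, and toy_evaluate are hypothetical, the scoring function is a toy stand-in for running a validation set, and git-branch tracking is omitted.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Program:
    """Hypothetical stand-in for an agent configuration (prompt plus skills)."""
    name: str
    skills: Tuple[str, ...] = ()


def evolve(
    base: Program,
    propose: Callable[[Program], Program],
    evaluate: Callable[[Program], float],
    frontier_size: int = 3,
    iterations: int = 5,
    seed: int = 0,
) -> List[Tuple[float, Program]]:
    """Keep the top-N scored programs across iterations (assumed loop shape)."""
    rng = random.Random(seed)
    frontier = [(evaluate(base), base)]
    for _ in range(iterations):
        # Pick a surviving program and propose a targeted change to it.
        _, parent = rng.choice(frontier)
        child = propose(parent)
        if any(child == existing for _, existing in frontier):
            continue  # skip variants that are already on the frontier
        # Score the variant, then keep only the best frontier_size programs.
        frontier.append((evaluate(child), child))
        frontier.sort(key=lambda pair: pair[0], reverse=True)
        frontier = frontier[:frontier_size]
    return frontier


def toy_propose(parent: Program) -> Program:
    """Toy generator: append one new (made-up) skill to the parent."""
    new_skill = f"skill_{len(parent.skills) + 1}"
    return Program(f"{parent.name}+{new_skill}", parent.skills + (new_skill,))


def toy_evaluate(program: Program) -> float:
    """Toy evaluator: more skills score higher, capped at 1.0."""
    return min(1.0, 0.2 * len(program.skills))


if __name__ == "__main__":
    for score, program in evolve(Program("base"), toy_propose, toy_evaluate):
        print(f"{score:.1f}  {program.name}")
```

In a real run, the proposal step would come from failure analysis and the evaluator would execute the agent on held-out questions; the sketch only shows how a bounded frontier discards weaker variants each iteration.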
Quick Start & Requirements
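- Install: uv sync or pip install -e .
- API key: set the key for the chosen agent SDK (e.g., export ANTHROPIC_API_KEY=your-key-here).
- SDK and model: selected with the --sdk and --model flags (e.g., claude, opencode with deepseek-ai/DeepSeek-V3, google/gemini-2.0-flash-exp).
- Datasets: place benchmark data in the .dataset/ directory (e.g., .dataset/dabstep_data.csv, .dataset/seal-0.csv).
- Python API: available via src.api.
Highlighted Details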
Maintenance & Community
The project is associated with research efforts, indicated by a 2025 Zenodo publication for a related framework. The README does not list community channels (such as Discord or Slack) or contributor details.
Licensing & Compatibility
The project is licensed under the Apache 2.0 License. This license is permissive and generally compatible with commercial use and linking within closed-source projects.
Limitations & Caveats
Operation requires specific API keys for agent SDKs and Docker for secure code execution during evaluations. Manual preparation of benchmark datasets into the specified directory structure is necessary before running the framework.
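Because these prerequisites are easy to miss, a small preflight check can help before a long run. The sketch below is not part of EvoSkill: the preflight function is hypothetical, and it assumes the Anthropic key, a docker executable on PATH, and CSV files in the dataset directory are what a given setup needs.

```python
import os
import shutil
from pathlib import Path
from typing import List, Mapping, Optional


def preflight(
    env: Optional[Mapping[str, str]] = None,
    dataset_dir: str = ".dataset",
    key_name: str = "ANTHROPIC_API_KEY",
) -> List[str]:
    """Return a list of human-readable problems; an empty list means ready."""
    env = os.environ if env is None else env
    problems = []
    # An SDK API key must be present in the environment.
    if not env.get(key_name):
        problems.append(f"{key_name} is not set")
    # Docker is needed for sandboxed code execution during evaluation.
    if shutil.which("docker") is None:
        problems.append("docker executable not found on PATH")
    # Benchmark datasets are prepared manually; CSV format is assumed here.
    data = Path(dataset_dir)
    if not data.is_dir() or not any(data.glob("*.csv")):
        problems.append(f"no CSV files found in {dataset_dir}/")
    return problems


if __name__ == "__main__":
    issues = preflight()
    print("ready" if not issues else "\n".join(issues))
```

Adjust key_name when using an SDK other than the Anthropic one.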