alchaincyf/darwin-skill: Agent skill optimization via autonomous evolution
Top 22.3% on SourcePulse
Inspired by Andrej Karpathy's autoresearch, alchaincyf/darwin-skill provides autonomous skill optimization for agent systems, addressing the challenge of managing and improving a growing number of agent skills. It targets users of Claude Code and other skills.sh-compatible platforms, offering a system that automatically evaluates and refines skills and retains only measurably improved versions, preventing degradation over time.
How It Works
This project adapts the autoresearch paradigm to skill optimization. It implements an autonomous loop in which each SKILL.md file is treated as a program to be optimized. The system uses a dual evaluation approach: static analysis scores structural quality (up to 60 points), while runtime tests assess actual performance (up to 40 points), with performance described as the most heavily weighted factor. A core "ratchet" mechanism keeps only changes that produce a quantifiable improvement; regressions are automatically reverted via Git, so a skill's retained score never decreases. Scoring is performed by a separate sub-agent to mitigate bias.
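The ratchet described above can be illustrated with a minimal Python sketch. It is not the project's implementation: the names Score, ratchet, propose, and evaluate are hypothetical, the 60/40 split is taken from the summary above, and an in-memory "keep the best version" step stands in for the Git commit and revert that the project is described as using.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

STRUCTURE_MAX = 60  # static-analysis points, as stated in the summary
RUNTIME_MAX = 40    # runtime-test points, as stated in the summary


@dataclass(frozen=True)
class Score:
    """Hypothetical container for the two evaluation components."""
    structure: float
    runtime: float

    def __post_init__(self) -> None:
        # Reject scores outside the stated point ranges.
        if not 0 <= self.structure <= STRUCTURE_MAX:
            raise ValueError("structure score out of range")
        if not 0 <= self.runtime <= RUNTIME_MAX:
            raise ValueError("runtime score out of range")

    @property
    def total(self) -> float:
        return self.structure + self.runtime


def ratchet(
    skill_text: str,
    propose: Callable[[str], str],     # assumed: returns a candidate revision
    evaluate: Callable[[str], Score],  # assumed: an independent scorer
    rounds: int,
) -> Tuple[str, Score, List[float]]:
    """Keep a candidate only if its total score strictly improves."""
    best_text = skill_text
    best = evaluate(best_text)
    history = [best.total]
    for _ in range(rounds):
        candidate = propose(best_text)
        score = evaluate(candidate)
        if score.total > best.total:
            # Improvement: keep it (the project is described as using Git here).
            best_text, best = candidate, score
        # Otherwise the candidate is dropped, standing in for a Git revert.
        history.append(best.total)
    return best_text, best, history
```

Because a candidate is kept only on a strict improvement, the recorded history can never decrease, which is the property the ratchet is meant to guarantee. In a real setup, evaluate would be the separate scoring sub-agent rather than a local function.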
Quick Start & Requirements
Install: run npx skills add alchaincyf/darwin-skill within a compatible agent environment. Alternatively, download the darwin-skill.zip archive, extract it, and place the SKILL.md file into ~/.claude/skills/darwin-skill/.
Prerequisites: an agent that supports the SKILL.md format, such as Claude Code, Codex, OpenClaw, Trae, or CodeBuddy. No specific hardware or non-standard software dependencies are listed.
Highlighted Details
autoresearch methodology.
Maintenance & Community
The project is maintained by Huashu (@AlchainHust), with links provided to personal websites and social media channels. A related project, alchaincyf/nuwa-skill, is mentioned for skill creation. No direct community channels like Discord or Slack are listed.
Licensing & Compatibility
The project is released under the MIT License, permitting broad use, modification, and distribution, including for commercial purposes and integration into closed-source applications. It is compatible with the skills.sh ecosystem.
Limitations & Caveats
The system is not fully autonomous: a "human-in-the-loop" confirmation is mandatory between optimization phases for each skill. Its effectiveness depends on the quality and comprehensiveness of the provided test prompts (test-prompts.json) and on the quality of the scoring agents.