Discover and explore top open-source AI tools and projects—updated daily.
kayba-aiAutonomous optimization for agent harnesses
New!
Top 96.2% on SourcePulse
This project provides an autonomous control plane for optimizing agent harnesses, enabling users to automatically improve prompt, configuration, middleware, and source code changes. It targets engineers and researchers seeking to enhance the reliability and performance of production agents by iteratively refining their underlying harnesses based on automated evaluations. The primary benefit is reduced manual effort in optimization and a more robust agent system.
How It Works
Autoharness operates by inspecting a target harness repository and defining an optimization campaign. It utilizes a guide command to set up an autoharness.yaml configuration, which specifies a benchmark command for evaluating candidate changes. The system supports various adapters for running benchmarks (e.g., pytest, harbor) and multiple proposal generators (e.g., openai_responses, codex_cli, claude_code) to create potential improvements. Iterative optimize runs generate, evaluate, and promote candidate changes, persisting state and champions within the .autoharness/ directory. This approach allows for resumable, automated optimization loops.
Quick Start & Requirements
pipx install "git+https://github.com/kayba-ai/autoharness.git"autoharness guide. For AI-assisted setup, use --assistant codex --print-next-prompt or --assistant claude --print-next-prompt.https://github.com/kayba-ai/autoharness.gitHighlighted Details
pytest, harbor) and proposal generators (e.g., openai_responses, codex_cli, claude_code).Maintenance & Community
The project is developed by Kayba and the open-source community. No specific community channels (like Discord/Slack), roadmap links, or notable contributor details are provided in the README.
Licensing & Compatibility
The license is not explicitly stated in the provided README. This lack of clarity presents a significant adoption blocker, particularly for commercial use or integration into closed-source projects.
Limitations & Caveats
Optimization outcomes are contingent on the specific benchmark, harness implementation, and evaluation setup; certain interventions may lead to regressions. The setup process can be complex, potentially requiring AI model API keys or specific configurations. The absence of a stated license is a critical caveat for adoption.
2 weeks ago
Inactive
microsoft
affaan-m