agentic-harness-engineering by china-qijizhifeng

Observability-driven evolution for coding agents

Created 2 months ago

724 stars

Top 46.7% on SourcePulse

View on GitHub

1 Expert Loves This Project

Yaowei Zheng

Author of LLaMA-Factory

Project Summary

Summary

Agentic Harness Engineering (AHE) is an open observability system for automatically evolving coding-agent harnesses around a fixed base model. It targets researchers and engineers seeking to enhance agent performance by optimizing system prompts, tool descriptions, implementations, and middleware. AHE significantly boosts agent capabilities, demonstrated by high benchmark pass rates and harnesses that generalize across models.

How It Works

AHE uses an iterative evaluate-analyze-improve loop driven by three observability layers: component tracking (git), experience distillation (Agent Debugger processing traces), and decision support (Evolve Agent proposing evidence-backed edits). Harness components like prompts, tools, and skills are refined based on trace analysis. Each iteration's evaluation falsifies predictions, guiding further refinement and encoding general engineering experience.

Quick Start & Requirements

Requires Python ≥ 3.13, uv, and tmux. Installation: git clone, uv sync. Configure environment variables for LLM/sandbox API keys (e.g., LLM_API_KEY, E2B_API_KEY). Experiments run in E2B sandboxes (SaaS/self-hosted). Pre-build E2B templates: uv run python scripts/build_templates.py --dataset-dir /path/to/dataset -j 16. Launch experiments via ./scripts/evolve.sh configs/experiments/exp-003-simple-code-gpt54.yaml. Datasets can be local paths or referenced via dataset: "<name>@<ver>".

agentic-harness-engineering by china-qijizhifeng

Explore Similar Projects

autoharness by kayba-ai

roach-pi by tmdgusya

Aegis by GanyuanRan

agentops by boshu2

auto-harness by neosigmaai

rosetta by griddynamics

agents-md by FerroxLabs

loopkit by Archive228

a-evolve by A-EVO-Lab

EvoSkill by sentient-agi

agent-md by iamfakeguru

harness-engineering-from-cc-to-ai-coding by ZhangHanDong