repo-task-proof-loop  by DenisSergeevitch

Agentic workflow for repo-local coding tasks

Created 1 week ago

New!

598 stars

Top 54.6% on SourcePulse

GitHubView on GitHub
Project Summary

Summary Repo Task Proof Loop addresses the common failure modes of large AI coding agent tasks by enforcing a rigorous, repo-local workflow. It targets developers and researchers using AI agents for complex coding, providing a structured approach that ensures durable proof of work, separates implementation from verification, and maintains task auditability. The core benefit is a more reliable and reproducible agentic development process.

How It Works

This skill implements a strict, six-phase loop: spec freezebuildevidencefresh verifyminimal fixfresh verify. It leverages role-separated subagents (spec-freezer, builder, verifier, fixer) for both Codex and Claude Code. All task artifacts, including specifications, evidence, build outputs, and verdicts, are managed within a dedicated .agent/tasks/<TASK_ID>/ directory in the repository. This design ensures all proof resides locally, facilitates easy resumption and auditing, and maintains clear separation between implementation and verification roles.

Quick Start & Requirements

Installation involves copying the skill directory to .agents/skills/repo-task-proof-loop/ (Codex) or .claude/skills/repo-task-proof-loop/ (Claude Code). The primary requirement is a host agent product capable of spawning subagents, such as Codex or Claude Code. Users initiate tasks via agent prompts (init, status, build) after installing the skill. No external documentation links are provided beyond the README.

Highlighted Details

  • Repo-Local Artifacts: Manages a comprehensive suite of artifacts within .agent/tasks/<TASK_ID>/, including spec.md, evidence.md, evidence.json, raw build/test outputs, verdict.json, and problems.md.
  • Role-Separated Subagents: Installs four distinct subagent roles (task-spec-freezer, task-builder, task-verifier, task-fixer) for both Codex and Claude Code, each with defined boundaries and responsibilities.
  • Strict Workflow Loop: Enforces a clear separation between implementation, evidence gathering, and verification phases, ensuring a reproducible and auditable process.
  • Managed Guidance: Updates AGENTS.md and CLAUDE.md in place, preserving unrelated user content while integrating workflow instructions.

Maintenance & Community

The provided README does not contain specific details regarding notable contributors, sponsorships, or community channels like Discord or Slack.

Licensing & Compatibility

No license information is explicitly stated in the provided README text. Compatibility for commercial use or closed-source linking is therefore undetermined.

Limitations & Caveats

The skill's functionality is dependent on the host agent product's ability to spawn and manage subagents. If a platform cannot maintain the same builder agent across build and evidence phases, a fallback mechanism is employed. The exact behavior of subagent spawning may vary between different host environments.

Health Check
Last Commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
1
Star History
601 stars in the last 12 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy) and Didier Lopes Didier Lopes(Founder of OpenBB).

CCPlugins by brennercruvinel

0.3%
3k
Claude Code CLI enhancement framework
Created 8 months ago
Updated 5 months ago
Feedback? Help us improve.