poetiq-arc-agi-solver  by poetiq-ai

Advanced reasoning solver for abstract intelligence benchmarks

Created 2 months ago
1,116 stars

Top 34.3% on SourcePulse

Project Summary

This repository provides the implementation of Poetiq's state-of-the-art (SOTA) reasoning system and enables reproduction of its record-breaking submissions to the ARC-AGI-1 and ARC-AGI-2 benchmarks. It targets AI researchers and engineers who work on advanced reasoning and benchmark performance, and it gives them a direct way to study and reuse a top-ranked solution for abstract reasoning tasks.

How It Works

The project lets users replicate Poetiq's approach to abstract reasoning challenges. The algorithmic details are covered in linked blog posts rather than in this summary; the core of the repository is a reasoning engine built to solve the ARC-AGI benchmark's problem sets. The project reports that this approach surpassed previous results on the benchmarks.

Quick Start & Requirements

Highlighted Details

  • Enables reproduction of record-breaking submissions to the ARC-AGI-1 and ARC-AGI-2 benchmarks.
  • Achieves state-of-the-art (SOTA) reasoning performance on official leaderboards.
  • Provides verifiable results for public and private evaluation datasets.

Maintenance & Community

For inquiries or discussions regarding AI reasoning, contact poetiq@poetiq.ai. Users are encouraged to cite the project's launch post for academic or research use.

Licensing & Compatibility

The provided README does not specify a software license. This absence necessitates caution regarding usage, modification, and distribution, particularly for commercial applications or integration into closed-source projects.

Limitations & Caveats

The solver depends on external proprietary AI models (e.g., Gemini and OpenAI models), so it requires valid API keys and incurs the associated usage costs. The primary focus is reproducing benchmark results; general-purpose reasoning capabilities beyond the ARC-AGI scope are not detailed.
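
Because the solver needs provider API keys, a quick pre-flight check can catch a missing key before any paid requests are made. The following is a minimal sketch only: the environment variable names `GEMINI_API_KEY` and `OPENAI_API_KEY` and the helper `missing_keys` are assumptions chosen for illustration, not names confirmed by the repository, so consult the project's README for the variables it actually reads.

```python
import os
from typing import Iterable, List, Mapping, Optional

# Hypothetical variable names, for illustration only; the repository may
# expect different names or a configuration file instead.
REQUIRED_KEYS = ("GEMINI_API_KEY", "OPENAI_API_KEY")


def missing_keys(
    required: Iterable[str] = REQUIRED_KEYS,
    env: Optional[Mapping[str, str]] = None,
) -> List[str]:
    """Return the names in `required` that are unset or blank in `env`."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        print("Missing API keys:", ", ".join(missing))
    else:
        print("All required API keys are set.")
```

Passing the environment as a parameter keeps the check easy to test without touching the real process environment.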

Health Check

  • Last commit: 3 weeks ago
  • Responsiveness: Inactive
  • Pull requests (30d): 1
  • Issues (30d): 1
  • Star history: 356 stars in the last 30 days
