Python puzzle dataset for AI programming proficiency research
Top 37.7% on SourcePulse
This repository provides a dataset of Python programming puzzles designed to teach and evaluate AI's programming proficiency. It offers a diverse range of problems, from trivial to open research questions, with code-based specifications for unambiguous evaluation. The dataset is valuable for AI research in code generation and self-improvement.
How It Works
The core of the dataset consists of Python functions, each defining a puzzle. A puzzle is solved by providing an input that makes the function return True
. This code-based specification eliminates the ambiguity of natural language descriptions and the need for separate test cases. The repository also includes example solutions generated by AI models like OpenAI's Codex, demonstrating the dataset's utility for benchmarking AI performance.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
Inactive