PythonProgrammingPuzzles by microsoft

Python puzzle dataset for AI programming proficiency research

Created 4 years ago

993 stars

Top 37.4% on SourcePulse

1 Expert Loves This Project

winglian

Founder of Axolotl AI

Project Summary

This repository provides a dataset of Python programming puzzles designed to teach and evaluate AI's programming proficiency. It offers a diverse range of problems, from trivial to open research questions, with code-based specifications for unambiguous evaluation. The dataset is valuable for AI research in code generation and self-improvement.

How It Works

The core of the dataset consists of Python functions, each defining a puzzle. A puzzle is solved by providing an input that makes the function return True. This code-based specification eliminates the ambiguity of natural language descriptions and the need for separate test cases. The repository also includes example solutions generated by AI models like OpenAI's Codex, demonstrating the dataset's utility for benchmarking AI performance.
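As a minimal sketch of this format, the snippet below defines a toy puzzle and a candidate answer. The function names `sat` and `sol` and the puzzle itself are illustrative assumptions, not necessarily an entry from the dataset.

```python
def sat(s: str) -> bool:
    """Hypothetical puzzle: find a string that completes the greeting."""
    return "Hello " + s == "Hello world"


def sol() -> str:
    """A candidate answer; any value that makes sat() return True is valid."""
    return "world"


# Checking an answer is a single function call, with no separate test cases.
solved = sat(sol())
print(solved)
```

Because the puzzle is ordinary Python, any input that makes the function return True counts as a solution, regardless of how it was found.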

Quick Start & Requirements

Explore puzzles and AI solutions via the provided Binder link: https://mybinder.org/v2/gh/microsoft/PythonProgrammingPuzzles/main?labpath=intro.ipynb
Requires Python 3.x.

Highlighted Details

Dataset spans trivial programming exercises to open problems in computer science and mathematics.
Includes classic puzzles like Towers of Hanoi, verbal arithmetic, and Game of Life variants.
Features competitive programming problems from Codeforces and Olympiad problems (ICPC, IMO).
Contains open problems in graph theory (Conway's 99-graph problem) and number theory (factoring, discrete log).
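To illustrate how a number-theory problem such as factoring can be phrased in the same style, here is a hedged sketch. The puzzle, the default value of `n`, and the `brute_force` helper are hypothetical and chosen small enough to solve instantly; the dataset's own factoring puzzles may be stated differently and use far larger numbers.

```python
from typing import List


def sat(factors: List[int], n: int = 8051) -> bool:
    """Hypothetical puzzle: find two nontrivial factors of n."""
    a, b = factors
    return 1 < a < n and 1 < b < n and a * b == n


def brute_force(n: int = 8051) -> List[int]:
    """Trial division; feasible only because n is tiny in this sketch."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return [d, n // d]
        d += 1
    raise ValueError("n has no nontrivial factors")


print(sat(brute_force()))
```

The checker stays just as short when `n` is hundreds of digits long, while finding an answer becomes hard. That asymmetry is what lets one format cover both trivial exercises and open problems.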

Maintenance & Community

Maintained by Microsoft researchers; per the Health Check below, the last commit was about a year ago.
Welcomes contributions via pull requests; a CLA is required.
Follows the Microsoft Open Source Code of Conduct.

Licensing & Compatibility

The README does not explicitly state a license.
Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The dataset includes open problems, meaning some puzzles may not have known solutions.
According to the README, some of the included AI solutions were found only after many attempts, which suggests that some puzzles are difficult for current models.
No explicit license is stated, which may pose a barrier for commercial adoption.
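The "many attempts" caveat can be pictured with a small harness that tries candidate answers until one satisfies a puzzle and reports how many tries it took. This is only a sketch: `first_solution`, the toy puzzle, and the candidate range are assumptions for illustration and do not come from the repository.

```python
from typing import Callable, Iterable, Optional, Tuple


def first_solution(
    sat: Callable[[int], bool], candidates: Iterable[int]
) -> Tuple[Optional[int], int]:
    """Return the first candidate accepted by sat and the attempts used."""
    attempts = 0
    for candidate in candidates:
        attempts += 1
        try:
            if sat(candidate) is True:
                return candidate, attempts
        except Exception:
            # A candidate that crashes the puzzle counts as a failed attempt.
            continue
    return None, attempts


def sat(x: int) -> bool:
    """Hypothetical puzzle: find the positive square root of 1369."""
    return x > 0 and x * x == 1369


answer, attempts = first_solution(sat, range(100))
print(answer, attempts)
```

A model that proposes answers could be plugged in as the `candidates` iterable, and the attempt count gives a rough measure of how hard a puzzle was for that model.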

Health Check

Last Commit: 1 year ago

Responsiveness: Inactive

Pull Requests (30d): 0

Issues (30d): 0

Star History

1 star in the last 30 days

Explore Similar Projects

Awesome-Code-Intelligence by QiushiSun

Survey paper resource for neural code intelligence

Created 2 years ago

Updated 4 months ago

aoc-gpt by max-sixty

GPT-3 code-generation for Advent of Code challenges

Created 3 years ago

Updated 3 years ago

Starred by Jeff Hammerbacher (Cofounder of Cloudera), Yiran Wu (Coauthor of AutoGen), and 2 more.

aimo-progress-prize by project-numina

Code for replicating a math problem-solving solution

Created 1 year ago

Updated 1 year ago

Starred by Anton Osika (Cofounder of Lovable).

codiumai-vscode-release by Codium-ai

AI-powered coding assistant for code generation, testing, and review

Created 2 years ago

Updated 3 weeks ago

Starred by Yiran Wu (Coauthor of AutoGen).

tree-of-thought-puzzle-solver by jieyilong

Tree-of-Thoughts (ToT) framework demo for solving reasoning tasks using LLMs

Created 2 years ago

Updated 1 year ago

Starred by Johannes Hagemann (Cofounder of Prime Intellect) and Jinze Bai (Research Scientist at Alibaba Qwen).

apps by hendrycks

Dataset for measuring coding challenge competence (NeurIPS 2021)

Created 4 years ago

Updated 1 year ago

Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Eric Zhu (Coauthor of AutoGen; Research Scientist at Microsoft Research), and 7 more.

reasoning-gym by open-thought

Procedural dataset generator for reasoning models

Created 10 months ago

Updated 2 weeks ago

Java-AI-Book-Code by mark-watson

Java AI book code examples

Created 14 years ago

Updated 1 month ago

AIGoodGames by EmbraceAGI

AI games collection, blending code and text for dreamlike experiences

Created 2 years ago

Updated 2 years ago

Starred by Yaowei Zheng (Author of LLaMA-Factory), Wei-Lin Chiang (Cofounder of LMArena), and 2 more.

grade-school-math by openai

Dataset for grade school math word problems

Created 4 years ago

Updated 1 year ago

AiLearning-Theory-Applying by ben1234560

AI learning resource for theory and application

Created 5 years ago

Updated 7 months ago

Machine-Learning-Interviews by alirezadir

ML interview prep guide for landing roles at big tech

Created 4 years ago

Updated 3 days ago
