Blog
May. 12, 2026
GPT 5.5 high Solves First Instance!
GPT 5.5 with high reasoning becomes the first model to fully resolve a ProgramBench task instance.
May. 4, 2026
Is ProgramBench Impossible?
ProgramBench is hard by design, but it is solvable by construction. We address common concerns about feasibility.
May. 4, 2026
ProgramBench Released
Introducing ProgramBench with 200 task instances across C/C++, Go, and Rust.