Archive·1 story·Jun 2026 – Jun 2026·Updated 19:10 UTC

Archive

Every processed story in chronological order, with the newest coverage first. Filter by tag, source, or score to drill in.

Total · all-time1

Avg score6.5▲ 0.8 vs all tags

Verdict

Steady

Stories / monthPeak 1

Jul 25Oct 25Jan 26Apr 26Jun 26

1 storyShowing 1–1Page 1 of 1

Sort

NewestScore

Density

StandardCompact

W251 story · Jun 15–21

6.5
Jun 16, 2026·Tongxu Luo, Rongsheng Wang, Jiaxi Bi·Research Papers·1 min read
GameCraft-Bench tests if agents can build full games in Godot
GameCraft-Bench exposes a concrete ceiling on current coding agents' ability to produce fully playable games, showing that even the best frontier models fall below 41.46% on a task requiring integrated scripts, scenes, assets, and runtime interaction — a gap that partial code-generation benchmarks do not capture.

GameCraft-Bench tests if agents can build full games in Godot