developers.openai.com
Derrick Choi from OpenAI ran GPT-5.3-Codex continuously for roughly 25 hours on a single task: building a full design tool from scratch. The run consumed 13 million tokens and produced around 30,000 lines of code, including real-time collaboration, a prototype mode, and multi-format export. The trick wasn't model intelligence alone but a set of markdown files acting as durable memory: frozen specs, milestone plans, operational runbooks, and live decision logs that kept the agent coherent across the extended run. METR's benchmarks show the complexity of tasks frontier agents can complete doubling every seven months, and this cookbook entry reads less like a tutorial than a proof point for that trend. The gap between coding assistant and autonomous teammate keeps narrowing.
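The "durable memory" idea is easy to prototype. As a rough sketch (not Choi's actual setup; the file name, helper, and entry layout here are assumptions), an agent harness could maintain an append-only markdown decision log it re-reads at the start of each work session:

```python
from datetime import datetime, timezone
from pathlib import Path

def log_decision(log_path: Path, decision: str, rationale: str) -> None:
    """Append a timestamped entry to a markdown decision log.

    Hypothetical helper: the cookbook entry describes live decision
    logs kept as markdown files, but this exact layout is a guess.
    """
    stamp = datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M UTC")
    entry = f"\n## {stamp}\n- **Decision:** {decision}\n- **Why:** {rationale}\n"
    if not log_path.exists():
        log_path.write_text("# Decision Log\n")  # initialize on first use
    with log_path.open("a") as f:
        f.write(entry)  # append-only: past decisions are never rewritten
```

Because the log is append-only plain markdown, the agent can cheaply re-ingest it after a context reset, which is presumably what keeps a 25-hour run from drifting.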
