×
Step to this: GPT-5 beats Pokémon Red in 6,470 steps, smashing AI record
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Sometimes it’s better to not get your steps in.

OpenAI’s GPT-5 has set a new world record for completing Pokémon Red, finishing the classic Game Boy game in just 6,470 steps—nearly three times faster than the previous record holder, ChatGPT-o3. This achievement demonstrates the rapid advancement of AI gaming capabilities, with models now completing complex video games at unprecedented speeds compared to just months ago when competing AI systems struggled to even finish the game.

The big picture: AI models are increasingly using video games as benchmarks to showcase their problem-solving capabilities, with Pokémon serving as a particularly effective test case for strategic thinking and long-term planning.

Key performance metrics: GPT-5’s record-breaking run translates to approximately seven days of gameplay, compared to over 15 days for ChatGPT-o3’s previous record of 18,184 steps.

  • This represents a dramatic improvement from earlier in the year when Gemini 2.5 and Claude 3.7 Sonnet were still racing just to complete the game at any speed.
  • The AI gaming attempts have gained a following on streaming platforms like Twitch, where channels like GPT_Plays_Pokemon attract regular viewers and subscribers.

How it worked: GPT-5 employed a strategy familiar to many childhood players—focusing on leveling up a single Pokémon while neglecting the other five party members.

  • As one Reddit user noted: “Learned that sticking to one Pokémon and hard tanking everybody is the easier way.”
  • This approach essentially brute-forced victory rather than demonstrating sophisticated team-building strategies that experienced players typically use.

What’s next: Following its Pokémon Red success, GPT-5 will now attempt to conquer Pokémon Crystal, the 2000 sequel that features double the content with both the Johto and Kanto regions.

Industry context: Companies like Anthropic, an AI research company, have specifically chosen Pokémon as a benchmark for AI capabilities, with developers explaining that GameFreak’s iconic franchise provides an ideal framework for assessing AI problem-solving skills through livestreamed gameplay demonstrations.

GPT-5 just completed Pokémon Red in a new world-record time – Claude, Gemini, and ChatGPT o3 aren’t even close

Recent News

Virginia Tech releases 7-principle AI framework for campus use

One of higher education's most comprehensive approaches to institutional AI governance.

MrBeast warns AI threatens YouTube’s creator economy (unless you’re creating with AI?)

The irony is rich: MrBeast previously tried AI thumbnails before fan backlash forced a retreat.

Microsoft commits $33B to secure 100K Nvidia chips from neocloud providers

Each GPU server rack costs $3 million, revealing the staggering economics of AI.