Close Menu
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
What's Hot

Hackers use GitHub repository to host Amadey Malware and Data Stealers and bypass filters

Openai launches a general purpose agent with ChatGpt

Rivian will resume work at the Georgia factory, emails show

Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
Facebook X (Twitter) Instagram
Fyself News
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
Fyself News
Home » Google’s Gemini panic when playing Pokemon
Startups

Google’s Gemini panic when playing Pokemon

userBy userJune 17, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

AI companies are fighting to dominate the industry, but sometimes they are also fighting at Pokemon gyms.

As both Google and humanity are studying how modern AI models navigate early Pokemon games, the results can be just as interesting as enlightening. This time, Google Deepmind writes in a report that the Gemini 2.5 Pro relies on panic when Pokémon is nearing death. This means that AI performance, according to the report, may experience “qualitatively observable degradation in the model’s inference ability.”

AI benchmarks – or the process of comparing the performance of different AI models – are suspicious art that provides little context for the actual functionality of a particular model. However, some researchers believe it may be useful to study how AI models play video games (or at least a kind of funny).

Over the past few months, two non-related developers of Google and humanity have set up their own Twitch streams, called “Gemini Plays Pokémon” and “Claude Plays Pokémon.”

Each stream displays the AI’s “inference” process, or a natural language translation of how the AI ​​evaluates the problem and reaches the response. It gives insight into how these models work.

Image credit: Google

The progress of these AI models is impressive, but I’m not very good at playing Pokemon yet. It takes hundreds of hours through a game that a Gemini can complete in an exponentially short time.

What’s interesting about watching AI navigate Pokemon games is not the time to complete, but how it behaves along the way.

“In the playthrough process, the Gemini 2.5 Pro falls into a variety of situations and simulates ‘panic’ in the model,” the report states.

This “panic” state can cause a deterioration in model performance as AI can suddenly stop using certain tools that are free to use for a set of gameplay. AI does not think or experience emotions, but their actions mimic the way humans make poor and hurry decisions under stress.

“This behavior occurred in enough individual instances enough that Twitch chat members were actively aware of when they were occurring,” the report states.

Claude also showed some strange behavior on his journey across Kant. In one example, AI took up the pattern in which once all Pokemon have exhausted their health, the player’s character “whiteouts” and returns to the Pokemon Center.

When Claude gets stuck in Moon Cave Mountain, he mistakenly hypothesized that if it intentionally disappoints all of its Pokemon, it will be transported across the cave to the next town’s Pokemon Center.

But that’s not how the game works. When all Pokemon die, you will return, not geographically, not the closest to the recently used Pokemon Center. Viewers were watching in horror as the AI ​​essentially tried to kill itself in the game.

Despite its drawbacks, there are several ways in which AI is better than human players. At the time of Gemini 2.5 Pro’s release, AI can solve puzzles with impressive accuracy.

AI, with the help of several human beings, created an agent tool – prompted an instance of Gemini 2.5 Pro for a specific task – solved the game’s boulder puzzle and found an efficient route to reach its destination.

“Just explaining Boulder’s physics and how to verify valid paths, the Gemini 2.5 Pro can take a shot of some of these complex boulder puzzles needed to go on the winning path,” the report says.

Gemini 2.5 Pro did a lot of work to create these tools on its own, so Google theorizes that current models can create these tools without human intervention. Perhaps Gemini will treat themselves to create a “don’t panic” module.


Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleOpenAI’s AI Technology to Revolutionize Military Operations?
Next Article EVs dominate the index of most American-made cars, but it’s not just Tesla
user
  • Website

Related Posts

Openai launches a general purpose agent with ChatGpt

July 17, 2025

Rivian will resume work at the Georgia factory, emails show

July 17, 2025

Boulevard raises $80 million to power the self-care boom driven by Botox and GLP-1 surges

July 17, 2025
Add A Comment
Leave A Reply Cancel Reply

Latest Posts

Hackers use GitHub repository to host Amadey Malware and Data Stealers and bypass filters

Openai launches a general purpose agent with ChatGpt

Rivian will resume work at the Georgia factory, emails show

Boulevard raises $80 million to power the self-care boom driven by Botox and GLP-1 surges

Trending Posts

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Please enable JavaScript in your browser to complete this form.
Loading

Welcome to Fyself News, your go-to platform for the latest in tech, startups, inventions, sustainability, and fintech! We are a passionate team of enthusiasts committed to bringing you timely, insightful, and accurate information on the most pressing developments across these industries. Whether you’re an entrepreneur, investor, or just someone curious about the future of technology and innovation, Fyself News has something for you.

BREAKING: TwinH Set to Revolutionize Legal Processes – Presented Today at ICEX Forum 2025

Building AGI: Zuckerberg Commits Billions to Meta’s Superintelligence Data Center Expansion

ICEX Forum 2025 Opens: FySelf’s TwinH Showcases AI Innovation

The Future of Process Automation is Here: Meet TwinH

Facebook X (Twitter) Instagram Pinterest YouTube
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
© 2025 news.fyself. Designed by by fyself.

Type above and press Enter to search. Press Esc to cancel.