Google’s Gemini panicked when playing Pokémon

By user | June 17, 2025 | 4 Mins Read

AI companies are battling to dominate the industry, but sometimes they are also battling in Pokémon gyms.

As both Google and Anthropic study how their latest AI models navigate early Pokémon games, the results can be as amusing as they are enlightening. This time, Google DeepMind writes in a report that Gemini 2.5 Pro resorts to panic when its Pokémon are close to death. According to the report, this can cause the AI's performance to experience “qualitatively observable degradation in the model’s reasoning capability.”

AI benchmarking, the process of comparing the performance of different AI models, is a dubious art that provides little context for the true capabilities of a given model. However, some researchers believe that studying how AI models play video games could be useful (or at least kind of funny).

Over the past few months, two developers unaffiliated with Google and Anthropic have each set up a Twitch stream, called “Gemini Plays Pokémon” and “Claude Plays Pokémon.”

Each stream displays the AI’s “reasoning” process, a natural-language rendering of how the AI evaluates a problem and arrives at a response. This gives insight into how these models work.

Image credit: Google

The progress of these AI models is impressive, but they are still not very good at playing Pokémon. Gemini takes hundreds of hours to reason through a game that a child could complete in exponentially less time.

What’s interesting about watching an AI navigate a Pokémon game is not how long it takes to finish, but how it behaves along the way.

“Over the course of the playthrough, Gemini 2.5 Pro gets into various situations which cause the model to simulate ‘panic,’” the report states.

This state of “panic” can degrade the model’s performance: the AI may suddenly stop using certain tools available to it for a stretch of gameplay. AI does not think or experience emotion, but its actions mimic the way a human might make poor, hasty decisions under stress.

“This behavior has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring,” the report states.

Claude has also exhibited some strange behavior on its journey across Kanto. In one example, the AI picked up on the pattern that once all of its Pokémon have run out of health, the player character “whites out” and returns to a Pokémon Center.

When Claude got stuck in the Mt. Moon cave, it mistakenly hypothesized that if it intentionally let all of its Pokémon faint, it would be transported across the cave to the Pokémon Center in the next town.

But that’s not how the game works. When all of your Pokémon faint, you return to the Pokémon Center you used most recently, not the geographically closest one. Viewers watched in horror as the AI essentially tried to kill itself in the game.

Despite its shortcomings, there are some ways in which the AI outperforms human players. As of the release of Gemini 2.5 Pro, the AI can solve puzzles with impressive accuracy.

With some human assistance, the AI created agentic tools (prompted instances of Gemini 2.5 Pro geared toward specific tasks) to solve the game’s boulder puzzles and find efficient routes to a destination.

“With only a prompt describing boulder physics and a description of how to verify a valid path, Gemini 2.5 Pro is able to one-shot some of these complex boulder puzzles, which are required to progress through Victory Road,” the report says.

Because Gemini 2.5 Pro did much of the work of creating these tools on its own, Google theorizes that the current model may be capable of creating them without human intervention. Perhaps Gemini will take it upon itself to create a “don’t panic” module.

