Fyself News

Google’s Gemini panicked when it played Pokémon

By user, June 17, 2025

AI companies are battling to dominate the industry, but sometimes they are also battling in Pokémon gyms.

As Google and Anthropic both study how their latest AI models navigate early Pokémon games, the results can be as amusing as they are enlightening. This time, Google DeepMind writes in a report that Gemini 2.5 Pro resorts to panic when its Pokémon are close to death. According to the report, this can cause the AI’s performance to suffer “qualitatively observable degradation in the model’s reasoning capability.”

AI benchmarking, the process of comparing the performance of different AI models, is a dubious art that often provides little context for the actual capabilities of a given model. However, some researchers believe that studying how AI models play video games could be useful (or at least kind of funny).

Over the past few months, two developers unaffiliated with Google and Anthropic have each set up their own Twitch streams, called “Gemini Plays Pokémon” and “Claude Plays Pokémon.”

Each stream displays the AI’s “reasoning” process, a natural language translation of how the AI evaluates a problem and arrives at a response. This gives viewers insight into how these models work.

Image credit: Google

The progress of these AI models is impressive, but they are still not very good at playing Pokémon. Gemini takes hundreds of hours to reason through a game that a child could complete in far less time.

What’s interesting about watching an AI navigate a Pokémon game is not how long it takes to finish, but how it behaves along the way.

“Over the course of the playthrough, Gemini 2.5 Pro gets into various situations which cause the model to simulate ‘panic,’” the report states.

This state of “panic” can degrade the model’s performance, as the AI may suddenly stop using certain tools at its disposal for a stretch of gameplay. AI does not think or experience emotions, but its actions mimic the way a human might make poor, hasty decisions under stress.

“This behavior has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring,” the report states.

Claude has also shown some strange behavior on its journey across Kanto. In one example, the AI picked up on the pattern that once all of its Pokémon have run out of health, the player character “whites out” and returns to a Pokémon Center.

When Claude got stuck in the Mt. Moon cave, it mistakenly hypothesized that if it intentionally let all of its Pokémon faint, it would be transported across the cave to the Pokémon Center in the next town.

But that’s not how the game works. When all of your Pokémon faint, you return to the Pokémon Center you used most recently, not the one that is geographically closest. Viewers watched in horror as the AI essentially tried to kill itself in the game.

Despite its shortcomings, there are some ways in which the AI can outperform human players. As of the release of Gemini 2.5 Pro, the AI can solve puzzles with impressive accuracy.

With some human assistance, the AI created agentic tools (prompted instances of Gemini 2.5 Pro geared toward specific tasks) to solve the game’s boulder puzzles and find efficient routes to a destination.

“With only a prompt describing boulder physics and a description of how to verify a valid path, Gemini 2.5 Pro is able to one-shot some of these complex boulder puzzles, which are required to progress through Victory Road,” the report says.

Because Gemini 2.5 Pro did much of the work of creating these tools on its own, Google theorizes that the current model may be capable of creating such tools without human intervention. Perhaps Gemini will even go on to build itself a “don’t panic” module.

