Close Menu
  • Academy
  • Events
  • Identity
  • International
  • Inventions
  • Startups
    • Sustainability
  • Tech
  • Spanish
What's Hot

Federal judge blocks Trump’s efforts to prevent Harvard from hosting foreign students

View the double: 15 twins who graduated from the same New York High School

Character.ai taps Meta’s former Vice President of Business Products as CEO

Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
Facebook X (Twitter) Instagram
Fyself News
  • Academy
  • Events
  • Identity
  • International
  • Inventions
  • Startups
    • Sustainability
  • Tech
  • Spanish
Fyself News
Home » Humanity has used Pokemon to benchmark the latest AI models
Startups

Humanity has used Pokemon to benchmark the latest AI models

userBy userFebruary 24, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

Humanity has used Pokemon to benchmark the latest AI models. Yes, really.

In a blog post published Monday, Anthropic said it had tested its latest model, the Claude 3.7 Sonnet, with Game Boy Classic Pokémon Red. The company equips the model with basic memory, screen pixel input, and function calls to press buttons, allowing it to navigate around the screen and play Pokemon continuously.

A unique feature of Claude 3.7 Sonnet is its ability to engage in “extended thinking.” Like Openai’s O3-Mini and Deepseek’s R1, Claude 3.7 Sonnet can “infer” through challenging problems by applying more computing and spending more time.

It was obviously convenient in Pokemon Red.

Compared to the previous version of Claude 3.0 sonnet, Claude 3.7 sonnet, unable to leave the house in Palette Town, where the story begins, fought against three Pokemon Gym leaders and won a badge.

Human Pokémon Red
Image credits: Humanity

Currently, it is not clear how much computing it takes for Claude 3.7 sonnets to reach those milestones, and how long each took. Humanity only said that the model took 35,000 actions to reach the last gym leader, the Surge.

Until some enterprising developers are found, that’s definitely not the case.

Pokemon Red is the benchmark for toys above all else. However, there is a long history of games used for AI benchmark purposes. In the past few months alone, many new apps and platforms have emerged to test the model’s gameplay abilities in titles ranging from Street Fighter to Pictory.


Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleSpaceX says the spacecraft self-destructed after the propellant leak caused a fire and Comms Blackout
Next Article Apple announces $500 million investment in the US Donald Trump News
user
  • Website

Related Posts

Character.ai taps Meta’s former Vice President of Business Products as CEO

June 20, 2025

The X app code refers to the physical card that comes to X money

June 20, 2025

Nvidia participates in the nuclear renaissance and invests in terra power that was backed by Bill Gates Back

June 20, 2025
Add A Comment
Leave A Reply Cancel Reply

Latest Posts

Federal judge blocks Trump’s efforts to prevent Harvard from hosting foreign students

View the double: 15 twins who graduated from the same New York High School

Character.ai taps Meta’s former Vice President of Business Products as CEO

Elon Musk’s AI startup Xai will increase bond yields to 12.5% ​​with a $5 billion debt hike due to weak investor demand

Trending Posts

Sana Yousaf, who was the Pakistani Tiktok star shot by gunmen? |Crime News

June 4, 2025

Trump says it’s difficult to make a deal with China’s xi’ amid trade disputes | Donald Trump News

June 4, 2025

Iraq’s Jewish Community Saves Forgotten Shrine Religious News

June 4, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Please enable JavaScript in your browser to complete this form.
Loading

Welcome to Fyself News, your go-to platform for the latest in tech, startups, inventions, sustainability, and fintech! We are a passionate team of enthusiasts committed to bringing you timely, insightful, and accurate information on the most pressing developments across these industries. Whether you’re an entrepreneur, investor, or just someone curious about the future of technology and innovation, Fyself News has something for you.

Elon Musk’s AI startup Xai will increase bond yields to 12.5% ​​with a $5 billion debt hike due to weak investor demand

Meta hires safe bipartisan executives after CEO Ilya Sutskever rejects $32 billion acquisition offer

Meta Earth Network 2.0: Pioneering Web3 Innovation with Rewards and Global Events

Top 10 Startups and High-Tech Funding News – June 19, 2025

Facebook X (Twitter) Instagram Pinterest YouTube
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
© 2025 news.fyself. Designed by by fyself.

Type above and press Enter to search. Press Esc to cancel.