FYMOUS News
Exclusives

People are currently using Super Mario to benchmark AI

March 3, 2025

Did you think Pokémon was a tough benchmark for AI? One group of researchers argues that Super Mario Bros. is even tougher.

Hao AI Lab, a research organization at the University of California, San Diego, threw AI into live Super Mario Bros. games on Friday. Anthropic’s Claude 3.7 performed best, followed by Claude 3.5. Google’s Gemini 1.5 Pro and OpenAI’s GPT-4o struggled.

To be clear, it wasn’t exactly the same version of Super Mario Bros. as the original 1985 release. The game ran in an emulator and was integrated with a framework, GamingAgent, that let the AIs control Mario.

Super Mario Bros AI Benchmark
Image credit: Hao Lab

GamingAgent, which Hao developed in-house, gave the AI basic instructions, such as “move/jump left to avoid nearby obstacles or enemies.” The AI then generated inputs in the form of Python code to control Mario.
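As a rough illustration of what “inputs in the form of Python code” could look like, here is a minimal sketch. It is not GamingAgent’s actual interface: `FakeController`, `press`, `MODEL_OUTPUT` and `run_model_code` are all hypothetical names invented for this example, and the “emulator” here only records button presses.

```python
from dataclasses import dataclass, field


@dataclass
class FakeController:
    """Hypothetical stand-in for an emulator input interface.

    It only records button presses so the example can run anywhere.
    """
    log: list = field(default_factory=list)

    def press(self, button: str, frames: int = 1) -> None:
        # Record which button was held and for how many frames.
        self.log.append((button, frames))


# Hypothetical example of the kind of code a model might return.
MODEL_OUTPUT = """
controller.press("right", frames=10)
controller.press("A", frames=15)
"""


def run_model_code(code: str, controller: FakeController) -> None:
    """Execute model-generated code with access to the controller only.

    Emptying the builtins limits what the snippet can call, but this is
    NOT a real security sandbox; it is an assumption made for brevity.
    """
    exec(code, {"__builtins__": {}}, {"controller": controller})


controller = FakeController()
run_model_code(MODEL_OUTPUT, controller)
print(controller.log)
```

In a real setup the controller would forward these presses to the emulator, and the loop would repeat with fresh game state for each decision.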

Still, Hao says the game forced each model to “learn” to plan complex maneuvers and develop gameplay strategies. Interestingly, the lab found that reasoning models, which “think” through problems step by step, performed worse than “non-reasoning” models, despite being generally stronger on most benchmarks.

One of the main reasons reasoning models have a hard time playing real-time games like this is that they take time to decide on actions, the researchers say. In Super Mario Bros., timing is everything. A second can mean the difference between a safely cleared jump and a plunge to your death.
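To put the timing point in numbers, the sketch below converts a model’s decision latency into elapsed game frames. The frame rate of roughly 60 frames per second is the approximate figure for an NTSC NES; the latencies are illustrative assumptions, not measurements reported by the lab.

```python
NES_FPS = 60  # approximate NTSC NES frame rate (assumption for this sketch)


def frames_elapsed(latency_seconds: float, fps: int = NES_FPS) -> int:
    """Return how many game frames pass while the model is deciding."""
    return round(latency_seconds * fps)


# Illustrative latencies only: a fast reply, a one-second reply, a slow reply.
for latency in (0.1, 1.0, 3.0):
    print(f"{latency:.1f} s of thinking = {frames_elapsed(latency)} frames")
```

Under these assumptions, a model that deliberates for a full second acts on a game state that is about 60 frames old.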

Games have been used as AI benchmarks for decades. However, some experts have questioned the wisdom of drawing a link between AI’s gaming skills and technological advancement. Unlike the real world, games tend to be abstract and relatively simple, and they provide a theoretically infinite amount of data to train AI.

The recent flashy gaming benchmarks point to what Andrej Karpathy, a research scientist and founding member of OpenAI, called an “evaluation crisis.”

“I really don’t know what [AI] …,” he wrote in a post on X.

At the very least, you can watch AI play Mario.


