TECHNOLOGY MEA
© Copyright TECHNOLOGY MEA. All Rights Reserved.
AI

Is Math Out? Mario Challenges AI to the Ultimate Test

Last updated: March 17, 2025 6:07 PM

Researchers have tested artificial intelligence’s ability to adapt quickly through the classic game Super Mario Bros.

The Claude 3.7 model excelled in fast responses and jump planning, while other models faced noticeable difficulties.

The experiment raised questions about how relevant game-based tests are to real-world AI capabilities.

As the quest to measure AI abilities continues, researchers are turning to an approach that goes beyond traditional mathematical tests and enters the realm of games, which are both fun and genuinely challenging.

Following Anthropic’s testing of its latest Claude 3.7 Sonnet model in Pokémon, a fresh attempt emerged using the iconic Super Mario Bros., released by Nintendo in 1985. The game now serves as a new testing platform for AI capabilities, marking a shift from classic logic puzzles to dynamic jumping challenges.

This approach comes from the Hao AI Lab at the University of California, San Diego, where researchers tested multiple advanced AI models using Super Mario Bros. as an evaluation tool. Rather than relying on traditional metrics, the team decided to assess AI in an environment that humans instinctively understand.

To carry out the experiment, they ran the game in an emulator combined with GamingAgent, a custom framework developed by the lab. The framework gave each AI model basic control instructions, guidance on how to react to situations in the game, and real-time screenshots, and the model controlled Mario’s movements by generating Python code.

Although Super Mario Bros. is a relatively simple platform game, researchers at the Hao AI Lab found that it required the AI to plan complex maneuvers and adapt rapidly. Success depended not just on computational power but also on making strategic decisions and performing precise, sequential actions in a fast-changing environment.
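The loop described above (screenshot in, Python control code out) can be sketched roughly as follows. This is a minimal illustration, not GamingAgent's actual code or API: `FakeEmulator`, `stub_model`, `run_agent`, the button names, and the instruction text are all hypothetical stand-ins, and the "model" is a hard-coded function rather than a real language model.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical button names; a real emulator binding would define its own.
VALID_BUTTONS = {"left", "right", "jump", "noop"}

# Illustrative instruction text, not the prompt used in the experiment.
INSTRUCTIONS = "If an obstacle or enemy is near, jump to dodge; otherwise move right."


@dataclass
class FakeEmulator:
    """Stand-in for a game emulator: records presses and returns a dummy frame."""
    pressed: List[str] = field(default_factory=list)
    frame_id: int = 0

    def screenshot(self) -> dict:
        # A real setup would return image data; here an enemy appears on frame 2.
        return {"frame": self.frame_id, "enemy_near": self.frame_id == 2}

    def press(self, button: str) -> None:
        if button not in VALID_BUTTONS:
            raise ValueError(f"unknown button: {button}")
        self.pressed.append(button)
        self.frame_id += 1


def stub_model(instructions: str, screenshot: dict) -> str:
    """Stand-in for a model call: returns Python source code as text."""
    return 'press("jump")' if screenshot["enemy_near"] else 'press("right")'


def run_agent(emulator: FakeEmulator,
              model: Callable[[str, dict], str],
              steps: int) -> List[str]:
    """Repeat: capture a screenshot, ask the model for code, execute that code."""
    for _ in range(steps):
        shot = emulator.screenshot()
        code = model(INSTRUCTIONS, shot)
        # Only `press` is exposed to the generated code. Emptying the builtins
        # limits accidents but is NOT a real sandbox for untrusted code.
        exec(code, {"__builtins__": {}}, {"press": emulator.press})
    return emulator.pressed


if __name__ == "__main__":
    print(run_agent(FakeEmulator(), stub_model, steps=4))
```

Swapping `stub_model` for a call to a real model is where response time starts to matter, since the game state can change between the screenshot and the moment the generated code runs.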

At the conclusion of the experiment, Claude 3.7 from Anthropic stood out as the most impressive model, displaying rapid responses and expertly timed jumps while avoiding enemies. Claude 3.5 also performed well. The real surprise was that other leading models, such as GPT-4o from OpenAI and Gemini 1.5 Pro from Google, struggled to keep pace with the demands of the game.

Researchers highlighted timing as a key factor in this test, noting that a fraction of a second could make the difference between success and failure. Models relying on deep logical reasoning tend to process information in sequential steps, which makes them slower to respond to quickly changing scenarios, leading to frequent game losses.
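To put the timing argument in concrete terms, the short calculation below converts a model's response time into the number of game frames that pass while it is still responding. It is only a back-of-the-envelope sketch: the 60 frames-per-second rate is an assumption (roughly that of the original console), the game is assumed to keep running while the model responds, and the latencies are hypothetical values, not measurements from the experiment.

```python
import math

FPS = 60  # assumed frame rate, roughly that of the original NTSC console


def frames_elapsed(latency_seconds: float, fps: int = FPS) -> int:
    """Whole game frames that pass while the model is still responding."""
    return math.floor(latency_seconds * fps)


if __name__ == "__main__":
    # Hypothetical response times, chosen only to illustrate the gap.
    for label, latency in [("fast responder", 0.5), ("step-by-step reasoner", 5.0)]:
        print(f"{label}: {latency:.1f} s is about {frames_elapsed(latency)} frames")
```

Under these assumptions, a model that needs several seconds to respond is acting on a screenshot that is hundreds of frames old, which is consistent with the researchers' point that slower, step-by-step models lose more often.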

While using games to assess AI abilities isn’t new, some experts are questioning how relevant these tests are to real-world AI. Games tend to be abstract and relatively simple, so they capture only a small part of the unpredictability and intricacy of the real world.

In this context, AI researcher Andrej Karpathy raised concerns about an “evaluation crisis” in the field, suggesting that current testing methods, especially those involving games, might not provide an accurate picture of true AI progress.

An interesting and somewhat amusing question arises: If AI struggles to navigate the Mushroom Kingdom, can we trust it to handle the complexities of the real world? While the Super Mario test is an exciting way to explore AI’s capabilities, it also serves as a reminder of the challenges that even seemingly simple tasks present. For those interested in exploring this further, the Hao AI Lab has released the GamingAgent framework as open source on GitHub.
