© Copyright TECHNOLOGY MEA. All Rights Reserved.
AI

Is Math Out? Mario Challenges AI to the Ultimate Test

Last updated: March 17, 2025 6:07 PM
Researchers have tested artificial intelligence’s ability to adapt quickly through the classic game Super Mario Bros.

The Claude 3.7 model excelled in fast responses and jump planning, while other models faced noticeable difficulties.

The experiment raised questions about how relevant game-based tests are to real-world AI capabilities.

As the quest to measure AI abilities continues, researchers are turning to an approach that goes beyond traditional mathematical tests and into the realm of games, which are both fun and demanding.

After Anthropic tested its latest model, Claude 3.7 Sonnet, on Pokémon, a new effort has turned to the iconic Super Mario Bros., released by Nintendo in 1985. The game now serves as a testing platform for AI capabilities, marking a shift from classic logic puzzles to dynamic jumping challenges.

The approach comes from the Hao AI Lab at the University of California, San Diego, where researchers tested multiple advanced AI models using Super Mario Bros. as an evaluation tool. Rather than using traditional metrics, the team decided to assess AI in an environment that humans instinctively understand.

To carry out the experiment, they ran the game in an emulator connected to GamingAgent, a custom framework developed by the lab. The framework gave each AI model basic control instructions, conditional guidance (such as dodging when an obstacle or enemy is near), and real-time screenshots of the game; the model then controlled Mario’s movements by generating Python code. Although Super Mario Bros. is a relatively simple platform game, the lab’s researchers found that it required the AI to plan complex maneuvers and adapt rapidly. Success depended not just on computational power but also on making strategic decisions and performing precise, sequential actions in a fast-changing environment.
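The observe, decide, act cycle described above can be illustrated with a minimal sketch. It is not GamingAgent's actual code or API: `ToyLevel`, `stub_model`, `run_episode`, and the instruction text are all hypothetical names invented for illustration, the "screenshot" is a small dictionary rather than pixels, and the model is a hard-coded stub standing in for a real LLM call.

```python
from dataclasses import dataclass, field

# Hypothetical instruction text; the lab's real prompts are not reproduced here.
INSTRUCTIONS = "If an obstacle is directly ahead, jump; otherwise move right."


@dataclass
class ToyLevel:
    """Hypothetical stand-in for the emulator: a 1-D track with obstacles."""
    length: int = 10
    obstacles: frozenset = frozenset({3, 7})
    position: int = 0
    alive: bool = True
    log: list = field(default_factory=list)

    def screenshot(self) -> dict:
        # A real agent would receive pixels; here the observation is a summary.
        return {"position": self.position,
                "obstacle_ahead": (self.position + 1) in self.obstacles}

    def move_right(self) -> None:
        self.log.append("move_right")
        self.position += 1
        if self.position in self.obstacles:
            self.alive = False  # walked into an obstacle

    def jump_right(self) -> None:
        self.log.append("jump_right")
        self.position += 2  # a jump skips over the next tile
        if self.position in self.obstacles:
            self.alive = False  # landed on an obstacle


def stub_model(instructions: str, observation: dict) -> str:
    """Placeholder for an LLM call: returns Python source code as a string."""
    return "jump_right()" if observation["obstacle_ahead"] else "move_right()"


def run_episode(level: ToyLevel, model=stub_model, max_steps: int = 50) -> bool:
    """Loop: observe the game, ask the model for code, execute that code."""
    # Only these two controls are exposed to the generated code.
    actions = {"move_right": level.move_right, "jump_right": level.jump_right}
    for _ in range(max_steps):
        if not level.alive or level.position >= level.length:
            break
        code = model(INSTRUCTIONS, level.screenshot())
        exec(code, {"__builtins__": {}}, actions)
    return level.alive and level.position >= level.length


if __name__ == "__main__":
    level = ToyLevel()
    print("Level cleared:", run_episode(level))
    print("Actions taken:", level.log)
```

Swapping `stub_model` for a function that calls a real model would keep the loop unchanged, which is the point of the sketch: the framework supplies observations and instructions, and the model's only output is code.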

By the end of the experiment, Anthropic’s Claude 3.7 stood out as the most impressive, displaying rapid responses and well-timed jumps while avoiding enemies. Claude 3.5 also performed well, whereas GPT-4o from OpenAI and Gemini 1.5 Pro from Google struggled to keep pace with the demands of the game. The real surprise was that models designed for step-by-step logical reasoning fared worse than models without that design.

Researchers highlighted timing as a key factor in this test, noting that a fraction of a second could make the difference between success and failure. Models relying on deep logical reasoning tend to process information in sequential steps, which makes them slower to respond to quickly changing scenarios, leading to frequent game losses.
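A rough back-of-the-envelope sketch shows why response time matters so much in a game that keeps running while the model thinks. Every number here is an assumption chosen for illustration, not a measurement from the experiment: a frame rate of 60 frames per second, an enemy 64 pixels away closing at 1 pixel per frame, and two hypothetical model latencies. The function names are likewise made up.

```python
FPS = 60  # assumption: an approximate frame rate, not a figure from the article


def frames_missed(latency_s: float, fps: int = FPS) -> int:
    """Number of frames the game advances while the model is still deciding."""
    return int(latency_s * fps)


def reacts_in_time(latency_s: float,
                   enemy_distance_px: float,
                   enemy_speed_px_per_frame: float,
                   fps: int = FPS) -> bool:
    """True if the model's answer arrives before the enemy reaches Mario."""
    frames_until_contact = enemy_distance_px / enemy_speed_px_per_frame
    return frames_missed(latency_s, fps) < frames_until_contact


if __name__ == "__main__":
    # Hypothetical latencies: a quick responder and a slow, deliberate one.
    for name, latency in [("fast model", 0.5), ("slow model", 3.0)]:
        ok = reacts_in_time(latency, enemy_distance_px=64,
                            enemy_speed_px_per_frame=1)
        print(f"{name}: {frames_missed(latency)} frames elapsed, "
              f"reacts in time: {ok}")
```

Under these assumed values, the enemy arrives after 64 frames, so a model that answers in half a second still has room to act while one that deliberates for three seconds does not.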

While using games to assess AI abilities isn’t new, some experts are questioning how relevant these tests are to real-world AI. Games tend to be abstract and relatively simple, lacking the unpredictability and intricacy of the actual world.

In this context, AI researcher Andrej Karpathy raised concerns about an “evaluation crisis” in the field, suggesting that current testing methods, especially those involving games, might not provide an accurate picture of true AI progress.

An interesting and somewhat amusing question arises: If AI struggles to navigate the Mushroom Kingdom, can we trust it to handle the complexities of the real world? While the Super Mario test is an exciting way to explore AI’s capabilities, it also serves as a reminder of the challenges that even seemingly simple tasks present. For those interested in exploring this further, the Hao AI lab has made the GamingAgent framework open-source on GitHub.
