VideoGameBench, a new benchmark built to test how well artificial intelligence (AI) models can play video games, has revealed that even advanced models still struggle with older, simpler titles.
The benchmark evaluates vision-language models such as GPT-4o, Claude 3.7 Sonnet, and Gemini 2.5 Pro on a set of 20 popular games, including Doom, Prince of Persia, and Warcraft II.
Instead of relying on code or special inputs, these models were only given the visual game screen to decide their next move. The AI takes a screenshot, analyzes it, suggests an action, and then tries to carry it out.
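The screenshot-analyze-act loop described above can be sketched in a few lines of Python. This is only an illustration, not VideoGameBench's actual code: `ToyGame`, `toy_model`, and `play` are hypothetical names, the "screen" is a line of text rather than an image, and the "model" is a hard-coded rule standing in for a real vision-language model call.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ToyGame:
    """Hypothetical stand-in for an emulator; the 'screen' is a text line."""
    position: int = 0
    goal: int = 3
    history: List[str] = field(default_factory=list)

    def screenshot(self) -> str:
        # A real harness would return pixels; this returns a description.
        return f"player at {self.position}, goal at {self.goal}"

    def press(self, button: str) -> None:
        # Record the button and move the player accordingly.
        self.history.append(button)
        if button == "right":
            self.position += 1
        elif button == "left":
            self.position -= 1

    def done(self) -> bool:
        return self.position == self.goal


def toy_model(screen: str) -> str:
    """Hypothetical placeholder for a vision-language model call."""
    parts = screen.replace(",", "").split()
    position, goal = int(parts[2]), int(parts[5])
    return "right" if position < goal else "left"


def play(game: ToyGame, model: Callable[[str], str], max_steps: int = 10) -> int:
    """Repeat: take a screenshot, ask the model for an action, carry it out."""
    steps = 0
    while not game.done() and steps < max_steps:
        screen = game.screenshot()   # 1. capture what is on screen
        action = model(screen)       # 2. the model picks the next move
        game.press(action)           # 3. the move is sent to the game
        steps += 1
    return steps


game = ToyGame()
steps_taken = play(game, toy_model)
```

In this toy setup the game waits for each decision. A real-time game does not, which is where the time spent on each model call starts to matter.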
Each pass through this loop takes time, and the resulting delay is especially noticeable in fast-paced games like Doom, where quick reactions are key. If the AI takes too long to respond, the situation on the screen has already changed, which makes its decision outdated. For example, an enemy might have moved, or the player may already be in danger before the model responds.
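A rough back-of-the-envelope sketch shows why a slow response produces an outdated decision. All numbers here are made up for illustration (a 2-second response time, 30 frames per second, an enemy moving 5 units per second), and the helper names are hypothetical rather than part of the benchmark.

```python
def frames_missed(latency_seconds: float, frames_per_second: int) -> int:
    """Number of frames the game renders while the model is still deciding."""
    return int(latency_seconds * frames_per_second)


def enemy_position(start: float, speed: float, elapsed: float) -> float:
    """Where an enemy moving at constant speed ends up after `elapsed` seconds."""
    return start + speed * elapsed


def shot_hits(aimed_at: float, actual: float, hit_radius: float) -> bool:
    """A shot lands only if the target is still within `hit_radius` of the aim point."""
    return abs(aimed_at - actual) <= hit_radius


# Illustrative assumptions, not measurements from the benchmark.
seen_at = 10.0  # enemy position in the screenshot the model analyzed
slow_hit = shot_hits(seen_at, enemy_position(seen_at, 5.0, 2.0), hit_radius=2.0)
fast_hit = shot_hits(seen_at, enemy_position(seen_at, 5.0, 0.1), hit_radius=2.0)
missed = frames_missed(2.0, 30)
```

Under these assumptions, the slow response aims at a spot the enemy left long ago, while the fast response still lands.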
According to the research team, current models are not only slow to react but also struggle with basic tasks. They often miss items, fail to interact with the environment properly, or keep repeating the same actions without making progress.
The team used older Game Boy and MS-DOS games because their simple graphics and variety of control types provide a good way to test how well models understand space and timing.
The benchmark was developed by computer scientist Alex Zhang, who explained that these games help reveal how much work is still needed before AI can play games reliably in real time.