Crypto News

Relax, You’re Still Better at Playing ‘Doom’ Than AI

by admin
April 25, 2025
in Analysis



Despite the buzz surrounding artificial intelligence, even the most advanced vision-language models (GPT-4o, Claude 3.7 Sonnet, and Gemini 2.5 Pro) struggle with a decades-old challenge: playing the classic first-person shooter Doom.

On Thursday, a new research project introduced VideoGameBench, an AI benchmark designed to test whether state-of-the-art vision-language models can play—and beat—a suite of 20 popular video games, using only what they see on the screen.


“In our experience, current state-of-the-art VLMs substantially struggle to play video games because of high inference latency,” the researchers said. “When an agent takes a screenshot and queries the VLM about what action to take, by the time the response comes back, the game state has changed significantly and the action is no longer relevant.”

The researchers said they chose classic Game Boy and MS-DOS games for their simpler visuals and diverse input styles, such as mouse and keyboard or a game controller, which test a vision-language model’s spatial reasoning better than text-based games do.

VideoGameBench was developed by computer scientist and AI researcher Alex Zhang. The suite of games includes classics like Warcraft II, Age of Empires, and Prince of Persia.

Claude can play Pokemon, but can it play DOOM?

With a simple agent, we let VLMs play it, and found Sonnet 3.7 to get the furthest, finding the blue room!

Our VideoGameBench (twenty games from the 90s) and agent are open source so you can try it yourself now –> 🧵 pic.twitter.com/vl9NNZPBHY

— Alex Zhang (@a1zhang) April 17, 2025

According to the researchers, delayed responses are most problematic in first-person shooters like Doom. In these fast-paced environments, an enemy visible in a screenshot may already have moved—or even reached the player—by the time the model acts.
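The latency problem the researchers describe can be illustrated with a rough back-of-the-envelope sketch. It is not taken from VideoGameBench: the function names, the enemy speed, and the latency values are hypothetical, and the 35 ticks-per-second rate is used here as an assumed game-logic speed for a Doom-like shooter.

```python
# Illustrative sketch only: how far the game moves on while a model "thinks".
# All names and numbers below are assumptions, not part of VideoGameBench.

TICKS_PER_SECOND = 35  # assumed game-logic rate for a Doom-like shooter


def stale_ticks(latency_s: float, ticks_per_s: int = TICKS_PER_SECOND) -> int:
    """Whole game ticks that elapse between the screenshot and the action."""
    return int(latency_s * ticks_per_s)


def enemy_drift(speed_per_tick: float, latency_s: float,
                ticks_per_s: int = TICKS_PER_SECOND) -> float:
    """Distance an enemy travels (arbitrary units) while the model responds."""
    return speed_per_tick * stale_ticks(latency_s, ticks_per_s)


if __name__ == "__main__":
    # Hypothetical response times, from very fast to several seconds.
    for latency in (0.05, 1.0, 5.0):
        ticks = stale_ticks(latency)
        drift = enemy_drift(speed_per_tick=2.0, latency_s=latency)
        print(f"latency={latency}s  stale ticks={ticks}  enemy drift={drift}")
```

Under these assumptions, a model that needs several seconds to answer is acting on a screenshot that is more than a hundred ticks old, which is consistent with the researchers' point that the chosen action may no longer be relevant.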

For software developers, Doom has long served as a litmus test for technological capability in gaming environments. Lawnmowers, Bitcoin, and even human gut bacteria have faced down the demons from hell with varying levels of success. Now it’s AI’s turn.

“What has brought Doom out of the shadows of the 90s and into the modern light is not its riveting gameplay, but rather its appealing computational design,” MIT biotech researcher Lauren Ramlan previously told Decrypt. “Built on the id Tech 1 engine, the game was designed to require only the most modest of setups to be played.”

In addition to struggling with understanding game environments, the models often failed to perform basic in-game actions.

“We observed frequent instances where the agent had trouble understanding how its actions—such as moving right—would translate on screen,” the researchers said. “The most consistent failure across all frontier models we tested was an inability to reliably control the mouse in games like Civilization and Warcraft II, where precise and frequent mouse movements are essential.”

To better understand the limitations of current AI systems, the VideoGameBench researchers emphasized the importance of evaluating reasoning abilities in environments that are both dynamic and complex.

“Unlike extremely complicated domains like unsolved math proofs and olympiad-level math problems, playing video games is not a superhuman reasoning task, yet models still struggle to solve them,” they said.

Edited by Andrew Hayward



© 2025 Btc04.com
