Goose Pod LogoGoose Pod
‘Like watching kids games’: Magnus Carlsen roasts Elon Musk’s Grok 4 as it loses 4-0 to OpenAI’s o3 in chess tournament

‘Like watching kids games’: Magnus Carlsen roasts Elon Musk’s Grok 4 as it loses 4-0 to OpenAI’s o3 in chess tournament

2025-08-11Technology
Summary

This report from The Indian Express, published on August 8, 2025, details the performance of Elon Musk's AI model, Grok 4, in an AI chess exhibition tournament held on Google's Kaggle Game Arena. The tournament featured eight general-purpose large language models (LLMs), including competitors from OpenAI, Google, and Anthropic.

Key Findings:

Tournament Participants:

In 30 seconds

  • This report from The Indian Express, published on August 8, 2025, details the performance of Elon Musk's AI model, Grok 4, in an AI...
  • Key Findings:
  • Tournament Participants:
Read source
Published
8/8/2025
Language
Sources
1 cited
Listen
5 min listen
Published
8/8/2025
Language
Sources
1 cited
Listen
5 min listen

Quick brief

The fastest way to understand what changed, why it matters, and what to listen for in the episode.

  • This report from The Indian Express, published on August 8, 2025, details the performance of Elon Musk's AI model, Grok 4, in an AI...
  • Key Findings:
  • Tournament Participants:
  • The eight participating LLMs were:

Why this summary is trustworthy

Goose Pod anchors each episode to cited reporting so listeners can verify the source material before or after they press play.

Articles reviewed
1
Distinct sources
1
Latest cited update
8/8/2025
Topic path
Technology

Listen to the episode

Start with the audio, then open the transcript only when you want the line-by-line version.

--:--
--:--

What happened

This report from The Indian Express, published on August 8, 2025, details the performance of Elon Musk's AI model, Grok 4, in an AI chess exhibition tournament held on Google's Kaggle Game Arena. The tournament featured eight general-purpose large language models (LLMs), including competitors from OpenAI, Google,...

Key Findings:

Tournament Participants:

As Elon Musk's Grok 4 made blunder after blunder in the final, five-time world champion Magnus Carlsen was at hand to commentate -- and laugh -- at the errors. (PHOTOS: AP, Partha Paul/Express Photos) On Thursday evening, some time around the time when Elon Musk was tweeting at Microsoft’s Satya Nadella that “OpenAI is going to eat Microsoft alive”, his own AI model, Grok 4, was being humbled 4-0 by OpenAI’s o3 in an AI chess exhibition tournament on Google’s Kaggle Game Arena.

The chess tournament featuring eight general-purpose large language models (LLMs) also had Gemini 2.5 Pro (Google), Gemini 2.5 Flash (Google), o4-mini (OpenAI), Claude 4 Opus (Anthropic), DeepSeek R1 and Kimi k2 (Moonshot AI). Musk’s Grok 4 had looked like the strongest fighter in the eight-player field until it reached the final, where it made some questionable knight and bishop sacrifices and blundered away the queen in more games than one.

At multiple points, former world champion Magnus Carlsen burst out laughing on seeing Grok’s inexplicable moves or reacted with shock — complete with a palm on his face — as Grok lost all four games in the final. Carlsen was doing live commentary for the four games of the final for the Take Take Take app with grandmaster David Howell.

After Grok 4 was down 3-0, Howell told Carlsen that the fourth game would also be played, rather than the tournament ending with a 3-0 scoreline. Carlsen said that made sense because “this is like watching kids’ games. In those tournaments you always play them out.” After the final ended, Carlsen quipped: “Hope everyone feels better about their games after watching this.

” The battle for first place saw a battle between the LLMs of friends-turned-foes Altman and Musk. The duo had co-founded OpenAI a decade ago. But Musk left to launch his own rival AI company, xAI. The man who now owns X (Twitter) had also sued OpenAI last year, saying Altman violated Open AI’s original agreement which said the company would prioritise public good over profit.

Gemini 2.5 Pro ended third after defeating o4-mini. In the first game of the Grok 4 vs o3 final, Grok inexplicably sacrificed its light-squared bishop on the 8th move itself. Then, it started to simplify the game by throwing up all of its pieces for trades, which was mind-boggling since most human players won’t try to simplify their position by trading away pieces when a whole minor piece down.

Right after throwing away its bishop, Grok trades away both its knights and a pawn before offering up its queen for a trade as well. The game ended in 35 moves. What was interesting is that on the live broadcast, the LLMs were offering their thought process for the moves, which Carlsen and the world could see.

But for some of the more inexplicable moves, there were no explanations forthcoming from the AI model. Story continues below this adAfter the first game between Musk’s Grok 4 and by Altman’s o3 ended with defeat for the xAI model, Carlsen was asked to estimate the chess strength of the two LLMs.“800 for Grok and 1200 for o3,” he said.

In game 2, at one point when Grok just gifted his queen (the most powerful piece on the board) away, Carlsen said: “It is like that one guy in a club tournament who has learnt theory and literally knows nothing else. Makes the worst blunders after that.” Magnus Carlsen and David Howell react as Grok 4 blunders its queen against o3 in the final.

(Screengrab via Take Take Take YouTube)Right after that, Grok started offering up other pieces as trades, Carlsen said: “What are you doing! What happened to (chess) principles?” In game 3, Grok blundered a knight and then the queen again! At this point, Carlsen burst out in a fit of giggles and said: “It thinks it’s playing giveaway or something.

It was the only way to blunder the queen as well.”Story continues below this adThe fourth game was the hardest fought, but still ended with a win for o3. Carlsen said that watching the final was like watching an “old-school world chess championship match where both players play the same openings… like (Mikhail) Botvinnik vs (David) Bronstein or (Alexander) Alekhine vs (José Raúl) Capablanca.

”Carlsen’s verdict on chess ability of other LLM modelsAfter the second game ended, Carlsen offered his verdict on the chess-playing skills of other AI models. “Both Gemini and Mini were not very good. Claude disappointed me as well. I expected Claude to be… I’ve heard great things about Claude,” Carlsen said.

Story continues below this adAfter the third game sealed a victory of o3, Carlsen said: “o3 is fairly ruthless in conversions, it looks like a chess player. Grok looks like it learnt a few opening moves and knows the rules but not much more. Grok’s moves are chess-related moves. They just came at the wrong time and in weird sequences.

The Indian Express8/8/2025
Read original at The Indian Express

Source coverage

This report from The Indian Express, published on August 8, 2025, details the performance of Elon Musk's AI model, Grok 4, in an AI chess exhibition tournament held on Google's Kaggle Game Arena. The tournament featured eight general-purpose large language models (LLMs), including competitors from OpenAI, Google,...

Key Findings:

Deeper analysis

Full source content

As Elon Musk's Grok 4 made blunder after blunder in the final, five-time world champion Magnus Carlsen was at hand to commentate -- and laugh -- at the errors. (PHOTOS: AP, Partha Paul/Express Photos) On Thursday evening, some time around the time when Elon Musk was tweeting at Microsoft’s Satya Nadella that “OpenAI is going to eat Microsoft alive”, his own AI model, Grok 4, was being humbled 4-0 by OpenAI’s o3 in an AI chess exhibition tournament on Google’s Kaggle Game Arena.

The chess tournament featuring eight general-purpose large language models (LLMs) also had Gemini 2.5 Pro (Google), Gemini 2.5 Flash (Google), o4-mini (OpenAI), Claude 4 Opus (Anthropic), DeepSeek R1 and Kimi k2 (Moonshot AI). Musk’s Grok 4 had looked like the strongest fighter in the eight-player field until it reached the final, where it made some questionable knight and bishop sacrifices and blundered away the queen in more games than one.

At multiple points, former world champion Magnus Carlsen burst out laughing on seeing Grok’s inexplicable moves or reacted with shock — complete with a palm on his face — as Grok lost all four games in the final. Carlsen was doing live commentary for the four games of the final for the Take Take Take app with grandmaster David Howell.

After Grok 4 was down 3-0, Howell told Carlsen that the fourth game would also be played, rather than the tournament ending with a 3-0 scoreline. Carlsen said that made sense because “this is like watching kids’ games. In those tournaments you always play them out.” After the final ended, Carlsen quipped: “Hope everyone feels better about their games after watching this.

” The battle for first place saw a battle between the LLMs of friends-turned-foes Altman and Musk. The duo had co-founded OpenAI a decade ago. But Musk left to launch his own rival AI company, xAI. The man who now owns X (Twitter) had also sued OpenAI last year, saying Altman violated Open AI’s original agreement which said the company would prioritise public good over profit.

Gemini 2.5 Pro ended third after defeating o4-mini. In the first game of the Grok 4 vs o3 final, Grok inexplicably sacrificed its light-squared bishop on the 8th move itself. Then, it started to simplify the game by throwing up all of its pieces for trades, which was mind-boggling since most human players won’t try to simplify their position by trading away pieces when a whole minor piece down.

Right after throwing away its bishop, Grok trades away both its knights and a pawn before offering up its queen for a trade as well. The game ended in 35 moves. What was interesting is that on the live broadcast, the LLMs were offering their thought process for the moves, which Carlsen and the world could see.

But for some of the more inexplicable moves, there were no explanations forthcoming from the AI model. Story continues below this adAfter the first game between Musk’s Grok 4 and by Altman’s o3 ended with defeat for the xAI model, Carlsen was asked to estimate the chess strength of the two LLMs.“800 for Grok and 1200 for o3,” he said.

In game 2, at one point when Grok just gifted his queen (the most powerful piece on the board) away, Carlsen said: “It is like that one guy in a club tournament who has learnt theory and literally knows nothing else. Makes the worst blunders after that.” Magnus Carlsen and David Howell react as Grok 4 blunders its queen against o3 in the final.

(Screengrab via Take Take Take YouTube)Right after that, Grok started offering up other pieces as trades, Carlsen said: “What are you doing! What happened to (chess) principles?” In game 3, Grok blundered a knight and then the queen again! At this point, Carlsen burst out in a fit of giggles and said: “It thinks it’s playing giveaway or something.

It was the only way to blunder the queen as well.”Story continues below this adThe fourth game was the hardest fought, but still ended with a win for o3. Carlsen said that watching the final was like watching an “old-school world chess championship match where both players play the same openings… like (Mikhail) Botvinnik vs (David) Bronstein or (Alexander) Alekhine vs (José Raúl) Capablanca.

”Carlsen’s verdict on chess ability of other LLM modelsAfter the second game ended, Carlsen offered his verdict on the chess-playing skills of other AI models. “Both Gemini and Mini were not very good. Claude disappointed me as well. I expected Claude to be… I’ve heard great things about Claude,” Carlsen said.

Story continues below this adAfter the third game sealed a victory of o3, Carlsen said: “o3 is fairly ruthless in conversions, it looks like a chess player. Grok looks like it learnt a few opening moves and knows the rules but not much more. Grok’s moves are chess-related moves. They just came at the wrong time and in weird sequences.

How this page is built

Goose Pod turns cited reporting into a public episode summary first, then pairs that summary with audio playback so listeners can check the source material before they decide how deeply to engage.

The goal is to make this page useful as a news landing page first, while still giving listeners transcript access, related episodes, and direct links back to the original publishers.

Cited sources

More on this topic

About this page

Goose Pod turns cited reporting into a public episode summary first, then pairs that summary with audio playback so listeners can compare the recap with the underlying source material.

This page reviewed 1 article across 1 source, with the latest cited update on 8/8/2025.

Explore related pages