Sitemap

Grok 4 failed these Benchmarks : Elon lied again

5 min readJul 11, 2025

--

Press enter or click to view image in full size
Photo by Kostiantyn Li on Unsplash

But this is just the half story.

Benchmarks Grok 4 failed

1. LiveBench

Press enter or click to view image in full size

Though Grok 4 is good, its certainly not the best

2. Creative Writing benchmark

Press enter or click to view image in full size

Grok 4 is nowhere close to the best AI model here. It’s somewhere in the middle and looks quite average on creative writing.

DesignArena

Press enter or click to view image in full size

SVG generation

Press enter or click to view image in full size
Press enter or click to view image in full size
Press enter or click to view image in full size

Grok 4 is highly biased, follows nazi ideology and even sexually harasses its own CEO

Press enter or click to view image in full size
Press enter or click to view image in full size
Press enter or click to view image in full size

Even users are not happy

Press enter or click to view image in full size

Stop Buying the Hype

Smarter than humans? Please. Grok 4 isn’t even the smartest chatbot this quarter.

--

--

Responses (10)