Published 13 days ago • Updated 13 days ago

XAI Grok 4 Scoring Poorly in Realworld Tests

Summary by NextBigFuture.com

Overfitting to benchmarks is a common problem for all AI companies. XAI's Grok 4 has some problems with prompt adherence. The overfitting may have resulted from the reinforcement learning used in the reasoning model work. Kimi K2 is doing well on real-world tests. XAI will likely improve Grok 4 with new versions ...


Coverage Details

Total News Sources: 1

Leaning Left: 0

Leaning Right: 0

Center: 0

Last Updated: 12 days ago

Bias Distribution

No sources with tracked biases.


NextBigFuture.com broke the news 13 days ago, on Monday, July 14, 2025.
