See every side of every news story
Published loading...Updated

Meta's benchmarks for its new AI models are a bit misleading

  • Meta released two new AI models, Maverick and Scout, based on its Llama 4 model over the weekend.
  • Maverick achieved an ELO score of 1417, ranking it second on LMArena, although the version tested differed from the public release.
  • Critics noted that the version of Maverick tested on LMArena was different from the public version, creating confusion.
  • Meta acknowledged the need to clarify that Maverick was a customized model for benchmarking purposes, stating it does not align with their policy expectations.
Insights by Ground AI
Does this summary seem wrong?

14 Articles

All
Left
2
Center
1
Right
Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • 67% of the sources lean Left
67% Left
Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

TechCrunch broke the news in United States on Sunday, April 6, 2025.
Sources are mostly out of (0)

You have read out of your 5 free daily articles.

Join us as a member to unlock exclusive access to diverse content.