See every side of every news story
Published loading...Updated

Popular AIs head-to-head: OpenAI beats DeepSeek on sentence-level reasoning

  • Researchers compared OpenAI's o1 and DeepSeek's R1 using a benchmark on April 17, 2025.
  • AI chatbots sometimes make up citations, so the benchmark, Reasons, tested citation accuracy and reasoning.
  • The Reasons benchmark evaluated the models on F-1 score and hallucination rate for generated responses.
  • OpenAI's o1 scored 0.65 on F-1 with 35% hallucination; DeepSeek's R1 scored 0.35 with 85% hallucination.
  • OpenAI shows an advantage, possibly from its training data, though users should still verify AI-generated citations.
Insights by Ground AI
Does this summary seem wrong?

6 Articles

All
Left
1
Center
2
Right
Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • 67% of the sources are Center
67% Center
Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

TechXplore broke the news in on Thursday, April 17, 2025.
Sources are mostly out of (0)

You have read out of your 5 free daily articles.

Join us as a member to unlock exclusive access to diverse content.