See the Complete Picture.
Published loading...Updated

OpenAI Launches HealthBench, a Dataset That Benchmarks Health Care AI Models

  • On Tuesday, May 13, 2025, OpenAI released HealthBench, a large dataset in San Francisco to evaluate AI health care responses.
  • OpenAI launched HealthBench to address the challenge of fairly comparing AI models’ answers to health care questions using realistic data.
  • HealthBench contains 5,000 health conversations graded by rubrics with over 57,000 criteria developed by 262 physicians from 60 countries.
  • OpenAI’s o3 reasoning model scored highest with 60%, excelling in communication quality, though experts call for more subgroup analysis and human review.
  • HealthBench represents a major advance in AI health evaluation but cannot yet support safety claims and requires further testing before trusted deployment.
Insights by Ground AI
Does this summary seem wrong?

50 Articles

All
Left
6
Center
13
Right
6
ABC FOX MontanaABC FOX Montana
+36 Reposted by 36 other sources
Center

OpenAI Releases HealthBench Dataset to Test AI in Health Care

Key Takeaways

·Missoula, United States
Read Full Article
Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • 52% of the sources are Center
52% Center
Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

STAT broke the news in Boston, United States on Monday, May 12, 2025.
Sources are mostly out of (0)