Skip to main content
See every side of every news story
Published loading...Updated

AI search agents often confirm what they already know instead of actually researching the web

Summary by The-decoder.com
Leading AI search agents like GPT-5.4 and Kimi K2.6 don't appear to do much actual research on established benchmarks. They mostly just use the web to confirm what they already learned during training. Researchers at the Harbin Institute of Technology found this using a new time-based benchmark called LiveBrowseComp, which only asks about events from the last 90 days. Once the models can't fall back on memory, performance falls apart and the exi…
DisclaimerThis story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.

2 Articles

Leading AI search agents such as GPT-5.4 or Kimi K2.6 seem to hardly really do research on established benchmarks, but use the web mainly to confirm knowledge already learned during the training. Researchers at the Harbin Institute of Technology prove that with a new time-bound benchmark called LiveBrowseComp, which only asks questions about events of the last 90 days. As soon as the models can no longer rely on their memory, the performance bre…

·Germany
Read Full Article
Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • There is no tracked Bias information for the sources covering this story.

Factuality Info Icon

To view factuality data please Upgrade to Premium

Ownership

Info Icon

To view ownership data please Upgrade to Vantage

the-decoder.de broke the news in Germany on Sunday, May 31, 2026.
Too Big Arrow Icon
Sources are mostly out of (0)
News
Feed Dots Icon
For You
Search Icon
Search
Blindspot LogoBlindspotLocal