Skip to main content
See every side of every news story
Published loading...Updated

Google Unveils DiffusionGemma, an AI Model that Breaks Free of Left-to-Right Processing

Summary by Computerworld
Extremely powerful large language models (LLMs) still operate as though they’re typing on a keyboard, processing workloads in a simple left-to-right fashion. But in locally-run, single-user scenarios, this sequential processing can leave graphics processing units (GPUs) and tensor processing units (TPUs) underutilized. Google is betting that DiffusionGemma can get around this bottleneck. The new experimental open model generates text “exceptiona…
DisclaimerThis story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.

4 Articles

More than 1,000 tokens per second on a single H100 card, the accelerator that Nvidia sells to data centers, and about 700 on a RTX 5090, its high-end gaming card. This is the speed that Google DeepMind announces for DiffusionGemma, its new open AI model, about four times what classic Gemma models produce of comparable size. All the difference is played in how to generate the text. The usual language models are self-regressive: they write from le…

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • There is no tracked Bias information for the sources covering this story.

Factuality Info Icon

To view factuality data please Upgrade to Premium

Ownership

Info Icon

To view ownership data please Upgrade to Vantage

Korben broke the news on Friday, June 12, 2026.
Too Big Arrow Icon
Sources are mostly out of (0)

Similar News Topics

News
Feed Dots Icon
For You
Search Icon
Search
Blindspot LogoBlindspotLocal