Skip to main content
See every side of every news story
Published loading...Updated

Xiaomi Announces Its Fastest AI Model yet with 1000 Token/second Speed

Summary by Gizmochina
Xiaomi‘s large language model family, MiMo, has officially launched UltraSpeed mode for MiMo-V2.5-Pro. Developed jointly with TileRT, the 1-trillion-parameter model can run on general-purpose GPUs while breaking the 1,000 tokens-per-second generation barrier. Xiaomi says this milestone is possible through the “ultimate co-design” of the model and its underlying system. Make a Snake game in 10 seconds To put that in perspective, MiMo-V2-Flash, an…
DisclaimerThis story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.

6 Articles

What we must remember: Xiaomi displays more than 1,000 tokens per second on a 1000 billion-parameter d的IA model. The exploit is based on standard GPUs, without a custom chip, thanks to extensive software work. Three innovations combine: the quantization FP4, the speculative decoding DFlash and the TileRT engine. Xiaomi has just pushed the race to speed in artificial intelligence. In collaboration with the TileRT team, the group unveiled MiMo-V2.…

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • There is no tracked Bias information for the sources covering this story.

Factuality Info Icon

To view factuality data please Upgrade to Premium

Ownership

Info Icon

To view ownership data please Upgrade to Vantage

MarkTechPost broke the news on Monday, June 8, 2026.
Too Big Arrow Icon
Sources are mostly out of (0)

Similar News Topics

News
Feed Dots Icon
For You
Search Icon
Search
Blindspot LogoBlindspotLocal