Published 4 days ago • loading... • Updated 2 days ago

Xiaomi Announces Its Fastest AI Model yet with 1000 Token/second Speed

Xiaomi‘s large language model family, MiMo, has officially launched UltraSpeed mode for MiMo-V2.5-Pro. Developed jointly with TileRT, the 1-trillion-parameter model can run on general-purpose GPUs while breaking the 1,000 tokens-per-second generation barrier. Xiaomi says this milestone is possible through the “ultimate co-design” of the model and its underlying system. Make a Snake game in 10 seconds To put that in perspective, MiMo-V2-Flash, an…

This story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.

6 Articles

GIGAZINE

'MiMo-V2.5-Pro-UltraSpeed' has emerged, capable of running a trillion-parameter model at a blazing speed of over 1000 tokens per second, with the underlying model released as open source.

The news blog specialized in Japanese culture, odd news, gadgets and all other funny stuffs. Updated everyday.

2 days ago

Read Full Article

TechGenyz

Xiaomi MiMo AI Model Hits 1,000 Tokens Per Second With Huge Breakthroughs

Xiaomi MiMo: Cerebras needed a wafer-scale chip the size of a dinner plate. Groq built custom silicon with on-chip SRAM from the ground up. Xiaomi used a standard eight-GPU server node — the kind any developer can rent from a cloud provider today. Xiaomi, in collaboration with inference partner TileRT, has officially launched MiMo-V2.5-Pro-UltraSpeed, achieving […]

3 days ago

Read Full Article

Coin Academy

Xiaomi Reaches 1,000 Tokens per Second on an AI Model with 1,000 Billion Parameters

What we must remember: Xiaomi displays more than 1,000 tokens per second on a 1000 billion-parameter d的IA model. The exploit is based on standard GPUs, without a custom chip, thanks to extensive software work. Three innovations combine: the quantization FP4, the speculative decoding DFlash and the TileRT engine. Xiaomi has just pushed the race to speed in artificial intelligence. In collaboration with the TileRT team, the group unveiled MiMo-V2.…

3 days ago

Read Full Article

Gizmochina

Xiaomi announces its fastest AI model yet with 1000 token/second speed

3 days ago

Read Full Article

decrypt.co

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude

Xiaomi's MiMo-V2.5-Pro-UltraSpeed blows past the speed threshold custom silicon companies spent years building toward—on regular GPUs.

3 days ago·New York, United States

Read Full Article

MarkTechPost

Xiaomi MiMo and TileRT Push a 1-Trillion-Parameter Model Past 1000 Tokens Per Second on Commodity GPUs

Inference speed is becoming a competitive metric for large language models. Xiaomi’s MiMo team just released MiMo-V2.5-Pro-UltraSpeed, built in collaboration with the TileRT systems group. It decodes faster than 1000 tokens per second on a 1-trillion-parameter model. Xiaomi team describes this as a first at trillion-parameter scale. Demos show generation peaks near 1200 tokens per second. The notable part is the hardware: it runs on commodity GP…

4 days ago

Read Full Article

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/year

Coverage Details

Total News Sources6

Leaning Left0Leaning Right0Center0Last Updated2 days agoBias Distribution

No sources with tracked biases.

Bias Distribution

There is no tracked Bias information for the sources covering this story.

Untracked bias

Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

MarkTechPost broke the news 4 days ago on Monday, June 8, 2026.

Sources are mostly out of (0)

Xiaomi Announces Its Fastest AI Model yet with 1000 Token/second Speed

6 Articles

6 Articles

'MiMo-V2.5-Pro-UltraSpeed' has emerged, capable of running a trillion-parameter model at a blazing speed of over 1000 tokens per second, with the underlying model released as open source.

Xiaomi MiMo AI Model Hits 1,000 Tokens Per Second With Huge Breakthroughs

Xiaomi Reaches 1,000 Tokens per Second on an AI Model with 1,000 Billion Parameters

Xiaomi announces its fastest AI model yet with 1000 token/second speed

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude

Xiaomi MiMo and TileRT Push a 1-Trillion-Parameter Model Past 1000 Tokens Per Second on Commodity GPUs

Coverage Details

Bias Distribution

Factuality

Ownership

Similar News Topics

Similar News Topics

Xiaomi Announces Its Fastest AI Model yet with 1000 Token/second Speed

6 Articles

6 Articles

'MiMo-V2.5-Pro-UltraSpeed' has emerged, capable of running a trillion-parameter model at a blazing speed of over 1000 tokens per second, with the underlying model released as open source.

Xiaomi MiMo AI Model Hits 1,000 Tokens Per Second With Huge Breakthroughs

Translate IconXiaomi Reaches 1,000 Tokens per Second on an AI Model with 1,000 Billion Parameters

Xiaomi announces its fastest AI model yet with 1000 token/second speed

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude

Xiaomi MiMo and TileRT Push a 1-Trillion-Parameter Model Past 1000 Tokens Per Second on Commodity GPUs

Coverage Details

Bias Distribution Too Big Arrow Icon

Factuality Info Icon

Ownership

Similar News Topics

Similar News Topics

Xiaomi Reaches 1,000 Tokens per Second on an AI Model with 1,000 Billion Parameters

Bias Distribution

Factuality