← Back to KHAO

Claude · ChatGPT · China · GPT ·

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude

2 min read

Compiled by KHAO Editorial — aggregated from 1 source. See llms.txt for citation guidance.

★ Tier-1 Source

Most people know Xiaomi as the Chinese phone brand.

Key facts

Summary

The speed comes from FP4 quantization on the model's expert layers and DFlash speculative decoding, which proposes a full block of tokens in one pass instead of one at a time. A limited API trial opens June 9 through June 23, priced at 3× standard MiMo rates for roughly 10× the generation speed. Xiaomi released MiMo-V2.5-Pro-UltraSpeed, a serving mode for its trillion-parameter flagship that hits over 1,000 tokens per second—peaking near 1,200 in demos. Parameters are the internal numerical weights that define how a model thinks—the more you have, the more complex the patterns it can recognize.

Read full article at Decrypt →

#Claude #China #ChatGPT #GPT