Xiaomi Open-Sources First Native End-to-End Speech Model

NEWS / Brief News

Xiaomi Open-Sources First Native End-to-End Speech Model

Sep 18, 2025, 11:09 p.m. ET

AsianFin -- Xiaomi on Friday open-sourced its first native end-to-end speech model, Xiaomi-MiMo-Audio.

The model, built on an innovative pretraining architecture and trained on hundreds of millions of hours of data, achieves few-shot generalization based on in-context learning (ICL) for the first time in the speech domain and exhibits noticeable emergent behaviors during pretraining.

Unlock AI's Potentials
Company News Today

Please sign in and then enter your comment

About AsianFin Join Us Contribute Contact Us

AsianFin Newsletters

Download App

App Store

Android APK

Google Play