NEWS  /  Brief News

Chinese Data Makes Up Over 60% of Training Material in Most Domestic AI Models

Aug 14, 2025, 4:21 a.m. ET

AsianFin— The share of Chinese-language data in training datasets for most domestic AI models has surpassed 60%, with some models reaching as high as 80%, according to Liu Liehong, head of the National Data Administration.

Speaking at a State Council press conference on the “High-Quality Completion of the 14th Five-Year Plan” series, Liu said the capacity to develop and supply high-quality Chinese-language data has been steadily improving, driving rapid performance gains in China’s AI models.

Please sign in and then enter your comment