Chinese Internet Basic Corpus 3.0 officially released.

date
18/09/2025
On the morning of September 17th, at the Artificial Intelligence Security Governance Sub Forum of the 2025 National Cyber Security Awareness Week held in Kunming, the Chinese Internet Basic Corpus 3.0 was officially released to the public. This batch of language data has expanded the scope of high-quality Chinese website sources, strengthened the filtering of illegal and harmful information, and has a data volume of 120GB, providing credible data support for large-scale model training and the development of artificial intelligence.