Molding Shanghai Dialect Corpus Inclusive Plan 2.0 launched: corpus scale to exceed 10 PB by the end of next year

date
29/03/2026
On March 28, at the "Language Corpus Construction and Intelligent Generation Era" forum of the 2026 Global Developers Vanguard Conference, the Molding Shanghai Corpus Inclusive Plan 2.0, led by Cupath, was officially launched. According to the announcement, the plan will provide low-cost, high-quality, sustainable corpus resources to small and medium-sized enterprises, university faculty and students, and innovators and entrepreneurs. By the end of 2027, the plan aims to connect with 500 innovative entities, build 300 scarce datasets, provide corpora worth no less than 1.5 billion, and grow the total corpus to more than 10 PB.