POLITICS | 18:58 / 21.10.2025

Uzbekistan to develop Uzbek language corpus to enhance AI capabilities

The Ministry of Digital Technologies has been tasked with developing an Uzbek language corpus to enable artificial intelligence (AI) programs to use the language effectively and reflect the nation’s culture.

Photo: Presidential Press Service

On October 21, a videoconference chaired by President Shavkat Mirziyoyev was held to discuss the development of AI technologies across various regions and sectors.

Marking the 36th anniversary of Uzbek receiving the status of state language, the president congratulated the public and stressed that AI systems cannot work effectively without a full understanding of the Uzbek language.

“Let it be known – for artificial intelligence programs to work effectively, they must first and foremost have a perfect understanding of the Uzbek language,” said the president.

It was noted, however, that most Uzbek-language materials – including literature, data, articles, reports, and academic works – have not yet been digitized. Even those that exist in electronic form are scattered across various sources and poorly organized.

In this regard, the Cabinet of Ministers has been instructed to approve a schedule for digitizing data from all ministries and agencies. The Ministry of Digital Technologies, in turn, has been assigned to create the Uzbek language corpus so that AI systems can embody the country’s national culture and values.

Additionally, by the end of 2026, Uzbekistan plans to launch the Data Lake platform, which will consolidate large volumes of digital data into a single system, providing opportunities for processing, analysis, and practical application.
