KARAKALPAK AND UZBEK LANGUAGES PARALLEL CORPUS AND ITS SYNTACTIC ADAPTATION
Keywords:
Parallel corpus, NLP, Machine translationAbstract
This article examines the creation and syntactic adaptation of a parallel corpus for Karakalpak and Uzbek languages. It describes the role of parallel corpora in Natural Language Processing (NLP), machine translation, and linguistic research. The study focuses on the PARATRANSLATOR platform, which integrates multilingual corpora and enables contextual translations. It analyzes morphological and syntactic tagging, alignment techniques, and structural correspondences between the two languages. Similarities and differences in grammar, phonology, and lexicon between Karakalpak and Uzbek are highlighted. The research emphasizes the importance of such resources for preserving low-resource languages and improving AI-based translation technologies.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.