Development of an automated marker corpus of the Kazakh language

Authors

  • Z.A. Makhanova, SKSU named after M. Auezov
  • P.A. Kozhabekova, SKSU named after M. Auezov
  • M.A. Seitzhappar, SKSU named after M. Auezov
  • N.E. Sabit, SKSU named after M. Auezov

DOI:

https://doi.org/10.51301/vest.su.2021.v143.i1.06

Keywords:

corpus, labeled corpus, linguistics, corpus linguistics, corpus technology, tokenization, lemmatization.

Abstract

This article concerns the convergence of the Kazakh language with technology, since in the future the whole world around us will be closely connected to technology. New words in everyday life and new positions being formed are messengers of this transformation. Information technologies and the development of the Internet strengthen communication links between members of society. This, in turn, has led to the consolidation and accumulation of highly developed digital information. In fact, information exchange is not only a technological connection but also a complex linguistic phenomenon. Problems such as how people use linguistic means, how phrases are used, and how the structural data environment is understood have become a significant field of linguistic knowledge; from the combination of linguistics and computer science arose the subject area of computational linguistics.

Published

2021-02-28

How to Cite

Makhanova, Z., Kozhabekova, P., Seitzhappar, M., & Sabit, N. (2021). Development of an automated marker corpus of the Kazakh language. Engineering Journal of Satbayev University, 143(1), 36–39. https://doi.org/10.51301/vest.su.2021.v143.i1.06

Issue

Section

Physics and Mathematics