Authors: A. Adamov, Gozel Khasanova
Venue: 2022 IEEE 16th International Conference on Application of Information and Communication Technologies (AICT)
Publication date: 2022-10-12
DOI: https://doi.org/10.1109/AICT55583.2022.10013645
Quantitative and Semantic Analysis of Texts in Turkic Languages using Universal Declaration of Human Rights (UDHR) as a Corpus
Thanks to the Web, ubiquitous digital technologies, and the growing use of digital environments for work, entertainment, education, and other activities, huge amounts of textual data are generated and made available online. Text is the most informative data type and, at the same time, the most complex for machines to comprehend. Text Analytics is a field that draws on a number of computer science disciplines to process textual data and transform it into a machine-readable format from which another field of study, Natural Language Processing, can extract meaning. This paper applies a broad variety of statistical analysis methods to corpora of several Turkic languages, using the Universal Declaration of Human Rights (UDHR) as the corpus. Quantitative Text Analysis is a research area focused on understanding human language through statistics and numbers. Because language is the most effective tool for describing the social world, Quantitative Text Analysis enables social exploration of the real world at scale.
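The abstract does not name the specific statistics the authors computed. As a rough illustration of what quantitative text analysis can involve, the sketch below assumes a few common measures (token count, vocabulary size, type-token ratio, mean word length, most frequent words). The function names `tokenize` and `text_stats` are hypothetical, and the sample sentence is the opening of UDHR Article 1 in English, used only as a placeholder rather than the paper's Turkic-language data.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (runs of Unicode letters).

    Assumption: plain str.lower() is good enough for a sketch. Turkic
    alphabets that use dotted/dotless i would need locale-aware casing.
    """
    return re.findall(r"[^\W\d_]+(?:['’][^\W\d_]+)*", text.lower())


def text_stats(text, top_n=3):
    """Return simple descriptive statistics for one text."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    n_tokens = len(tokens)
    n_types = len(counts)
    return {
        "tokens": n_tokens,                      # total word occurrences
        "types": n_types,                        # distinct word forms
        "type_token_ratio": n_types / n_tokens if n_tokens else 0.0,
        "mean_word_length": (
            sum(len(t) for t in tokens) / n_tokens if n_tokens else 0.0
        ),
        "top_words": counts.most_common(top_n),  # most frequent word forms
    }


if __name__ == "__main__":
    # Placeholder sample (English), not the paper's corpus.
    sample = "All human beings are born free and equal in dignity and rights."
    for name, value in text_stats(sample).items():
        print(f"{name}: {value}")
```

Running the same function over the UDHR text in each language would give one comparable row of statistics per language. Because the translations are parallel, differences in such figures would reflect properties of the languages rather than differences in content.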