乌克兰语文本声调自动确定系统

I. Olenych, M. Prytula, O. Sinkevych, O. Khamar
{"title":"乌克兰语文本声调自动确定系统","authors":"I. Olenych, M. Prytula, O. Sinkevych, O. Khamar","doi":"10.1109/ELIT53502.2021.9501124","DOIUrl":null,"url":null,"abstract":"In the work, the system of determination of the emotional tone of Ukrainian texts based on dictionaries and rules is proposed. The developed software downloads text information in various formats and carries out tokenization and lemmatization procedures using the Python Tokenize UK and pymorphy2 libraries. The obtained array of words in the basic grammatical form was analyzed using tonal dictionaries of the Ukrainian language. A dictionary of synonyms was used to expand the vocabulary. To increase the validity of sentiment analysis, coefficients were used that take into account the various emotional loads of words of different speech parts and their dissimilar impact on the overall assessment of the text tone as well as the action of intensifying or softening adverbs. Using the means of fuzzy modeling for sentiment analysis of texts makes it possible to take into account subjective factors of the expression of human emotions and for the contribution of all emotional categories in the final evaluation of the text tone. A quantitative value of the text tone has been obtained by the method of the center of gravity for one-element sets. Based on the analysis of Ukrainian-language texts, it was found that emotionally significant words have a greater impact on the text tone value of short messages.","PeriodicalId":164798,"journal":{"name":"2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"System of Automatic Determination of Ukrainian Text Tone\",\"authors\":\"I. Olenych, M. Prytula, O. Sinkevych, O. Khamar\",\"doi\":\"10.1109/ELIT53502.2021.9501124\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the work, the system of determination of the emotional tone of Ukrainian texts based on dictionaries and rules is proposed. The developed software downloads text information in various formats and carries out tokenization and lemmatization procedures using the Python Tokenize UK and pymorphy2 libraries. The obtained array of words in the basic grammatical form was analyzed using tonal dictionaries of the Ukrainian language. A dictionary of synonyms was used to expand the vocabulary. To increase the validity of sentiment analysis, coefficients were used that take into account the various emotional loads of words of different speech parts and their dissimilar impact on the overall assessment of the text tone as well as the action of intensifying or softening adverbs. Using the means of fuzzy modeling for sentiment analysis of texts makes it possible to take into account subjective factors of the expression of human emotions and for the contribution of all emotional categories in the final evaluation of the text tone. A quantitative value of the text tone has been obtained by the method of the center of gravity for one-element sets. Based on the analysis of Ukrainian-language texts, it was found that emotionally significant words have a greater impact on the text tone value of short messages.\",\"PeriodicalId\":164798,\"journal\":{\"name\":\"2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT)\",\"volume\":\"132 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELIT53502.2021.9501124\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELIT53502.2021.9501124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文提出了基于词典和规则的乌克兰语文本情感语气确定系统。开发的软件下载各种格式的文本信息,并使用Python Tokenize UK和pymorphy2库执行标记化和词序化过程。获得的基本语法形式的单词阵列使用乌克兰语的音调字典进行分析。一本同义词词典被用来扩大词汇量。为了提高情感分析的有效性,我们使用了考虑不同词性词的不同情感负荷及其对文本语气整体评估的不同影响以及副词的强化或软化作用的系数。使用模糊建模的方法对文本进行情感分析,可以考虑到人类情感表达的主观因素,以及所有情感类别在文本语气的最终评价中的贡献。利用单元素集的重心法,得到了文本语气的定量值。通过对乌克兰语文本的分析,发现情感意义词对短信文本语气值的影响更大。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
System of Automatic Determination of Ukrainian Text Tone
In the work, the system of determination of the emotional tone of Ukrainian texts based on dictionaries and rules is proposed. The developed software downloads text information in various formats and carries out tokenization and lemmatization procedures using the Python Tokenize UK and pymorphy2 libraries. The obtained array of words in the basic grammatical form was analyzed using tonal dictionaries of the Ukrainian language. A dictionary of synonyms was used to expand the vocabulary. To increase the validity of sentiment analysis, coefficients were used that take into account the various emotional loads of words of different speech parts and their dissimilar impact on the overall assessment of the text tone as well as the action of intensifying or softening adverbs. Using the means of fuzzy modeling for sentiment analysis of texts makes it possible to take into account subjective factors of the expression of human emotions and for the contribution of all emotional categories in the final evaluation of the text tone. A quantitative value of the text tone has been obtained by the method of the center of gravity for one-element sets. Based on the analysis of Ukrainian-language texts, it was found that emotionally significant words have a greater impact on the text tone value of short messages.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信