丰富词典,提高自动分类的感受,在帖子中使用相关系统

Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz
{"title":"丰富词典,提高自动分类的感受,在帖子中使用相关系统","authors":"Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz","doi":"10.1145/3330204.3330219","DOIUrl":null,"url":null,"abstract":"This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems\",\"authors\":\"Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz\",\"doi\":\"10.1145/3330204.3330219\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.\",\"PeriodicalId\":348938,\"journal\":{\"name\":\"Proceedings of the XV Brazilian Symposium on Information Systems\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the XV Brazilian Symposium on Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3330204.3330219\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the XV Brazilian Symposium on Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3330204.3330219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

这项工作提出了一项调查,以提高基于词汇的分类器SentiStrength的效率,用于与系统使用相关的帖子中的自动情感检测。为了实现这一目标,使用TF-IDF度量来选择与帖子领域相关的单词,这将丰富该工具用于生成帖子极性的字典。本文还将对词根形式词典和词根形式词典的效率进行研究。这项研究从Play商店的城市移动应用评论部分提取了2108个句子,如Waze、谷歌地图和GPS巴西。其中一个结果是,当使用丰富的字典时,分类器的准确性提高了7.3%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems
This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信