Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing

Andres Rojas Paredes, Brenda Mareco
arXiv - CS - Software Engineering, published 2024-09-18. DOI: arxiv-2409.12012 (https://doi.org/arxiv-2409.12012)

Abstract

App reviews in mobile app stores contain useful information that is used to improve applications and promote software evolution. This information is processed by automatic tools that prioritize reviews. To carry out this prioritization, reviews are decomposed into features such as category and sentiment. A weighted function then assigns a weight to each feature, and a review ranking is calculated. Unfortunately, extracting category and sentiment from reviews requires at least one classifier trained on an annotated corpus, which makes the task computationally demanding. In this work, we therefore propose Shannon Entropy as a simple feature that can replace the standard features. Our results show that a Shannon Entropy-based ranking is better than a standard ranking according to the NDCG metric. This result remains promising even when we require fairness with respect to algorithmic bias. Finally, we highlight a computational limit that appears in the search for the best ranking.
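
To make the idea concrete, the following is a minimal sketch of an entropy-based ranking scored with NDCG. It is not the authors' implementation. The abstract does not say how entropy is computed, so the sketch assumes it is taken over the word-frequency distribution of each review, with simple whitespace tokenization. The example reviews, their relevance labels, and the linear-gain form of DCG are also assumptions made for illustration.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Shannon entropy (in bits) of the word-frequency distribution of `text`.

    Assumption: lowercased, whitespace-split tokens; the paper may use a
    different tokenization or unit.
    """
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    total = len(tokens)
    counts = Counter(tokens)
    # H = sum_i p_i * log2(1 / p_i), with p_i = count_i / total
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def dcg(relevances):
    """Discounted cumulative gain with linear gain: sum rel_i / log2(i + 1)."""
    return sum(rel / math.log2(pos + 2) for pos, rel in enumerate(relevances))


def ndcg(ranked_relevances):
    """DCG of the given order divided by the DCG of the ideal (sorted) order."""
    ideal = dcg(sorted(ranked_relevances, reverse=True))
    return dcg(ranked_relevances) / ideal if ideal > 0 else 0.0


# Hypothetical reviews paired with made-up relevance labels (higher = more useful).
reviews = [
    ("good good good", 0),
    ("app crashes when I upload a photo from the gallery", 3),
    ("please add dark mode", 2),
]

# Rank reviews by entropy alone, highest first; no trained classifier is needed.
ranked = sorted(reviews, key=lambda r: shannon_entropy(r[0]), reverse=True)
score = ndcg([rel for _, rel in ranked])
print([text for text, _ in ranked])
print(f"NDCG = {score:.3f}")
```

In this toy example the repetitive review has zero entropy and falls to the bottom of the ranking, while the longer and more varied bug report rises to the top. Real review corpora would need the evaluation described in the paper to support any such conclusion.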