需求的自动分类-相关内容从应用程序评论在阿拉伯语

IEEE Open Journal of the Computer Society Pub Date : 2025-03-26 DOI:10.1109/OJCS.2025.3573499

Abualsoud A. Hanani;Alaa R. Isaac;Abdallatif Abu-Issa

{"title":"需求的自动分类-相关内容从应用程序评论在阿拉伯语","authors":"Abualsoud A. Hanani;Alaa R. Isaac;Abdallatif Abu-Issa","doi":"10.1109/OJCS.2025.3573499","DOIUrl":null,"url":null,"abstract":"The market for mobile application development continues to thrive with billions of users and millions of apps. Collecting software requirements for mobile apps has to cope with this trend, so as for the software to compete in this crowded scene. Therefore, efforts to analyze mobile app reviews for requirements have shown a similar trend of increase. Among the billions of mobile users, there are hundreds of millions of Arabic-speaking users. According to our knowledge, this study would be one of the first studies in the field of mining mobile app reviews for the assistance of requirements engineering, to direct its focus on Arabic reviews. The main contribution of this study is to provide a framework for mining mobile app reviews in Arabic. A dataset of 7604 Arabic app reviews has been constructed and manually annotated by six experts. Each categorization aims at assisting one or more processes of software requirements engineering. Three configurations of deep neural networks, namely, CNN, LSTM, and BLSTM, were used to classify the app reviews into the considered categories of software requirements from the Arabic reviews. Furthermore, two word embeddings were utilized, on pre-trained models; Fasttext and Word2Vec, produced by this study. The sentimental analysis results show that the LSTM classifier with the Fasttext word embeddings gives the best F1-score, 79.17%. However, the BLSTM classifier with the fastText embeddings outperforms the other classifiers, with an F1-score of 69.83%, when used for identifying the sub-categories of the user perspective main category. The F1-score for classifying the sub-categories of the intention and topics with the LSTM and using fastText embeddings is 82.68% and 85,02%, respectively. These results outperform the other configurations of the classifiers and word embeddings. These results demonstrate the potential of our system to serve as a robust tool for automating software requirement extraction from Arabic app reviews, particularly in contexts where real-time user feedback is critical to agile development cycles.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"862-873"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015261","citationCount":"0","resultStr":"{\"title\":\"Automatic Classifying of Requirements-Relevant Contents From App Reviews in the Arabic Language\",\"authors\":\"Abualsoud A. Hanani;Alaa R. Isaac;Abdallatif Abu-Issa\",\"doi\":\"10.1109/OJCS.2025.3573499\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The market for mobile application development continues to thrive with billions of users and millions of apps. Collecting software requirements for mobile apps has to cope with this trend, so as for the software to compete in this crowded scene. Therefore, efforts to analyze mobile app reviews for requirements have shown a similar trend of increase. Among the billions of mobile users, there are hundreds of millions of Arabic-speaking users. According to our knowledge, this study would be one of the first studies in the field of mining mobile app reviews for the assistance of requirements engineering, to direct its focus on Arabic reviews. The main contribution of this study is to provide a framework for mining mobile app reviews in Arabic. A dataset of 7604 Arabic app reviews has been constructed and manually annotated by six experts. Each categorization aims at assisting one or more processes of software requirements engineering. Three configurations of deep neural networks, namely, CNN, LSTM, and BLSTM, were used to classify the app reviews into the considered categories of software requirements from the Arabic reviews. Furthermore, two word embeddings were utilized, on pre-trained models; Fasttext and Word2Vec, produced by this study. The sentimental analysis results show that the LSTM classifier with the Fasttext word embeddings gives the best F1-score, 79.17%. However, the BLSTM classifier with the fastText embeddings outperforms the other classifiers, with an F1-score of 69.83%, when used for identifying the sub-categories of the user perspective main category. The F1-score for classifying the sub-categories of the intention and topics with the LSTM and using fastText embeddings is 82.68% and 85,02%, respectively. These results outperform the other configurations of the classifiers and word embeddings. These results demonstrate the potential of our system to serve as a robust tool for automating software requirement extraction from Arabic app reviews, particularly in contexts where real-time user feedback is critical to agile development cycles.\",\"PeriodicalId\":13205,\"journal\":{\"name\":\"IEEE Open Journal of the Computer Society\",\"volume\":\"6 \",\"pages\":\"862-873\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015261\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of the Computer Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11015261/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11015261/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

移动应用开发市场继续蓬勃发展，拥有数十亿用户和数百万应用程序。收集移动应用的软件需求必须应对这一趋势，这样软件才能在这个拥挤的场景中竞争。因此，分析手机应用评论需求的努力也呈现出类似的增长趋势。在数十亿的移动用户中，有数亿讲阿拉伯语的用户。据我们所知，这项研究将是挖掘移动应用程序评论领域的首批研究之一，以帮助需求工程，将其重点放在阿拉伯语评论上。这项研究的主要贡献是为挖掘阿拉伯语的手机应用评论提供了一个框架。由6位专家构建并手工注释了7604个阿拉伯语应用程序评论的数据集。每种分类的目的都是帮助软件需求工程的一个或多个过程。使用CNN、LSTM和BLSTM三种深度神经网络配置，将应用程序评论从阿拉伯语评论中划分为考虑的软件需求类别。此外，在预训练模型上使用了两个词嵌入；Fasttext和Word2Vec，由这项研究产生。情感分析结果表明，使用Fasttext词嵌入的LSTM分类器的f1得分最高，为79.17%。然而，当用于识别用户视角主类别的子类别时，具有fastText嵌入的BLSTM分类器优于其他分类器，f1得分为69.83%。使用LSTM和使用fastText嵌入对意图和主题的子类别进行分类的f1得分分别为82.68%和85,02%。这些结果优于分类器和词嵌入的其他配置。这些结果表明，我们的系统可以作为一个强大的工具，从阿拉伯语应用程序评论中自动提取软件需求，特别是在实时用户反馈对敏捷开发周期至关重要的环境中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic Classifying of Requirements-Relevant Contents From App Reviews in the Arabic Language

The market for mobile application development continues to thrive with billions of users and millions of apps. Collecting software requirements for mobile apps has to cope with this trend, so as for the software to compete in this crowded scene. Therefore, efforts to analyze mobile app reviews for requirements have shown a similar trend of increase. Among the billions of mobile users, there are hundreds of millions of Arabic-speaking users. According to our knowledge, this study would be one of the first studies in the field of mining mobile app reviews for the assistance of requirements engineering, to direct its focus on Arabic reviews. The main contribution of this study is to provide a framework for mining mobile app reviews in Arabic. A dataset of 7604 Arabic app reviews has been constructed and manually annotated by six experts. Each categorization aims at assisting one or more processes of software requirements engineering. Three configurations of deep neural networks, namely, CNN, LSTM, and BLSTM, were used to classify the app reviews into the considered categories of software requirements from the Arabic reviews. Furthermore, two word embeddings were utilized, on pre-trained models; Fasttext and Word2Vec, produced by this study. The sentimental analysis results show that the LSTM classifier with the Fasttext word embeddings gives the best F1-score, 79.17%. However, the BLSTM classifier with the fastText embeddings outperforms the other classifiers, with an F1-score of 69.83%, when used for identifying the sub-categories of the user perspective main category. The F1-score for classifying the sub-categories of the intention and topics with the LSTM and using fastText embeddings is 82.68% and 85,02%, respectively. These results outperform the other configurations of the classifiers and word embeddings. These results demonstrate the potential of our system to serve as a robust tool for automating software requirement extraction from Arabic app reviews, particularly in contexts where real-time user feedback is critical to agile development cycles.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Open Journal of the Computer Society

CiteScore

12.60

自引率

0.00%

发文量