Algorithms Comparison for Non-Requirements Classification using the Semantic Feature of Software Requirement Statements

Achmad An'im Fahmi, D. Siahaan
{"title":"Algorithms Comparison for Non-Requirements Classification using the Semantic Feature of Software Requirement Statements","authors":"Achmad An'im Fahmi, D. Siahaan","doi":"10.12962/J20882033.V31I3.7606","DOIUrl":null,"url":null,"abstract":"Noise in a Software Requirements Specification (SRS) is an irrelevant requirements statement or a non-requirements statement. This can be confusing to the reader and can have negative repercussions in later stages of software development. This study proposes a classification model to detect the second type of noise, the non-requirements statement. The classification model that is built is based on the semantic features of the non-requirements statement. This research also compares the five best-supervised machine learning methods to date, which are support vector machine (SVM), naive Bayes (NB), random forest (RF), k-nearest neighbor (kNN), and Decision Tree. This comparison aimed to determine which method can produce the best non-requirements classification, model. The comparison shows that the best model is produced by the SVM method with an average accuracy of 0.96. The most significant features in this non-requirement classification model are the requirements statement or non-requirements, id statement, normalized mean value, standard deviation value, similarity variant value, standard deviation normalization value, maximum normalized value, similarity variant normalization value, value Bad NN, mean value, number of sentences, bad VB score, and project id.","PeriodicalId":14549,"journal":{"name":"IPTEK: The Journal for Technology and Science","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPTEK: The Journal for Technology and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12962/J20882033.V31I3.7606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Noise in a Software Requirements Specification (SRS) is an irrelevant requirements statement or a non-requirements statement. This can be confusing to the reader and can have negative repercussions in later stages of software development. This study proposes a classification model to detect the second type of noise, the non-requirements statement. The classification model that is built is based on the semantic features of the non-requirements statement. This research also compares the five best-supervised machine learning methods to date, which are support vector machine (SVM), naive Bayes (NB), random forest (RF), k-nearest neighbor (kNN), and Decision Tree. This comparison aimed to determine which method can produce the best non-requirements classification, model. The comparison shows that the best model is produced by the SVM method with an average accuracy of 0.96. The most significant features in this non-requirement classification model are the requirements statement or non-requirements, id statement, normalized mean value, standard deviation value, similarity variant value, standard deviation normalization value, maximum normalized value, similarity variant normalization value, value Bad NN, mean value, number of sentences, bad VB score, and project id.
基于软件需求语句语义特征的非需求分类算法比较
软件需求规范(SRS)中的噪声是不相关的需求陈述或非需求陈述。这可能会让读者感到困惑,并在软件开发的后期阶段产生负面影响。本研究提出了一种分类模型来检测第二类噪声,即非要求陈述。所构建的分类模型是基于非需求陈述的语义特征。本研究还比较了迄今为止五种最好的监督机器学习方法,它们是支持向量机(SVM)、朴素贝叶斯(NB)、随机森林(RF)、k近邻(kNN)和决策树。这种比较旨在确定哪种方法可以产生最佳的非需求分类模型。结果表明,SVM方法得到的模型最优,平均精度为0.96。该非需求分类模型最显著的特征是需求语句或非需求、id语句、归一化均值、标准差值、相似变量值、标准差归一化值、最大归一化值、相似变量归一化值、Bad NN值、均值、句子数、Bad VB评分、项目id。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
17
审稿时长
9 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信