A Multinomial Naïve Bayes Classifier for identifying Actors and Use Cases from Software Requirement Specification documents

V. V., P. Samuel
{"title":"A Multinomial Naïve Bayes Classifier for identifying Actors and Use Cases from Software Requirement Specification documents","authors":"V. V., P. Samuel","doi":"10.1109/CONIT55038.2022.9848290","DOIUrl":null,"url":null,"abstract":"A software Requirements Specification (SRS) document is an NL (Natural Language) written textual specification that documents the functional and non-functional requirements of the system and various expectations of clients in a software development project. To understand the different requirements of the system, developers make use of this SRS document. In this paper, we apply Naive Bayes classifiers - Multinomial and Gaussian over different SRS documents and classify the software requirement entities (Actors and Use Cases) using Machine Learning based methods. SRS documents of 28 different systems are considered for our purpose and we define labels for the entities Actor and Use Case. Multinomial Naive Bayes is a popular classifier because of its computational efficiency and relatively good predictive performance. Out of the classifiers tried out, the Multinomial Naive Bayes recognizes Actors and Use Cases with an accuracy of 91%. Actors and Use Cases can be extracted with high accuracy from the SRS documents using Multinomial Naive Bayes, which then can be used for plotting the Use Case diagram of the system. Automated UML (Unified Modeling Language) model generation approaches have a very prominent role in an agile development environment where requirements change frequently. In this work, we attempt to automate the Requirement Engineering (RE) phase that can improve and accelerate the entire Software Development Life Cycle (SDLC).","PeriodicalId":270445,"journal":{"name":"2022 2nd International Conference on Intelligent Technologies (CONIT)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Intelligent Technologies (CONIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONIT55038.2022.9848290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

A software Requirements Specification (SRS) document is an NL (Natural Language) written textual specification that documents the functional and non-functional requirements of the system and various expectations of clients in a software development project. To understand the different requirements of the system, developers make use of this SRS document. In this paper, we apply Naive Bayes classifiers - Multinomial and Gaussian over different SRS documents and classify the software requirement entities (Actors and Use Cases) using Machine Learning based methods. SRS documents of 28 different systems are considered for our purpose and we define labels for the entities Actor and Use Case. Multinomial Naive Bayes is a popular classifier because of its computational efficiency and relatively good predictive performance. Out of the classifiers tried out, the Multinomial Naive Bayes recognizes Actors and Use Cases with an accuracy of 91%. Actors and Use Cases can be extracted with high accuracy from the SRS documents using Multinomial Naive Bayes, which then can be used for plotting the Use Case diagram of the system. Automated UML (Unified Modeling Language) model generation approaches have a very prominent role in an agile development environment where requirements change frequently. In this work, we attempt to automate the Requirement Engineering (RE) phase that can improve and accelerate the entire Software Development Life Cycle (SDLC).
从软件需求规范文档中识别参与者和用例的多项式Naïve贝叶斯分类器
软件需求规范(SRS)文档是一种NL(自然语言)编写的文本规范,它记录了软件开发项目中系统的功能和非功能需求以及客户的各种期望。为了了解系统的不同需求,开发人员使用了这个SRS文档。在本文中,我们在不同的SRS文档上应用朴素贝叶斯分类器-多项和高斯分类器,并使用基于机器学习的方法对软件需求实体(参与者和用例)进行分类。我们考虑了28个不同系统的SRS文档,并为实体Actor和Use Case定义了标签。多项朴素贝叶斯因其计算效率和相对较好的预测性能而成为一种流行的分类器。在测试过的分类器中,多项朴素贝叶斯识别参与者和用例的准确率为91%。使用多项式朴素贝叶斯可以从SRS文档中高精度地提取参与者和用例,然后可以用于绘制系统的用例图。自动化UML(统一建模语言)模型生成方法在需求频繁变化的敏捷开发环境中具有非常突出的作用。在这项工作中,我们尝试自动化需求工程(RE)阶段,它可以改进并加速整个软件开发生命周期(SDLC)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信