A Multinomial Naïve Bayes Classifier for identifying Actors and Use Cases from Software Requirement Specification documents

2022 2nd International Conference on Intelligent Technologies (CONIT) Pub Date : 2022-06-24 DOI:10.1109/CONIT55038.2022.9848290

V. V., P. Samuel

{"title":"A Multinomial Naïve Bayes Classifier for identifying Actors and Use Cases from Software Requirement Specification documents","authors":"V. V., P. Samuel","doi":"10.1109/CONIT55038.2022.9848290","DOIUrl":null,"url":null,"abstract":"A software Requirements Specification (SRS) document is an NL (Natural Language) written textual specification that documents the functional and non-functional requirements of the system and various expectations of clients in a software development project. To understand the different requirements of the system, developers make use of this SRS document. In this paper, we apply Naive Bayes classifiers - Multinomial and Gaussian over different SRS documents and classify the software requirement entities (Actors and Use Cases) using Machine Learning based methods. SRS documents of 28 different systems are considered for our purpose and we define labels for the entities Actor and Use Case. Multinomial Naive Bayes is a popular classifier because of its computational efficiency and relatively good predictive performance. Out of the classifiers tried out, the Multinomial Naive Bayes recognizes Actors and Use Cases with an accuracy of 91%. Actors and Use Cases can be extracted with high accuracy from the SRS documents using Multinomial Naive Bayes, which then can be used for plotting the Use Case diagram of the system. Automated UML (Unified Modeling Language) model generation approaches have a very prominent role in an agile development environment where requirements change frequently. In this work, we attempt to automate the Requirement Engineering (RE) phase that can improve and accelerate the entire Software Development Life Cycle (SDLC).","PeriodicalId":270445,"journal":{"name":"2022 2nd International Conference on Intelligent Technologies (CONIT)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Intelligent Technologies (CONIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONIT55038.2022.9848290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

A software Requirements Specification (SRS) document is an NL (Natural Language) written textual specification that documents the functional and non-functional requirements of the system and various expectations of clients in a software development project. To understand the different requirements of the system, developers make use of this SRS document. In this paper, we apply Naive Bayes classifiers - Multinomial and Gaussian over different SRS documents and classify the software requirement entities (Actors and Use Cases) using Machine Learning based methods. SRS documents of 28 different systems are considered for our purpose and we define labels for the entities Actor and Use Case. Multinomial Naive Bayes is a popular classifier because of its computational efficiency and relatively good predictive performance. Out of the classifiers tried out, the Multinomial Naive Bayes recognizes Actors and Use Cases with an accuracy of 91%. Actors and Use Cases can be extracted with high accuracy from the SRS documents using Multinomial Naive Bayes, which then can be used for plotting the Use Case diagram of the system. Automated UML (Unified Modeling Language) model generation approaches have a very prominent role in an agile development environment where requirements change frequently. In this work, we attempt to automate the Requirement Engineering (RE) phase that can improve and accelerate the entire Software Development Life Cycle (SDLC).

查看原文本刊更多论文

从软件需求规范文档中识别参与者和用例的多项式Naïve贝叶斯分类器

软件需求规范(SRS)文档是一种NL(自然语言)编写的文本规范，它记录了软件开发项目中系统的功能和非功能需求以及客户的各种期望。为了了解系统的不同需求，开发人员使用了这个SRS文档。在本文中，我们在不同的SRS文档上应用朴素贝叶斯分类器-多项和高斯分类器，并使用基于机器学习的方法对软件需求实体(参与者和用例)进行分类。我们考虑了28个不同系统的SRS文档，并为实体Actor和Use Case定义了标签。多项朴素贝叶斯因其计算效率和相对较好的预测性能而成为一种流行的分类器。在测试过的分类器中，多项朴素贝叶斯识别参与者和用例的准确率为91%。使用多项式朴素贝叶斯可以从SRS文档中高精度地提取参与者和用例，然后可以用于绘制系统的用例图。自动化UML(统一建模语言)模型生成方法在需求频繁变化的敏捷开发环境中具有非常突出的作用。在这项工作中，我们尝试自动化需求工程(RE)阶段，它可以改进并加速整个软件开发生命周期(SDLC)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 2nd International Conference on Intelligent Technologies (CONIT)

自引率

0.00%

发文量