Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modeling

International Journal of Data Mining Techniques and Applications Pub Date : 2020-12-10 DOI:10.20894/ijdmta.102.009.001.010

N. R, K. S

{"title":"Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modeling","authors":"N. R, K. S","doi":"10.20894/ijdmta.102.009.001.010","DOIUrl":null,"url":null,"abstract":"Software Defect Prediction (SDP) plays an active area in many research domain of Software Quality of Assurance (SQA). Many existing research studies are based on software traditional metric sets and defect prediction models are built in machine language to detect the bug for limited source code line. Inspired by the above existing system. In this paper, defect prediction is focused on predicting defects in source code. The objective of this thesis is to improve the software quality for accurate defect prediction is source code for file level. So, that it helps the developer to find the bug and fix the issue, to make better use of a resource which reduces the test effort, minimize the cost and improve the quality of software. A new approach is introduced to improve the prediction performance of Bidirectional RNNLM in Deep Neural Network. To build the defect prediction model a defect learner framework is proposed and first it need to build a Neural Language Model. Using this Language Model it helps to learn to deep semantic features in source code and it train & test the model. Based on language model it combined with software traditional metric sets to measure the code and find the defect. The probability of language model and metric set Cross-Entropy with Abstract Syntax Tree (CE-AST) metric is used to evaluate the defect proneness and set as a metric label. For classification the metric label K-NN classifier is used. BPTT algorithm for learning RNN will provide additional improvement, it improves the predictions performance to find the dynamic error.","PeriodicalId":414709,"journal":{"name":"International Journal of Data Mining Techniques and Applications","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20894/ijdmta.102.009.001.010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Software Defect Prediction (SDP) plays an active area in many research domain of Software Quality of Assurance (SQA). Many existing research studies are based on software traditional metric sets and defect prediction models are built in machine language to detect the bug for limited source code line. Inspired by the above existing system. In this paper, defect prediction is focused on predicting defects in source code. The objective of this thesis is to improve the software quality for accurate defect prediction is source code for file level. So, that it helps the developer to find the bug and fix the issue, to make better use of a resource which reduces the test effort, minimize the cost and improve the quality of software. A new approach is introduced to improve the prediction performance of Bidirectional RNNLM in Deep Neural Network. To build the defect prediction model a defect learner framework is proposed and first it need to build a Neural Language Model. Using this Language Model it helps to learn to deep semantic features in source code and it train & test the model. Based on language model it combined with software traditional metric sets to measure the code and find the defect. The probability of language model and metric set Cross-Entropy with Abstract Syntax Tree (CE-AST) metric is used to evaluate the defect proneness and set as a metric label. For classification the metric label K-NN classifier is used. BPTT algorithm for learning RNN will provide additional improvement, it improves the predictions performance to find the dynamic error.

查看原文本刊更多论文

双向递归神经网络语言模型:用于缺陷预测建模的交叉熵流失度量

软件缺陷预测(SDP)在软件质量保证(SQA)的许多研究领域中占有活跃的地位。现有的许多研究都是基于软件的传统度量集和用机器语言建立缺陷预测模型来检测有限的源代码行。受到上述现有系统的启发。在本文中，缺陷预测的重点是预测源代码中的缺陷。本文的目标是提高软件质量，实现对文件级源代码的准确缺陷预测。因此，它可以帮助开发人员找到错误并修复问题，从而更好地利用资源，从而减少测试工作量，最大限度地降低成本并提高软件质量。提出了一种提高深度神经网络双向RNNLM预测性能的新方法。为了建立缺陷预测模型，提出了一个缺陷学习器框架，该框架首先需要建立一个神经语言模型。使用该语言模型有助于学习源代码中的深层语义特征，并对模型进行训练和测试。在语言模型的基础上，结合软件传统度量集对代码进行度量，发现缺陷。利用语言模型和度量集的概率与抽象语法树(CE-AST)度量交叉熵来评估缺陷的倾向性，并将其作为度量标签。对于分类，使用度量标签K-NN分类器。BPTT算法将为学习RNN提供额外的改进，它提高了动态误差的预测性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Data Mining Techniques and Applications

自引率

0.00%

发文量