一种新的代码变更预测数据集:基于HEP软件的案例研究

E. Ronchieri, M. Canaparo, Yue Yang, A. Costantini, D. C. Duma, D. Salomoni
{"title":"一种新的代码变更预测数据集:基于HEP软件的案例研究","authors":"E. Ronchieri, M. Canaparo, Yue Yang, A. Costantini, D. C. Duma, D. Salomoni","doi":"10.1109/NSS/MIC42677.2020.9508053","DOIUrl":null,"url":null,"abstract":"Predicting changes proneness in software modules is an open area of research. This activity implies dealing with code changes datasets that are typically either incomplete or absent. To obtain a change dataset properly constructed, a new dictionary of software changes terms has been defined by leveraging our experience with the High Energy Physics (HEP) software. Our new dictionary includes various terms that classify a “code change” like warning, fixed bug, minor fix and optimization. Each term has been opportunely used to label each software module analyzed. The derived categories range from code development to performance improvements and refer to single pieces of the considered software. The resulting code-change dataset has been used to build a prediction model able to monitor software evolution and assess its maintainability over time. The present article gives details of the designed procedure that has been followed and presents the obtained results. The designed dictionary can be used with other, non-HEP, software as long as researchers can rely on well documented code changes. In such respect, our prediction model can be tested against these new datasets in order to improve both reliability and performance.","PeriodicalId":6760,"journal":{"name":"2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)","volume":"220 1","pages":"1-2"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A New Code Change Prediction Dataset: A Case Study Based on HEP Software\",\"authors\":\"E. Ronchieri, M. Canaparo, Yue Yang, A. Costantini, D. C. Duma, D. Salomoni\",\"doi\":\"10.1109/NSS/MIC42677.2020.9508053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predicting changes proneness in software modules is an open area of research. This activity implies dealing with code changes datasets that are typically either incomplete or absent. To obtain a change dataset properly constructed, a new dictionary of software changes terms has been defined by leveraging our experience with the High Energy Physics (HEP) software. Our new dictionary includes various terms that classify a “code change” like warning, fixed bug, minor fix and optimization. Each term has been opportunely used to label each software module analyzed. The derived categories range from code development to performance improvements and refer to single pieces of the considered software. The resulting code-change dataset has been used to build a prediction model able to monitor software evolution and assess its maintainability over time. The present article gives details of the designed procedure that has been followed and presents the obtained results. The designed dictionary can be used with other, non-HEP, software as long as researchers can rely on well documented code changes. In such respect, our prediction model can be tested against these new datasets in order to improve both reliability and performance.\",\"PeriodicalId\":6760,\"journal\":{\"name\":\"2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)\",\"volume\":\"220 1\",\"pages\":\"1-2\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NSS/MIC42677.2020.9508053\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NSS/MIC42677.2020.9508053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

预测软件模块的变化倾向是一个开放的研究领域。此活动意味着处理代码更改数据集,这些数据集通常要么不完整,要么不存在。为了获得正确构建的变更数据集,利用我们使用高能物理(HEP)软件的经验,定义了一个新的软件变更术语词典。我们的新字典包含了对“代码更改”进行分类的各种术语,如警告、修复bug、小修复和优化。每个术语都被恰当地用于标记所分析的每个软件模块。派生的类别范围从代码开发到性能改进,并涉及所考虑的软件的单个部分。生成的代码更改数据集用于构建一个预测模型,该模型能够监视软件的演变并评估其随时间推移的可维护性。本文详细介绍了所设计的程序,并给出了所获得的结果。设计的字典可以与其他非hep软件一起使用,只要研究人员可以依靠良好的文档代码更改。在这方面,我们的预测模型可以针对这些新的数据集进行测试,以提高可靠性和性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A New Code Change Prediction Dataset: A Case Study Based on HEP Software
Predicting changes proneness in software modules is an open area of research. This activity implies dealing with code changes datasets that are typically either incomplete or absent. To obtain a change dataset properly constructed, a new dictionary of software changes terms has been defined by leveraging our experience with the High Energy Physics (HEP) software. Our new dictionary includes various terms that classify a “code change” like warning, fixed bug, minor fix and optimization. Each term has been opportunely used to label each software module analyzed. The derived categories range from code development to performance improvements and refer to single pieces of the considered software. The resulting code-change dataset has been used to build a prediction model able to monitor software evolution and assess its maintainability over time. The present article gives details of the designed procedure that has been followed and presents the obtained results. The designed dictionary can be used with other, non-HEP, software as long as researchers can rely on well documented code changes. In such respect, our prediction model can be tested against these new datasets in order to improve both reliability and performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信