Eclipse项目中不同类型变更的实证研究

P. Tiwari, Wei Li, Raouf Alomainy, Bingyang Wei
{"title":"Eclipse项目中不同类型变更的实证研究","authors":"P. Tiwari, Wei Li, Raouf Alomainy, Bingyang Wei","doi":"10.2174/1874107X01307010024","DOIUrl":null,"url":null,"abstract":"This paper studied the distribution of different types of changes in the various contexts of the system and the relationship between artifact (file and module) size and different changes. We used the change data in the open source Eclipse Project through its decade-long evolution history. The latest release has 220 modules, 33904 files, 3780201 lines of code, and 49853 changes (accumulatively). This study focused on two levels of software artifacts: module and file; and four contexts of changes: all changes, error changes, non-error changes, and 19 change categories. At the module level, we found that the power-law distribution was a common phenomenon for three contexts of changes at both the module and file levels: it existed in all changes, in error changes, and in non-error changes. When we analyzed the 19 change categories, the files and modules exhibited different behavior: the power-law distribution existed in all but one category at the module level, but, about two third of the change categories did not show the power-law distribution at the file level. On the relationship between artifact size and changes, we found, at the module level, a few modules that had the majority of changes accounted for the majority of the code size; however, this phenomenon disappeared when we separated the er- ror from non-error changes. At the file level, this phenomenon did not exist at all. We did not find any correlation between artifact size and changes at either the module or file level.","PeriodicalId":262856,"journal":{"name":"The Open Software Engineering Journal","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Empirical Study of Different Types of Changes in the Eclipse Project\",\"authors\":\"P. Tiwari, Wei Li, Raouf Alomainy, Bingyang Wei\",\"doi\":\"10.2174/1874107X01307010024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studied the distribution of different types of changes in the various contexts of the system and the relationship between artifact (file and module) size and different changes. We used the change data in the open source Eclipse Project through its decade-long evolution history. The latest release has 220 modules, 33904 files, 3780201 lines of code, and 49853 changes (accumulatively). This study focused on two levels of software artifacts: module and file; and four contexts of changes: all changes, error changes, non-error changes, and 19 change categories. At the module level, we found that the power-law distribution was a common phenomenon for three contexts of changes at both the module and file levels: it existed in all changes, in error changes, and in non-error changes. When we analyzed the 19 change categories, the files and modules exhibited different behavior: the power-law distribution existed in all but one category at the module level, but, about two third of the change categories did not show the power-law distribution at the file level. On the relationship between artifact size and changes, we found, at the module level, a few modules that had the majority of changes accounted for the majority of the code size; however, this phenomenon disappeared when we separated the er- ror from non-error changes. At the file level, this phenomenon did not exist at all. We did not find any correlation between artifact size and changes at either the module or file level.\",\"PeriodicalId\":262856,\"journal\":{\"name\":\"The Open Software Engineering Journal\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Open Software Engineering Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/1874107X01307010024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Open Software Engineering Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874107X01307010024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

本文研究了不同类型的变更在系统不同环境中的分布,以及工件(文件和模块)大小与不同变更之间的关系。我们在开源Eclipse项目长达十年的发展历程中使用了变更数据。最新版本有220个模块、33904个文件、3780201行代码和49853个更改(累计)。本研究主要关注两个层次的软件构件:模块和文件;四种变更情境:全部变更、错误变更、非错误变更、19种变更类别。在模块级别,我们发现幂律分布在模块和文件级别的三种更改上下文中都是一种常见现象:它存在于所有更改中,存在于错误更改中,存在于非错误更改中。当我们分析19个变化类别时,文件和模块表现出不同的行为:在模块层面上,除了一个类别之外,所有类别都存在幂律分布,但是,大约三分之二的变化类别在文件层面上不存在幂律分布。在工件大小和变更之间的关系上,我们发现,在模块级别上,有几个模块的大部分变更占了代码大小的大部分;然而,当我们将误差变化与非误差变化分开时,这种现象就消失了。在文件级别,这种现象根本不存在。我们没有发现工件大小和模块或文件级别的更改之间的任何关联。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Empirical Study of Different Types of Changes in the Eclipse Project
This paper studied the distribution of different types of changes in the various contexts of the system and the relationship between artifact (file and module) size and different changes. We used the change data in the open source Eclipse Project through its decade-long evolution history. The latest release has 220 modules, 33904 files, 3780201 lines of code, and 49853 changes (accumulatively). This study focused on two levels of software artifacts: module and file; and four contexts of changes: all changes, error changes, non-error changes, and 19 change categories. At the module level, we found that the power-law distribution was a common phenomenon for three contexts of changes at both the module and file levels: it existed in all changes, in error changes, and in non-error changes. When we analyzed the 19 change categories, the files and modules exhibited different behavior: the power-law distribution existed in all but one category at the module level, but, about two third of the change categories did not show the power-law distribution at the file level. On the relationship between artifact size and changes, we found, at the module level, a few modules that had the majority of changes accounted for the majority of the code size; however, this phenomenon disappeared when we separated the er- ror from non-error changes. At the file level, this phenomenon did not exist at all. We did not find any correlation between artifact size and changes at either the module or file level.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信