Beyond Dependencies: The Role of Copy-Based Reuse in Open Source Software Development

Mahmoud Jahanshahi, David Reid, Audris Mockus
{"title":"Beyond Dependencies: The Role of Copy-Based Reuse in Open Source Software Development","authors":"Mahmoud Jahanshahi, David Reid, Audris Mockus","doi":"arxiv-2409.04830","DOIUrl":null,"url":null,"abstract":"In Open Source Software, resources of any project are open for reuse by\nintroducing dependencies or copying the resource itself. In contrast to\ndependency-based reuse, the infrastructure to systematically support copy-based\nreuse appears to be entirely missing. Our aim is to enable future research and\ntool development to increase efficiency and reduce the risks of copy-based\nreuse. We seek a better understanding of such reuse by measuring its prevalence\nand identifying factors affecting the propensity to reuse. To identify reused\nartifacts and trace their origins, our method exploits World of Code\ninfrastructure. We begin with a set of theory-derived factors related to the\npropensity to reuse, sample instances of different reuse types, and survey\ndevelopers to better understand their intentions. Our results indicate that\ncopy-based reuse is common, with many developers being aware of it when writing\ncode. The propensity for a file to be reused varies greatly among languages and\nbetween source code and binary files, consistently decreasing over time. Files\nintroduced by popular projects are more likely to be reused, but at least half\nof reused resources originate from ``small'' and ``medium'' projects.\nDevelopers had various reasons for reuse but were generally positive about\nusing a package manager.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"11 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.04830","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In Open Source Software, resources of any project are open for reuse by introducing dependencies or copying the resource itself. In contrast to dependency-based reuse, the infrastructure to systematically support copy-based reuse appears to be entirely missing. Our aim is to enable future research and tool development to increase efficiency and reduce the risks of copy-based reuse. We seek a better understanding of such reuse by measuring its prevalence and identifying factors affecting the propensity to reuse. To identify reused artifacts and trace their origins, our method exploits World of Code infrastructure. We begin with a set of theory-derived factors related to the propensity to reuse, sample instances of different reuse types, and survey developers to better understand their intentions. Our results indicate that copy-based reuse is common, with many developers being aware of it when writing code. The propensity for a file to be reused varies greatly among languages and between source code and binary files, consistently decreasing over time. Files introduced by popular projects are more likely to be reused, but at least half of reused resources originate from ``small'' and ``medium'' projects. Developers had various reasons for reuse but were generally positive about using a package manager.
超越依赖:基于副本的重用在开源软件开发中的作用
在开源软件中,任何项目的资源都可以通过引入依赖关系或复制资源本身的方式进行重用。与基于依赖性的重用不同,系统地支持基于复制的重用的基础设施似乎完全缺失。我们的目标是使未来的研究和工具开发能够提高效率,降低基于复制的重用风险。我们试图通过测量这种重复使用的普遍程度和确定影响重复使用倾向的因素来更好地了解这种重复使用。为了识别重复使用的工件并追踪其来源,我们的方法利用了《代码世界》基础架构。我们从一组与重用倾向相关的理论衍生因素入手,对不同重用类型的实例进行抽样,并对开发者进行调查,以更好地了解他们的意图。我们的结果表明,基于拷贝的重用非常普遍,许多开发人员在编写代码时都意识到了这一点。不同语言、源代码和二进制文件之间的文件重用倾向差异很大,而且随着时间的推移不断降低。热门项目引入的文件更容易被重用,但至少有一半的重用资源来自 "小型 "和 "中型 "项目。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信