A large-scale study on developer engagement and expertise in Configurable Software System projects

IF 4.1 2区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Systems and Software Pub Date : 2025-08-23 DOI:10.1016/j.jss.2025.112575

Karolina M. Milano , Wesley K.G. Assunção , Bruno B.P. Cafeo

{"title":"A large-scale study on developer engagement and expertise in Configurable Software System projects","authors":"Karolina M. Milano , Wesley K.G. Assunção , Bruno B.P. Cafeo","doi":"10.1016/j.jss.2025.112575","DOIUrl":null,"url":null,"abstract":"<div><div>Modern systems operate in multiple contexts making variability a fundamental aspect of Configurable Software Systems (CSSs). Variability, implemented via pre-processor directives (e.g., <span>#ifdef</span> blocks) interleaved with other code and spread across files, complicates maintenance and increases error risk. Despite its importance, little is known about how variable code is distributed among developers or whether conventional expertise metrics adequately capture variable code proficiency. This study investigates developers’ engagement with variable versus mandatory code, the concentration of variable code workload, and the effectiveness of expertise metrics in CSS projects. We mined repositories of 25 CSS projects, analyzing 450,255 commits from 9,678 developers. Results show that 59% of developers never modified variable code, while about 17% were responsible for developing and maintaining 83% of it. This indicates a high concentration of variable code expertise among a few developers, suggesting that task assignments should prioritize these specialists. Moreover, conventional expertise metrics performed poorly—achieving only around 55% precision and 50% recall in identifying developers engaged with variable code. Our findings highlight an unbalanced distribution of variable code responsibilities and underscore the need to refine expertise metrics to better support task assignments in CSS projects, thereby promoting a more equitable workload distribution.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"231 ","pages":"Article 112575"},"PeriodicalIF":4.1000,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225002444","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Modern systems operate in multiple contexts making variability a fundamental aspect of Configurable Software Systems (CSSs). Variability, implemented via pre-processor directives (e.g., #ifdef blocks) interleaved with other code and spread across files, complicates maintenance and increases error risk. Despite its importance, little is known about how variable code is distributed among developers or whether conventional expertise metrics adequately capture variable code proficiency. This study investigates developers’ engagement with variable versus mandatory code, the concentration of variable code workload, and the effectiveness of expertise metrics in CSS projects. We mined repositories of 25 CSS projects, analyzing 450,255 commits from 9,678 developers. Results show that 59% of developers never modified variable code, while about 17% were responsible for developing and maintaining 83% of it. This indicates a high concentration of variable code expertise among a few developers, suggesting that task assignments should prioritize these specialists. Moreover, conventional expertise metrics performed poorly—achieving only around 55% precision and 50% recall in identifying developers engaged with variable code. Our findings highlight an unbalanced distribution of variable code responsibilities and underscore the need to refine expertise metrics to better support task assignments in CSS projects, thereby promoting a more equitable workload distribution.

查看原文本刊更多论文

对可配置软件系统项目中开发人员参与和专业知识的大规模研究

现代系统在多种环境下运行，使可变性成为可配置软件系统（css）的一个基本方面。可变性，通过预处理器指令（例如，#ifdef块）与其他代码交错并分布在文件中实现，使维护变得复杂并增加了错误风险。尽管它很重要，但是很少有人知道可变代码是如何在开发人员之间分布的，或者传统的专业技术度量是否充分地捕获了可变代码的熟练程度。本研究调查了开发人员对可变代码和强制代码的参与，可变代码工作量的集中，以及CSS项目中专业知识度量的有效性。我们挖掘了25个CSS项目的存储库，分析了来自9,678名开发人员的450,255个提交。结果显示，59%的开发人员从未修改过变量代码，而17%的开发人员负责开发和维护其中83%的代码。这表明少数开发人员高度集中了变量代码专家，这表明任务分配应该优先考虑这些专家。此外，传统的专业技术指标表现不佳——在识别使用可变代码的开发人员时，只有大约55%的精确度和50%的召回率。我们的研究结果强调了可变代码责任的不平衡分布，并强调了改进专业技术指标以更好地支持CSS项目中的任务分配的必要性，从而促进更公平的工作量分配。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Systems and Software 工程技术-计算机：理论方法

CiteScore

8.60

自引率

5.70%

发文量

193

审稿时长

16 weeks

期刊介绍： The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: •Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution •Agile, model-driven, service-oriented, open source and global software development •Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems •Human factors and management concerns of software development •Data management and big data issues of software systems •Metrics and evaluation, data mining of software development resources •Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.