Journal of Software-Evolution and Process最新文献

筛选
英文 中文
Strategic Intelligence Management (ISO 56006)—Using AI by the Innovation Agent Task Force in the Automotive Skills Alliance (ASA) 战略情报管理(ISO 56006) -汽车技能联盟(ASA)创新代理工作组使用人工智能
IF 1.8 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-07-31 DOI: 10.1002/smr.70038
Laura Aschbacher, Mikus Zelmenis, Richard Messnarz, Damjan Ekert
{"title":"Strategic Intelligence Management (ISO 56006)—Using AI by the Innovation Agent Task Force in the Automotive Skills Alliance (ASA)","authors":"Laura Aschbacher,&nbsp;Mikus Zelmenis,&nbsp;Richard Messnarz,&nbsp;Damjan Ekert","doi":"10.1002/smr.70038","DOIUrl":"https://doi.org/10.1002/smr.70038","url":null,"abstract":"<div>\u0000 \u0000 <p>In the EU blueprint project FLAMENCO (Forward Looking Approaches for Green Mobility Ecosystem Network Collaboration), an innovation agent task force has been founded, which acts as an expert panel to elaborate a skills set of an innovation agent for automotive and establishes an innovation capability assessment model based on the ISO 560xx Innovation Management Systems norm series. In 2024, a new EU project TRIREME (Digital &amp; Green Skills Towards Future of the Mobility Ecosystem, 2024–2027) started, which builds on this existing innovation agent task force and provides resources to elaborate MOOcs (Massive Open Online Courses) per chapter of the ISO 5600x norm applying new tools like AI. The MOOC is then configured in a European Skills Hub of the ASA (Automotive Skills Alliance). ASA represents the pact for skills partner in the EU Erasmus+ program for the automotive sector. The research work about the use of AI (artificial intelligence) for the implementation of specific ISO 560xx chapters will be published. This paper is about the results of the work on the ISO 56006 Strategic Intelligence Management implementation using AI in the TRIREME project.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 8","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144740510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Software Reliability With Power Law Testing Effort Function Under Operational Uncertain Environment 运行不确定环境下用幂律测试功函数建模软件可靠性
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-07-09 DOI: 10.1002/smr.70037
Anup Kumar Behera, Priyanka Agarwal
{"title":"Modeling Software Reliability With Power Law Testing Effort Function Under Operational Uncertain Environment","authors":"Anup Kumar Behera,&nbsp;Priyanka Agarwal","doi":"10.1002/smr.70037","DOIUrl":"https://doi.org/10.1002/smr.70037","url":null,"abstract":"<div>\u0000 \u0000 <p>In today's swiftly evolving technological landscape, the importance of software reliability has become crucial. To evaluate software reliability, many researchers have investigated several software reliability growth models (SRGMs). Software developers frequently use a controlled environment for software testing, where they are aware of all the factors. However, the operational environment can introduce unpredictable and unfamiliar factors. Many studies in the literature have recognized the existence of uncertainty in the operational environment with different scenarios like perfect and imperfect debugging, several testing coverage functions, different error detection rates, etc. However, the inclusion of the testing effort function (TEF) alongside this operating uncertain environment has received notably less attention. This paper addresses this gap by exploring a software reliability growth model that integrates a power law TEF to account for an operational uncertain environment. For the validation, a numerical analysis is done based on two datasets (DS1 and DS2), and the proposed model is compared to seven existing reliability models using six goodness-of-fit criteria, and other improved NCD ranking criteria. In addition, we have also conducted single and multiple-parameter sensitivity analysis, which has enabled us to identify the critical parameters. The proposed models could potentially assist system analysts in predicting various parameters related to certain software systems. The findings encourage the decision makers.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 7","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144589673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
“Eco-Friendly” Micro-Services: Resource Footprint of Cloud-Native Micro-Services Written in Java—A Balance Between Build-Time and Run-Time Effects of Releases “生态友好型”微服务:用java编写的云原生微服务的资源足迹——版本构建时和运行时效果之间的平衡
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-07-01 DOI: 10.1002/smr.70036
Alexander Poth, Olsi Rrjolli
{"title":"“Eco-Friendly” Micro-Services: Resource Footprint of Cloud-Native Micro-Services Written in Java—A Balance Between Build-Time and Run-Time Effects of Releases","authors":"Alexander Poth,&nbsp;Olsi Rrjolli","doi":"10.1002/smr.70036","DOIUrl":"https://doi.org/10.1002/smr.70036","url":null,"abstract":"<div>\u0000 \u0000 <p>The usage of micro-services in IT services is increasing. As this growth continues, the importance of eco-friendly design and operation becomes a significant factor. This study aims to evaluates the potential of Java frameworks that facilitate cloud-native micro-services in reducing the energy footprint throughout the release lifecycle. The release lifecycle view also looks into the development phase and its footprint impact respectively potentials to build overall more energy footprint optimized releases. Technology-driven methods and tools based on micro-service frameworks can help reduce the energy footprint at the micro-service level. However, local optimizations at the micro-service level cannot diminish the importance of more holistic approaches, such as optimizing the overall system, architecture and design of micro-services. The concepts derived from this analysis can be implemented in industrial settings, as presented by the case study. The effects are measurable and represent a positive step towards more eco-friendly cloud-native micro-service-based IT service offerings.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 7","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144524874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Test Co-Evolution in Software Projects: A Large-Scale Empirical Study 软件项目中的测试协同进化:一个大规模的实证研究
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-06-26 DOI: 10.1002/smr.70035
Charles Miranda, Guilherme Avelino, Pedro Santos Neto
{"title":"Test Co-Evolution in Software Projects: A Large-Scale Empirical Study","authors":"Charles Miranda,&nbsp;Guilherme Avelino,&nbsp;Pedro Santos Neto","doi":"10.1002/smr.70035","DOIUrl":"https://doi.org/10.1002/smr.70035","url":null,"abstract":"<p>The asynchronous evolution of tests and code can compromise software quality and project longevity. To investigate the impact of test and production code co-evolution, this study analyzes a large-scale dataset of 526 GitHub repositories written in six programming languages: JavaScript, TypeScript, Java, Python, PHP, and C#. We focus on understanding how tests evolve throughout the software lifecycle and the frequency with which production and test code evolve in sync. By applying clustering algorithms and Pearson's correlation coefficient, we identify different patterns of test co-evolution between projects. We found a significant correlation between high test co-evolution and smaller development teams but no significant relationship with the frequency of different maintenance activities (corrective, adaptive, perfective, or multi). Despite this, we identified five distinct test evolution patterns, highlighting diverse approaches to integrating testing practices. This work provides valuable insights into the dynamics of test co-evolution and its correlation in software maintainability.</p>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 7","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/smr.70035","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144493020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Test Adequacy of Benchmarks for LLMs on Code Generation 评估llm代码生成基准测试的充分性
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-06-25 DOI: 10.1002/smr.70034
Xiangyue Liu, Xiaobing Sun, Lili Bo, Yufei Hu, Xinwei Liu, Zhenlei Ye
{"title":"Evaluating the Test Adequacy of Benchmarks for LLMs on Code Generation","authors":"Xiangyue Liu,&nbsp;Xiaobing Sun,&nbsp;Lili Bo,&nbsp;Yufei Hu,&nbsp;Xinwei Liu,&nbsp;Zhenlei Ye","doi":"10.1002/smr.70034","DOIUrl":"https://doi.org/10.1002/smr.70034","url":null,"abstract":"<div>\u0000 \u0000 <p>Code generation for users' intent has become increasingly prevalent with the large language models (LLMs). To automatically evaluate the effectiveness of these models, multiple execution-based benchmarks are proposed, including specially crafted tasks, accompanied by some test cases and a ground truth solution. LLMs are regarded as well-performed in code generation tasks if they can pass the test cases corresponding to most tasks in these benchmarks. However, it is unknown whether the test cases have sufficient test adequacy and whether the test adequacy can affect the evaluation. In this paper, we conducted an empirical study to evaluate the test adequacy of the execution-based benchmarks and to explore their effects during evaluation for LLMs. Based on the evaluation of the widely used benchmarks, HumanEval, MBPP, and two enhanced benchmarks HumanEval+ and MBPP+, we obtained the following results: (1) All the evaluated benchmarks have high statement coverage (above 99.16%), low branch coverage (74.39%) and low mutation score (87.69%). Especially for the tasks with higher cyclomatic complexities in the HumanEval and MBPP, the mutation score of test cases is lower. (2) No significant correlation exists between test adequacy (statement coverage, branch coverage and mutation score) of benchmarks and evaluating results on LLMs at the individual task level. (3) There is a significant positive correlation between mutation score-based evaluation and another execution-based evaluation metric (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 <mi>v</mi>\u0000 <mi>g</mi>\u0000 <mi>P</mi>\u0000 <mi>a</mi>\u0000 <mi>s</mi>\u0000 <mi>s</mi>\u0000 <mi>R</mi>\u0000 <mi>a</mi>\u0000 <mi>t</mi>\u0000 <mi>i</mi>\u0000 <mi>o</mi>\u0000 </mrow>\u0000 <annotation>$$ AvgPassRatio $$</annotation>\u0000 </semantics></math>) on LLMs at the individual task level. (4) The existing test case augmentation techniques have limited improvement in the coverage of test cases in the benchmark, while significantly improving the mutation score by approximately 34.60% and also can bring a more rigorous evaluation to LLMs on code generation. (5) The LLM-based test case generation technique (EvalPlus) performs better than the traditional search-based technique (Pynguin) in improving the benchmarks' test quality and evaluation ability of code generation.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 7","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144482208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Effectiveness of Open-Source Donation Platform: An Empirical Study on Opencollective 探索开源捐赠平台的有效性:基于Opencollective的实证研究
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-06-25 DOI: 10.1002/smr.70033
Shuoxiao Zhang, Enyi Tang, Xinyu Gao, Zhekai Zhang, Yixiao Shan, Haofeng Zhang, Ziyang He, Jianhua Zhao, Xuandong Li
{"title":"Exploring the Effectiveness of Open-Source Donation Platform: An Empirical Study on Opencollective","authors":"Shuoxiao Zhang,&nbsp;Enyi Tang,&nbsp;Xinyu Gao,&nbsp;Zhekai Zhang,&nbsp;Yixiao Shan,&nbsp;Haofeng Zhang,&nbsp;Ziyang He,&nbsp;Jianhua Zhao,&nbsp;Xuandong Li","doi":"10.1002/smr.70033","DOIUrl":"https://doi.org/10.1002/smr.70033","url":null,"abstract":"<div>\u0000 \u0000 <p>In recent years, with the development of the open-source community, various open-source donation platforms have emerged. These platforms effectively alleviate the financial pressures faced by open-source projects through diversified funding sources and flexible donation methods. As one of the most representative open-source donation platforms, Opencollective has garnered widespread attention from both the open-source community and academia. Although Opencollective claims to provide more funding opportunities for open-source projects, the extent to which it effectively addresses the financial challenges faced by these projects remains unclear. While there have been studies on the effectiveness of traditional donation models, research on the effectiveness of emerging donation platforms such as Opencollective is still limited. Given that a large number of open-source projects are urgently seeking donations, understanding the effectiveness of donations through Opencollective is crucial for these projects. To address this gap, we have made an early step in this direction. This paper conducts a comprehensive study on the effectiveness of donations through the Opencollective, employing a combination of quantitative and qualitative analysis and identifies the following key findings: (1) Opencollective attracts a diverse group of participants, including individual donors, sponsors, contributors, and project managers, with individual donors constituting the largest group. Most donations are concentrated in the range of $5 to $10, indicating that the platform largely relies on small but frequent donations from individuals. (2) Only about 26.61% of open-source projects receive donations through Opencollective, with approximately 64.38% of these projects receiving a total donation amount of less than $50,000. The likelihood of receiving donations increases with project scale, maturity and the number of stars. Among projects that have received donations, larger projects with stronger social media promotion, greater attention and more issues are more likely to receive additional donations. (3) The positive impact of donations on project development and spend activities is significant only in the short term, with no notable long-term effects. In contrast, donations do not have a significant short-term impact on community engagement. Although the long-term effect is slightly positive, it is not statistically significant. (4) The main shortcomings of Opencollective include insufficient project management and collaboration features, inadequate user experience and interface design, high transaction fees, and a lack of transparency in fund allocation and usage. Our findings provide significant theoretical support and practical recommendations for the effectiveness of emerging donation platforms and the sustainable development of open-source projects.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 7","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144482206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Security Controls in DevSecOps: Challenges, Solutions, and Future Research Directions 在DevSecOps中集成安全控制:挑战、解决方案和未来研究方向
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-06-09 DOI: 10.1002/smr.70029
Maysa Sinan, Mojtaba Shahin, Iqbal Gondal
{"title":"Integrating Security Controls in DevSecOps: Challenges, Solutions, and Future Research Directions","authors":"Maysa Sinan,&nbsp;Mojtaba Shahin,&nbsp;Iqbal Gondal","doi":"10.1002/smr.70029","DOIUrl":"https://doi.org/10.1002/smr.70029","url":null,"abstract":"<p>Cybersecurity has become a top priority for most organizations to protect their applications. The rapid increase in cyberattacks has necessitated a comprehensive repositioning of how security should be implemented within the software development lifecycle (SDLC). Development, Security, Operations (DevSecOps) is one of the trendy security methodologies and fastest growing development methods promoting shared responsibility for security and automating security practices at every step of the SDLC. DevSecOps is a cultural shift that integrates security controls into DevOps pipelines aiming to upscale overall security. Therefore, many organizations started to incorporate security controls within the deployment of DevSecOps through conducting continuous practices, for example, automated security testing, infrastructure as code (IaC), compliance as code, and continuous monitoring. This study aims to organize the knowledge and shed light on challenges concerning security controls during the adoption of DevSecOps, along with associated solutions and remediation workarounds reported in the literature. Further, the study aims to provide clear insights into the areas that require further investigation and research in the future. A systematic literature review (SLR) of 45 primary studies was carried out to extract data, and subsequently, the extracted data was analyzed using the thematic analysis method. This paper identifies 19 challenges related to security controls that could be experienced by security practitioners while implementing a DevSecOps model, along with 18 solutions and remediation actions suggested in literature to address and overcome some of the enlisted challenges. In addition, some gap areas are identified as opportunities for future research in this domain with the aim of improving the integration of security controls in a DevSecOps environment. Based on findings, this paper points out the importance of automation in software engineering practices, for example, continuous automation, continuous delivery, and continuous feedback, to embed security controls at the early stages of the development process.</p>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/smr.70029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144244310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improvement of Software Testing Processes With Test Maturity Model Integration 基于测试成熟度模型集成的软件测试过程改进
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-05-22 DOI: 10.1002/smr.70031
Gökhan Şit, Süleyman Ersöz, Mehmet Burak Bilgin
{"title":"Improvement of Software Testing Processes With Test Maturity Model Integration","authors":"Gökhan Şit,&nbsp;Süleyman Ersöz,&nbsp;Mehmet Burak Bilgin","doi":"10.1002/smr.70031","DOIUrl":"https://doi.org/10.1002/smr.70031","url":null,"abstract":"<div>\u0000 \u0000 <p>In this study, a maturity-level determination and assessment method developed for companies operating in the software industry to perform TMMi Levels 2 and 3 assessments in-house, with the goal of improving testing processes, is presented. With this method, it is aimed to help companies to conduct their own self-assessments and improve their testing processes before participating in high-budget audits. The validity of this method was tested in practice for TMMi Levels 2 and 3 assessments. Companies can prepare for a formal TMMi audit by using the test maturity-level determination methodology developed in this study, or they can simply improve their processes to produce higher quality products. Additionally, if they already have TMMi certification, they can regularly self-audit to ensure continuity and compliance. Demonstrating how TMMi can be applied in practice and providing a guide to test process maturity determination are key contributions of this work.</p>\u0000 </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 5","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144117832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Size Measurement With Conceptual Models: A Systematic Literature Review 概念模型的功能大小测量:系统文献综述
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-05-21 DOI: 10.1002/smr.70030
Ala Arman, Emiliano Di Reto, Massimo Mecella, Giuseppe Santucci
{"title":"Functional Size Measurement With Conceptual Models: A Systematic Literature Review","authors":"Ala Arman,&nbsp;Emiliano Di Reto,&nbsp;Massimo Mecella,&nbsp;Giuseppe Santucci","doi":"10.1002/smr.70030","DOIUrl":"https://doi.org/10.1002/smr.70030","url":null,"abstract":"&lt;p&gt;The demand for efficient functional size measurement (FSM) methods in the competitive software market today is undeniable. However, incomplete and imprecise system specifications pose significant challenges, particularly in scenarios that require fast, flexible, and accurate software size estimation, such as public tenders. Although the integration of conceptual models within FSMs offers a promising solution to these issues, a systematic exploration of such methods remains largely unexplored. This work evaluates FSM methods that integrate conceptual models by analyzing studies from the past 20 years. It highlights key contributions and advances in proposed conceptual model-based FSM methods. In addition, the study examines their limitations and challenges, offering insights for future improvements. A systematic literature review (SLR) was conducted to guide the research process. The review was organized around three research questions, each targeting the study's key objectives: (1) to explore FSM methods utilizing conceptual models, (2) to summarize proposals for their improvement, and (3) to identify the limitations of the proposed enhancements. Primary studies span two decades (2004–2024), with peaks in 2008 and 2015, averaging one to two studies annually. Of the 1371 initial studies, 13 were selected using strict criteria. These studies are categorized into &lt;i&gt;Measurement Techniques&lt;/i&gt; (30.77%), &lt;i&gt;Automation&lt;/i&gt; (38.46%), and &lt;i&gt;Application-Specific&lt;/i&gt; topics (30.77%). The contributions of the primary studies are analyzed in terms of their approaches &lt;i&gt;Repeatability&lt;/i&gt; and &lt;i&gt;Validation&lt;/i&gt;. &lt;i&gt;Repeatability&lt;/i&gt; is assessed by examining whether the primary studies proposed a formal model when using real datasets. In contrast, &lt;i&gt;Validation&lt;/i&gt; focuses on whether the studies were tested in real-world projects. A total of 46.15% of the primary studies utilize formal models, whereas 53.85% rely on nonformal models, although dataset size is often unspecified. Most studies validate their methods using 1 to 30 projects. Common Software Measurement International Consortium (COSMIC) is the most widely used FSM method (69.23%), followed by the Function Point Analysis (FPA) (15.38%) and custom Methods (15.38%), with conceptual UML models appearing in 84.61% of the studies. Key limitations, including &lt;i&gt;Scalability and Generalizability&lt;/i&gt;, &lt;i&gt;Complexity Robustness&lt;/i&gt;, and &lt;i&gt;Flexibility&lt;/i&gt;, persist across all categories. Notably, &lt;i&gt;Scalability and Generalizability&lt;/i&gt; was identified as a limitation in 75% of &lt;i&gt;Measurement Techniques&lt;/i&gt; studies, 80% of &lt;i&gt;Automation&lt;/i&gt; studies, and 75% of &lt;i&gt;Application-Specific&lt;/i&gt; studies, while &lt;i&gt;Flexibility&lt;/i&gt; challenges were most pronounced, affecting 100% of &lt;i&gt;Application-Specific&lt;/i&gt; studies. The limited number of primary studies underscores a substantial research gap in conceptual model-based FSM methods. Future research should focus on developing formal models to enhance theoretical rigor, lever","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 5","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/smr.70030","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144108838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Overcoming Data Shortage in Critical Domains With Data Augmentation for Natural Language Software Requirements 基于自然语言软件需求的数据扩充克服关键领域的数据短缺
IF 1.7 4区 计算机科学
Journal of Software-Evolution and Process Pub Date : 2025-05-12 DOI: 10.1002/smr.70027
Robin Korfmann, Patrick Beyersdorffer, Rainer Gerlich, Jürgen Münch, Marco Kuhrmann
{"title":"Overcoming Data Shortage in Critical Domains With Data Augmentation for Natural Language Software Requirements","authors":"Robin Korfmann,&nbsp;Patrick Beyersdorffer,&nbsp;Rainer Gerlich,&nbsp;Jürgen Münch,&nbsp;Marco Kuhrmann","doi":"10.1002/smr.70027","DOIUrl":"https://doi.org/10.1002/smr.70027","url":null,"abstract":"<p>Natural language processing (NLP) offers the potential to automate quality assurance of software requirement specifications. In particular, large-scale projects involving numerous suppliers can benefit from this improvement. However, due to privacy restrictions especially in highly restrictive industries, the availability of software requirements specification documents for training NLP tools is severely limited. Also, domain- and project-specific vocabulary, as such in the aerospace domain, require specialized models for processing effectively. To provide a sufficient amount of data to train such models, we studied algorithms for the augmentation of textual data. Four algorithms have been investigated by expanding a given set of requirements from the European Space projects generating correct and incorrect requirements. The initial study yielded data of poor quality due to the particularities of the domain-specific vocabulary, yet laid the foundation for the algorithms' improvement, which, eventually, resulted in an increased set of requirements, which is 20 times the size of the seed set. A complementing experiment demonstrated the usability of augmented requirements to support AI-based quality assurance of software requirements. Furthermore, a selected improvement of the augmentation algorithms demonstrated notable quality improvements by doubling the number of correctly augmented requirements.</p>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 5","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/smr.70027","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143939432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信