Latest Articles from IEEE Transactions on Software Engineering

On the Effectiveness of LLM-as-a-Judge for Code Generation and Summarization
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-07-04 DOI: 10.1109/TSE.2025.3586082
Giuseppe Crupi;Rosalia Tufano;Alejandro Velasco;Antonio Mastropaolo;Denys Poshyvanyk;Gabriele Bavota
Abstract: Large Language Models (LLMs) have recently been exploited as judges for complex natural language processing tasks such as question answering (Q&A). The basic idea is to delegate to an LLM the assessment of the "quality" of the output produced by an automated technique (often another LLM) for tasks in which (i) quantitative metrics tell only part of the story, and (ii) a large-scale human evaluation would be too expensive. If proven effective for a specific task, LLMs-as-a-judge can also unlock new possibilities for automation, with several LLMs proposing a solution for a given instance of the task (e.g., an answer to a question) and others judging and deciding which output is best to show the user. We study the effectiveness of LLMs-as-a-judge for two code-related tasks, namely code generation and code summarization. The rationale for choosing these tasks is twofold. First, quantitative metrics are usually not enough to assess code summarizers and generators; for example, it is well documented that metrics such as BLEU are quite weak proxies for the quality of generated summaries. Second, even state-of-the-art techniques still struggle with complex instances of these tasks (e.g., summarizing a long, complex function), making them good candidates for more advanced solutions envisioning collaboration among LLMs. For code generation, we check whether eight LLMs are able to judge the correctness of 1,405 Java methods and 1,281 Python functions generated by the same LLMs or implemented by humans. For code summarization, we compare the judgments of five LLMs to those provided by nine humans for ~1.2k summaries of both Java and Python functions. Our findings show that GPT-4-turbo has the best judging capabilities for both tasks, while "smaller" LLMs with tens of billions of parameters cannot cope with judging tasks. However, even the best-performing LLM frequently misjudges code correctness and summary quality.
Vol. 51, No. 8, pp. 2329-2345.
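A minimal sketch of the judging setup the abstract describes: one or more LLMs grade a candidate implementation, and a panel vote decides. The query_llm helper and the prompt wording are placeholders, not the authors' protocol:

    # Hypothetical helper standing in for any chat-completion API.
    def query_llm(model: str, prompt: str) -> str:
        raise NotImplementedError

    def judge_correctness(model: str, task: str, code: str) -> bool:
        # Ask for a strict YES/NO verdict so the answer is machine-readable.
        prompt = (
            "You are a code reviewer. Given a task and an implementation, "
            "answer strictly YES or NO: is the code functionally correct?\n\n"
            f"Task:\n{task}\n\nImplementation:\n{code}\n"
        )
        return query_llm(model, prompt).strip().upper().startswith("YES")

    def panel_vote(models: list[str], task: str, code: str) -> bool:
        # Several LLMs judge independently; the majority verdict wins,
        # mirroring the multi-judge automation scenario in the abstract.
        votes = [judge_correctness(m, task, code) for m in models]
        return sum(votes) > len(votes) / 2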
Citations: 0
TransferFuzz-Pro: Large Language Model Driven Code Debugging Technology for Verifying Propagated Vulnerability
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-07-03 DOI: 10.1109/TSE.2025.3584774
Siyuan Li;Kaiyu Xie;Yuekang Li;Hong Li;Yimo Ren;Limin Sun;Hongsong Zhu
Abstract: Code reuse in software development frequently facilitates the spread of vulnerabilities, leading to imprecise scopes of affected software in CVE reports. Traditional methods focus primarily on detecting reused vulnerability code in target software but cannot confirm whether these vulnerabilities can be triggered in new software contexts. In previous work, we introduced the TransferFuzz framework to address this gap using historical trace-based fuzzing. However, its effectiveness is constrained by the need for manual intervention and its reliance on source-code instrumentation. To overcome these limitations, we propose TransferFuzz-Pro, a novel framework that integrates Large Language Model (LLM)-driven code debugging technology. By leveraging an LLM for automated, human-like debugging and Proof-of-Concept (PoC) generation, combined with binary-level instrumentation, TransferFuzz-Pro extends verification capabilities to a wider range of targets. Our evaluation shows that TransferFuzz-Pro is significantly faster and can automatically validate vulnerabilities that were previously unverifiable with conventional methods. Notably, it expands the number of affected software instances for 15 CVE-listed vulnerabilities from 15 to 53 and successfully generates PoCs for various Linux distributions. These results demonstrate that TransferFuzz-Pro effectively verifies vulnerabilities introduced by code reuse in target software and automatically generates PoCs.
Vol. 51, No. 8, pp. 2396-2411.
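The core loop of historical trace-guided fuzzing, which TransferFuzz introduced and TransferFuzz-Pro further automates, can be sketched as follows: seeds whose execution traces overlap the known vulnerable call trace are favored. run_with_trace is a hypothetical stand-in for the framework's binary-level instrumentation:

    import random

    def run_with_trace(binary: str, data: bytes):
        """Hypothetical: execute the instrumented binary on `data`,
        returning (crashed, list of functions hit)."""
        raise NotImplementedError

    def trace_overlap(trace, vuln_trace):
        hit = set(trace)
        return sum(1 for fn in vuln_trace if fn in hit) / len(vuln_trace)

    def fuzz(binary, seeds, vuln_trace, rounds=10_000):
        corpus = [(s, 0.0) for s in seeds]  # (input, overlap score)
        for _ in range(rounds):
            # Pick a promising seed and apply a single byte-flip mutation.
            seed, _ = max(random.sample(corpus, min(8, len(corpus))),
                          key=lambda p: p[1])
            data = bytearray(seed)
            if data:
                data[random.randrange(len(data))] ^= random.randrange(1, 256)
            crashed, trace = run_with_trace(binary, bytes(data))
            if crashed:
                return bytes(data)  # candidate PoC, still to be validated
            corpus.append((bytes(data), trace_overlap(trace, vuln_trace)))
        return None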
Citations: 0
COTE: Predicting Code-to-Test Co-Evolution by Integrating Link Analysis and Pre-Trained Language Model Techniques
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-27 DOI: 10.1109/TSE.2025.3583027
Yuyong Liu;Zhifei Chen;Lin Chen;Yanhui Li;Xuansong Li;Wei Song
Abstract: Tests, as essential artifacts, should co-evolve with the production code to ensure that the associated production code satisfies its specification. However, developers often postpone or even forget to update tests, leaving them outdated and lagging behind the code. When predicting which tests need to be updated after a production-code change, it is challenging to identify all related tests and determine their change probabilities under complex change scenarios. This paper fills the gap with a hybrid approach named COTE that predicts code-to-test co-evolution. We first compute linked test candidates based on different code-to-test dependencies. We then identify common co-change patterns by building a method-level dependence graph. For the remaining ambiguous patterns, we leverage a pre-trained language model that captures the semantic features of code and the change reasons contained in commit messages to judge a test's likelihood of being updated. Experiments on our datasets, consisting of 6,314 samples extracted from 5,000 Java projects, show that COTE outperforms state-of-the-art approaches, achieving a precision of 89.0% and a recall of 71.6%. This work can help practitioners reduce test maintenance costs and improve software quality.
Vol. 51, No. 8, pp. 2232-2253.
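The link-analysis step can be pictured with a toy heuristic: map a changed production method to candidate tests via naming and call dependencies. COTE itself builds a method-level dependence graph and falls back to a pre-trained language model for ambiguous cases; the sketch below covers only the obvious links:

    def candidate_tests(changed_method: str,
                        test_callees: dict[str, set[str]]) -> list[str]:
        """test_callees maps each test to the production methods it calls."""
        hits = []
        for test, callees in test_callees.items():
            name_link = changed_method.lower() in test.lower()  # testFoo -> foo
            call_link = changed_method in callees               # direct call
            if name_link or call_link:
                hits.append(test)
        return hits

    print(candidate_tests("parseHeader", {
        "testParseHeader": {"parseHeader"},
        "testFooter": {"parseFooter"},
    }))  # -> ['testParseHeader']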
Citations: 0
OneMoreTest: A Learning-Based Approach to Generating and Selecting Fault-Revealing Unit Tests
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-25 DOI: 10.1109/TSE.2025.3581556
Wei Wei;Yanjie Jiang;Yahui Li;Lu Zhang;Hui Liu
Abstract: Developers often manually design a few unit tests for a method under development. Once such manually designed tests pass, however, they usually turn to automated test-generation tools like EvoSuite and Randoop for more thorough testing. Although automatically generated tests may achieve high coverage, they rarely identify hard-to-detect defects automatically because of the well-known test oracle problem: without an explicit oracle (expected output), it is challenging to tell whether an output is correct. Consequently, developers must manually select and verify a few suspicious test cases to identify hard-to-detect defects. To this end, we propose a novel approach, called OneMoreTest, for generating and selecting the most suspicious tests for manual verification. Based on a manually designed passing test, OneMoreTest automatically generates millions of input-output pairs for the method under test (MUT) with mutation-based fuzzing. It then trains an automatically generated neural network to simulate the MUT's behavior. For new tests automatically generated for the same MUT, OneMoreTest suggests the top-k most suspicious tests, i.e., those with the greatest distance between their actual output and the network's estimated output. Our evaluation on real-world faulty methods suggests that OneMoreTest is accurate: on 70.79% of the 178 real-world faulty methods involved, the defect can be identified by manually verifying only a single test per method, as suggested by OneMoreTest. Compared against the state of the art, OneMoreTest improved precision from 46.63% to 72.62% and recall from 46.63% to 70.79%.
Vol. 51, No. 8, pp. 2346-2365.
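OneMoreTest's ranking step can be sketched as follows, assuming a surrogate model fitted on input-output pairs from the passing version of the method under test (the paper trains an automatically generated neural network; any regressor with a predict method serves for illustration):

    import numpy as np

    def rank_suspicious(surrogate, inputs, actual_outputs, k=5):
        """Rank tests by disagreement between the method's actual output and
        the surrogate's prediction; large gaps suggest anomalous behavior.
        inputs and actual_outputs are 2-D arrays, one row per test."""
        predicted = surrogate.predict(inputs)
        distances = np.linalg.norm(actual_outputs - predicted, axis=1)
        top = np.argsort(-distances)[:k]  # largest disagreement first
        return [(int(i), float(distances[i])) for i in top]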
Citations: 0
Enriching Mutation Testing With Innovative Method Invocation Mutation: Filling the Crucial Missing Piece of the Puzzle
IF 6.5 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-19 DOI: 10.1109/TSE.2025.3573751
Peng Zhang;Zeyu Lu;Yang Wang;Yibiao Yang;Yuming Zhou;Mike Papadakis
Abstract: Mutation testing aims to simulate real-world defects, but existing tools often struggle to replicate method invocation defects accurately. To address this, we propose MIN (Method INvocation mutator), which uses a mapping strategy to pair method names with corresponding values, ensuring that the paired methods share argument and return types. This approach enhances the feasibility and realism of mutants by considering factors such as library methods, access control, inheritance, and static methods. Experimental results show that integrating MIN into Major (a popular mutation tool) improves semantic similarity to real defects by 11%, increases mutant-set diversity to 97.5%, and reduces undetected faults by 38.5%. Furthermore, MIN's performance rivals that of state-of-the-art machine-learning-based mutators like CodeBERT, with a 10x speed advantage over CodeBERT and 4x over DeepMutation in generating compilable mutants. These findings demonstrate that MIN can significantly enhance defect simulation and improve the efficiency of mutation testing.
Vol. 51, No. 7, pp. 2125-2143.
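The type-compatible mapping at the heart of MIN can be illustrated with a toy version: a call site may only be mutated to another method whose parameter and return types match. The real mutator additionally accounts for library methods, access control, inheritance, and static methods:

    from collections import defaultdict

    def signature_groups(methods):
        """methods: name -> (parameter types, return type)."""
        groups = defaultdict(list)
        for name, sig in methods.items():
            groups[sig].append(name)
        return groups

    def invocation_mutants(call, methods):
        # Candidate replacements share the full signature of the original call.
        return [m for m in signature_groups(methods)[methods[call]] if m != call]

    methods = {"max": (("int", "int"), "int"),
               "min": (("int", "int"), "int"),
               "abs": (("int",), "int")}
    print(invocation_mutants("max", methods))  # -> ['min']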
Citations: 0
Boosting Generalizable Fairness With Mahalanobis Distances Guided Boltzmann Exploratory Testing
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-19 DOI: 10.1109/TSE.2025.3581402
Kaixiang Dong;Peng Wu;Yanting Chen
Abstract: Although machine learning models have been remarkably effective for decision-making tasks such as employment, insurance, and criminal justice, it remains urgent yet challenging to ensure that model predictions are reliable and socially fair. This amounts to extensively detecting and repairing potential discriminatory defects of machine learning models with authentic testing data. In this paper, we propose MAEFT, a novel Mahalanobis distance guided Adaptive Exploratory Fairness Testing approach, which searches for individual discriminatory instances (IDIs) through deep reinforcement learning with an adaptive extension of Boltzmann exploration, significantly reducing overestimation. MAEFT uses Mahalanobis distances to guide the search with realistic correlations between input features. By learning a more accurate state-action value approximation, MAEFT can reach a much wider valid input space, sharply reduce the number of duplicate instances visited, and identify more unique tests and IDIs calibrated to realistic feature correlations. Compared with state-of-the-art black-box and white-box fairness testing methods, our approach generates on average 4.65%-161.66% more unique tests and identifies 154.60%-634.80% more IDIs, with a performance speed-up of 12.54%-1313.47%. Moreover, the IDIs identified by MAEFT can be exploited to repair the original models through retraining: they lead to, on average, a 59.15% boost in model fairness, 15.94%-48.73% higher than the boost from IDIs identified by state-of-the-art fairness testing methods. The models retrained with MAEFT also exhibit 37.66%-46.81% stronger generalization ability than those retrained with state-of-the-art fairness testing methods.
Vol. 51, No. 8, pp. 2213-2231.
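The two ingredients the title names can be sketched in isolation: Boltzmann (softmax) exploration over the Q-values of perturbation actions, and a Mahalanobis distance computed from the input-feature covariance so that search moves respect realistic feature correlations. How MAEFT adapts the temperature and wires these into its reinforcement-learning loop is beyond this sketch:

    import numpy as np

    def boltzmann_choice(q_values, temperature, rng):
        # Softmax over Q-values: higher-valued actions are favored, but every
        # action keeps non-zero probability, preserving exploration.
        logits = np.asarray(q_values) / temperature
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        return int(rng.choice(len(probs), p=probs))

    def mahalanobis(x, y, cov):
        # Distance weighted by the inverse feature covariance, so correlated
        # features are not perturbed as if they were independent.
        d = np.asarray(x) - np.asarray(y)
        return float(np.sqrt(d @ np.linalg.inv(cov) @ d))

    rng = np.random.default_rng(0)
    print(boltzmann_choice([1.0, 2.0, 0.5], temperature=0.5, rng=rng))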
Citations: 0
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-18 DOI: 10.1109/TSE.2025.3581062
André Silva;Sen Fang;Martin Monperrus
Abstract: Automated Program Repair (APR) has evolved significantly with the advent of Large Language Models (LLMs). Fine-tuning LLMs for program repair is a recent avenue of research with many unexplored dimensions; existing work mostly fine-tunes LLMs with naive code representations and does not scale to frontier models. To address this problem, we propose RepairLLaMA, a novel program repair approach that 1) identifies optimal code representations for APR with fine-tuned models, and 2) pioneers a state-of-the-art parameter-efficient fine-tuning (PEFT) technique for program repair. The result is a highly effective "program repair adapter" for fixing bugs with AI. Our experiments demonstrate the validity of both concepts. First, fine-tuning adapters with repair-specific code representations enables the model to use meaningful repair signals and produce better patches. Second, parameter-efficient fine-tuning helps training converge and clearly contributes to RepairLLaMA's effectiveness in fixing bugs outside the fine-tuning data distribution. Overall, RepairLLaMA correctly fixes 144 Defects4J v2, 109 HumanEval-Java, and 20 GitBug-Java bugs, outperforming all baselines.
Vol. 51, No. 8, pp. 2366-2380. Open Access PDF: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11039501
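Attaching a parameter-efficient "repair adapter" to a code LLM can be sketched with the Hugging Face peft library; the model name and hyperparameters below are illustrative, not the paper's exact configuration:

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("codellama/CodeLlama-7b-hf")
    config = LoraConfig(
        r=8,                      # low-rank dimension: few trainable weights
        lora_alpha=16,
        target_modules=["q_proj", "v_proj"],
        lora_dropout=0.05,
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(base, config)
    model.print_trainable_parameters()  # only the adapter weights train
    # Fine-tuning then proceeds on pairs of (buggy code in a repair-specific
    # representation, fixed code), which is where the paper locates its gains.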
Citations: 0
Malo in the Code Jungle: Explainable Fault Localization for Decentralized Applications
IF 6.5 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-13 DOI: 10.1109/TSE.2025.3578816
Hui Zhang;Jiajing Wu;Zhiying Wu;Zhe Chen;Dan Lin;Jiachi Chen;Yuren Zhou;Zibin Zheng
Abstract: Decentralized applications (DApps) have long been sitting ducks for hackers due to their valuable cryptocurrency assets, exposing them to various security risks. When a DApp is attacked, promptly identifying faults is crucial to minimizing financial losses and enabling effective fault repair. However, existing fault localization methods, which mostly rely on code coverage, often fall short for DApps, particularly when only one fault case is available. Furthermore, according to a prior survey, most developers expect fault localization tools to provide reasonable explanations. In this paper, we present Malo, a method for DApp-specific explainable fault localization. It identifies fault functions through suspicious token transfer-guided analysis, then employs Large Language Models (LLMs) to generate explanations for the identified fault functions. Specifically, Malo examines function call traces and the source code of fault cases to acquire internal knowledge, and retrieves relevant project documents from the Web to obtain external knowledge. By integrating the two, Malo generates reasonable explanations for faults in DApps. Our evaluation on a dataset of 68 real-world DApp faults shows that Malo locates 62% of faults within the Top-5, 9% higher than the state-of-the-art method. The experimental results also show a remarkable 71% alignment accuracy between the explanations generated by Malo and the ground truth. In addition, a user study confirms that the explanations generated by Malo help developers comprehend the root causes of faults. Our code and dataset are available online: https://github.com/SodalimeZero/Malo_Code.git
Vol. 51, No. 7, pp. 2197-2210.
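A toy rendition of the suspicious token transfer-guided step: functions whose calls coincide with unusually large outbound transfers are flagged and handed to the LLM explanation stage. The record fields and threshold are invented for illustration and are far simpler than Malo's actual analysis:

    def suspicious_functions(trace, threshold):
        """trace: per-call records like {'fn': ..., 'token_out': ...}."""
        flagged = {}
        for call in trace:
            if call["token_out"] > threshold:
                prev = flagged.get(call["fn"], 0.0)
                flagged[call["fn"]] = max(prev, call["token_out"])
        # Rank by the largest transfer each function participated in.
        return sorted(flagged, key=flagged.get, reverse=True)

    trace = [{"fn": "withdraw", "token_out": 9_500_000.0},
             {"fn": "deposit",  "token_out": 0.0}]
    print(suspicious_functions(trace, threshold=1e6))  # -> ['withdraw']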
Citations: 0
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-12 DOI: 10.1109/TSE.2025.3579574
Partha Chakraborty;Mahmoud Alfadel;Meiyappan Nagappan
Abstract: Software bugs require developers to expend significant effort to identify and resolve, often consuming about one-third of their time. Bug localization, the process of pinpointing the exact source code files that need modification, is crucial to reducing this effort. Existing bug localization tools, typically reliant on deep learning techniques, face limitations in both cross-project applicability and multi-language environments. Recent advances in Large Language Models (LLMs) offer detailed representations for bug localization that may help overcome these limitations; however, such models are known to struggle with 1) limited context windows and 2) mapping accuracy. To address these challenges, we propose BLAZE, an approach that employs dynamic chunking and hard example learning. First, BLAZE dynamically segments source code to minimize continuity loss. Then, BLAZE fine-tunes a GPT-based model on complex bug reports to enhance cross-project and cross-language bug localization. To support BLAZE, we create the BeetleBox dataset, comprising 23,782 bugs from 29 large, thriving open-source projects across five programming languages (Java, C++, Python, Go, and JavaScript). Our evaluation of BLAZE on three benchmark datasets (BeetleBox, SWE-Bench, and Ye et al.) demonstrates substantial improvements over six state-of-the-art baselines: up to a 120% increase in Top-1 accuracy, 144% in Mean Average Precision (MAP), and 100% in Mean Reciprocal Rank (MRR). Furthermore, an extensive ablation study confirms the contributions of our pipeline components to the overall performance enhancement.
Vol. 51, No. 8, pp. 2254-2267.
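The dynamic-chunking idea can be sketched as splitting a source file at natural boundaries so each chunk fits a model's context window with minimal continuity loss. Blank-line splitting and the whitespace token counter are simplifications; BLAZE's actual segmentation is more sophisticated:

    def dynamic_chunks(source: str, max_tokens: int,
                       count_tokens=lambda s: len(s.split())):
        chunks, current = [], []
        for block in source.split("\n\n"):           # candidate split points
            candidate = "\n\n".join(current + [block])
            if current and count_tokens(candidate) > max_tokens:
                chunks.append("\n\n".join(current))  # close the chunk early
                current = [block]
            else:
                current.append(block)
        if current:
            chunks.append("\n\n".join(current))
        return chunks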
Citations: 0
MBL-CPDP: A Multi-Objective Bilevel Method for Cross-Project Defect Prediction
IF 5.6 | CAS Q1 (Computer Science)
IEEE Transactions on Software Engineering Pub Date: 2025-06-10 DOI: 10.1109/TSE.2025.3577808
Jiaxin Chen;Jinliang Ding;Kay Chen Tan;Jiancheng Qian;Ke Li
Abstract: Cross-project defect prediction (CPDP) leverages machine learning (ML) techniques to proactively identify software defects, especially where project-specific data is scarce. However, existing CPDP approaches suffer from three critical limitations: ineffective exploration of high-dimensional parameter spaces, poor adaptability across diverse projects with heterogeneous data distributions, and inadequate handling of feature redundancy and distribution discrepancies between source and target projects. To address these challenges, we formulate CPDP as a multi-objective bilevel optimization (MBLO) problem and propose a method dubbed MBL-CPDP. Our approach comprises two nested problems: the upper-level problem, a multi-objective combinatorial optimization problem, enhances robustness by optimizing ML pipelines that integrate feature selection, transfer learning, and classification techniques, while the lower-level problem fine-tunes their hyperparameters. Unlike traditional methods that employ fragmented optimization strategies, or single-objective approaches that introduce bias, MBL-CPDP provides a holistic, end-to-end optimization framework. Additionally, we propose an ensemble learning method to better capture cross-project distribution differences and improve generalization across diverse datasets. An MBLO algorithm is then presented to effectively solve the formulated problem. To evaluate MBL-CPDP's performance, we compare it with five automated ML tools and 50 CPDP techniques across 20 projects. Extensive empirical results show that MBL-CPDP outperforms the comparison methods, demonstrating superior adaptability and comprehensive performance.
Vol. 51, No. 8, pp. 2305-2328.
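The nested structure can be sketched as follows: the upper level searches over pipeline combinations (feature selection x transfer learning x classifier) and keeps a Pareto front over multiple objectives, while the lower level tunes each pipeline's hyperparameters. Exhaustive enumeration and weak dominance stand in for the paper's multi-objective evolutionary search:

    import itertools

    def lower_level(pipeline, budget=20):
        """Hypothetical hyperparameter tuning for one pipeline; returns
        (best_params, objective scores to maximize)."""
        raise NotImplementedError

    def weakly_dominates(a, b):
        return all(x >= y for x, y in zip(a, b))

    def upper_level(feature_selectors, transfer_methods, classifiers):
        front = []  # non-dominated (pipeline, params, scores) triples
        for pipeline in itertools.product(feature_selectors,
                                          transfer_methods, classifiers):
            params, scores = lower_level(pipeline)
            front = [f for f in front if not weakly_dominates(scores, f[2])]
            if not any(weakly_dominates(f[2], scores) for f in front):
                front.append((pipeline, params, scores))
        return front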
Citations: 0