{"title":"Text–image fusion template for large language model assisted crowdsourcing test aggregation","authors":"Yunfeng Zhu, Shengcheng Yu, Zhaowei Zong, Yue Wang, Yuan Zhao, Zhenyu Chen","doi":"10.1016/j.jss.2025.112478","DOIUrl":null,"url":null,"abstract":"<div><div>Mobile crowdsourced testing leverages a varied group to enhance software quality through screenshots and text feedback. Examining the multitude of reports is tedious but crucial, often necessitating a combined analysis of both visual and textual information. However, professionals employ detailed judgment beyond mere similarity, which poses a challenge given the limited textual data and abundance of images in the reports.</div><div>We introduce a framework that guides large language models to handle missing data and inconsistencies in crowdsourced reports by using a triplet template <span><math><mrow><mo>〈</mo></mrow></math></span> Scene, Operation, Defect <span><math><mrow><mo>〉</mo></mrow></math></span> for bug identification. The framework leverages the element independence of the triplet for clustering ensemble and designs an algorithm to generate potential operation paths, aggregating reports within the cluster through constructed graphs. Our method, validated on 5115 reports, employs a clustering ensemble and graph aggregation, improving the clustering V-measure to 0.722. It also reduces the annotation time per report by 39. 3%, thereby improving the quality of the tagging. Source code available at <span><span>https://github.com/Boomnana/Text-Image-Fusion</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"228 ","pages":"Article 112478"},"PeriodicalIF":3.7000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225001463","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Mobile crowdsourced testing leverages a varied group to enhance software quality through screenshots and text feedback. Examining the multitude of reports is tedious but crucial, often necessitating a combined analysis of both visual and textual information. However, professionals employ detailed judgment beyond mere similarity, which poses a challenge given the limited textual data and abundance of images in the reports.
We introduce a framework that guides large language models to handle missing data and inconsistencies in crowdsourced reports by using a triplet template Scene, Operation, Defect for bug identification. The framework leverages the element independence of the triplet for clustering ensemble and designs an algorithm to generate potential operation paths, aggregating reports within the cluster through constructed graphs. Our method, validated on 5115 reports, employs a clustering ensemble and graph aggregation, improving the clustering V-measure to 0.722. It also reduces the annotation time per report by 39. 3%, thereby improving the quality of the tagging. Source code available at https://github.com/Boomnana/Text-Image-Fusion.
期刊介绍:
The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to:
•Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution
•Agile, model-driven, service-oriented, open source and global software development
•Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems
•Human factors and management concerns of software development
•Data management and big data issues of software systems
•Metrics and evaluation, data mining of software development resources
•Business and economic aspects of software development processes
The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.