Leveraging multi-task learning to fine-tune RoBERTa for self-admitted technical debt identification and classification

IF 4.1 2区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Systems and Software Pub Date : 2025-09-15 DOI:10.1016/j.jss.2025.112629

Yihang Xu, Dongjin Yu, Xin Chen, Quanxin Yang, Sixuan Wang, Wangliang Yan

{"title":"Leveraging multi-task learning to fine-tune RoBERTa for self-admitted technical debt identification and classification","authors":"Yihang Xu, Dongjin Yu, Xin Chen, Quanxin Yang, Sixuan Wang, Wangliang Yan","doi":"10.1016/j.jss.2025.112629","DOIUrl":null,"url":null,"abstract":"<div><div>Self-Admitted Technical Debt (SATD) detection aims to identify whether a code comment explicitly admits technical debt and classify its specific type. Existing research largely treats identification and classification as separate tasks, with classification-focused approaches suffering from Out-Of-Vocabulary (OOV) issues and relatively low macro-averaged <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span>-score. To address these challenges, this paper presents FRoM, a unified and efficient approach that integrates SATD identification and classification into a single pipeline. Specifically, FRoM employs a byte-level tokenizer to effectively mitigate OOV problems and leverages multi-task learning to fine-tune a pre-trained model for improved classification performance. Additionally, FRoM incorporates a novel undersampling technique to remove semantically similar non-SATD samples, reducing the time required for fine-tuning. Empirical evaluations on two datasets, comprising 38,902 and 2,528 comments respectively, demonstrate that FRoM achieves state-of-the-art performance in both identification and classification tasks. Furthermore, a case study highlights that our deployed tool, FRoMD, exhibits competitive performance compared to ChatGPT-4o. The dataset and the code are available at <span><span>https://github.com/HduDBSI/FRoM</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"231 ","pages":"Article 112629"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225002985","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Self-Admitted Technical Debt (SATD) detection aims to identify whether a code comment explicitly admits technical debt and classify its specific type. Existing research largely treats identification and classification as separate tasks, with classification-focused approaches suffering from Out-Of-Vocabulary (OOV) issues and relatively low macro-averaged

F_{1}

-score. To address these challenges, this paper presents FRoM, a unified and efficient approach that integrates SATD identification and classification into a single pipeline. Specifically, FRoM employs a byte-level tokenizer to effectively mitigate OOV problems and leverages multi-task learning to fine-tune a pre-trained model for improved classification performance. Additionally, FRoM incorporates a novel undersampling technique to remove semantically similar non-SATD samples, reducing the time required for fine-tuning. Empirical evaluations on two datasets, comprising 38,902 and 2,528 comments respectively, demonstrate that FRoM achieves state-of-the-art performance in both identification and classification tasks. Furthermore, a case study highlights that our deployed tool, FRoMD, exhibits competitive performance compared to ChatGPT-4o. The dataset and the code are available at https://github.com/HduDBSI/FRoM.

查看原文本刊更多论文

利用多任务学习对RoBERTa进行微调，以实现自我承认的技术债务识别和分类

自我承认的技术债务（SATD）检测旨在识别代码注释是否明确承认技术债务，并对其特定类型进行分类。现有的研究大多将识别和分类视为独立的任务，以分类为重点的方法存在词汇外（OOV）问题，宏观平均f1分数相对较低。为了应对这些挑战，本文提出了一种统一而有效的方法FRoM，该方法将SATD识别和分类集成到单个管道中。具体来说，FRoM使用字节级标记器来有效地缓解OOV问题，并利用多任务学习来微调预训练的模型，以提高分类性能。此外，FRoM还采用了一种新颖的欠采样技术来去除语义相似的非satd样本，从而减少了微调所需的时间。对两个数据集（分别包含38,902和2,528条评论）的实证评估表明，FRoM在识别和分类任务中都达到了最先进的性能。此外，一个案例研究强调，与chatgpt - 40相比，我们部署的工具FRoMD表现出了具有竞争力的性能。数据集和代码可在https://github.com/HduDBSI/FRoM上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Systems and Software 工程技术-计算机：理论方法

CiteScore

8.60

自引率

5.70%

发文量

193

审稿时长

16 weeks

期刊介绍： The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: •Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution •Agile, model-driven, service-oriented, open source and global software development •Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems •Human factors and management concerns of software development •Data management and big data issues of software systems •Metrics and evaluation, data mining of software development resources •Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.