Studying SATD in drone systems with Human-AI collaboration

IF 4.1 2区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Systems and Software Pub Date : 2025-09-12 DOI:10.1016/j.jss.2025.112625

Leevi Rantala , Lwin Khin Shar , Mika V. Mäntylä , Wei Minn , Yan Naing Tun

{"title":"Studying SATD in drone systems with Human-AI collaboration","authors":"Leevi Rantala , Lwin Khin Shar , Mika V. Mäntylä , Wei Minn , Yan Naing Tun","doi":"10.1016/j.jss.2025.112625","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>Self-Admitted Technical Debt (SATD) refers to sub-optimal solutions that developers acknowledge within the source code. SATD research originated on Java projects but is expanding to other domains. We focus on SATD in drones, which are used for various critical tasks.</div></div><div><h3>Aims:</h3><div>The primary objective is to investigate SATD in drone systems. The second aim is to explore the integration of AI and human collaboration for SATD labelling and classification.</div></div><div><h3>Method:</h3><div>Method: We conducted a sample study of SATD comments in drone systems (14 open source, 4 SDKs) to analyse the quantity and types of SATD comments present. Our study incorporates collaboration between AI and humans by utilising LLM for SATD classification. Additionally, we classified a sample of 385 SATD comments as either drone-specific or non-drone-specific.</div></div><div><h3>Results:</h3><div>The most prevalent SATD categories in drone software are Code Debt (35%), Unclassifiable Debt (16%), and Design Debt (15%). We found that 22% of SATD is specific to drones. Drone-specific SATD is proportionally more focused on Requirements and Design Debt compared to non-drone-specific SATD. We found that using both human and LLM for SATD classification can improve accuracy, as both LLM and human revised their initial ratings. After two rounds, a “near-perfect agreement” (Fleiss’ kappa 0.83) was achieved.</div></div><div><h3>Conclusions:</h3><div>Future studies should investigate whether our observation that domain-specific (drone) SATD comments relate more to Requirement Debt holds true in other domains. We propose a workflow that integrates AI into classification tasks, enhancing the accuracy of both human and AI classifications.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"231 ","pages":"Article 112625"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225002948","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Background:

Self-Admitted Technical Debt (SATD) refers to sub-optimal solutions that developers acknowledge within the source code. SATD research originated on Java projects but is expanding to other domains. We focus on SATD in drones, which are used for various critical tasks.

Aims:

The primary objective is to investigate SATD in drone systems. The second aim is to explore the integration of AI and human collaboration for SATD labelling and classification.

Method:

Method: We conducted a sample study of SATD comments in drone systems (14 open source, 4 SDKs) to analyse the quantity and types of SATD comments present. Our study incorporates collaboration between AI and humans by utilising LLM for SATD classification. Additionally, we classified a sample of 385 SATD comments as either drone-specific or non-drone-specific.

Results:

The most prevalent SATD categories in drone software are Code Debt (35%), Unclassifiable Debt (16%), and Design Debt (15%). We found that 22% of SATD is specific to drones. Drone-specific SATD is proportionally more focused on Requirements and Design Debt compared to non-drone-specific SATD. We found that using both human and LLM for SATD classification can improve accuracy, as both LLM and human revised their initial ratings. After two rounds, a “near-perfect agreement” (Fleiss’ kappa 0.83) was achieved.

Conclusions:

Future studies should investigate whether our observation that domain-specific (drone) SATD comments relate more to Requirement Debt holds true in other domains. We propose a workflow that integrates AI into classification tasks, enhancing the accuracy of both human and AI classifications.

查看原文本刊更多论文

基于人机协作的无人机系统SATD研究

背景：自我承认的技术债务（SATD）是指开发人员在源代码中承认的次优解决方案。SATD研究起源于Java项目，但正在扩展到其他领域。我们专注于无人机的SATD，用于各种关键任务。目的：主要目的是研究无人机系统中的SATD。第二个目标是探索人工智能和人类协作的集成，用于SATD标签和分类。方法：我们对无人机系统（14个开源，4个sdk）中的SATD评论进行了抽样研究，以分析目前SATD评论的数量和类型。我们的研究结合了人工智能和人类之间的合作，利用LLM进行SATD分类。此外，我们将385个SATD评论样本分类为无人机特定或非无人机特定。结果：无人机软件中最常见的SATD类别是代码债（35%）、不可分类债（16%）和设计债（15%）。我们发现22%的SATD是无人机特有的。与非无人机特定的SATD相比，无人机特定的SATD在比例上更关注需求和设计债。我们发现，同时使用人类和LLM进行SATD分类可以提高准确性，因为LLM和人类都修改了他们的初始评级。经过两轮谈判，双方达成了“近乎完美的协议”（Fleiss kappa 0.83）。结论：未来的研究应该调查我们关于领域特定的（无人机）SATD评论与需求债更相关的观察是否在其他领域成立。我们提出了一种将人工智能集成到分类任务中的工作流程，提高了人类和人工智能分类的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Systems and Software 工程技术-计算机：理论方法

CiteScore

8.60

自引率

5.70%

发文量

193

审稿时长

16 weeks

期刊介绍： The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: •Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution •Agile, model-driven, service-oriented, open source and global software development •Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems •Human factors and management concerns of software development •Data management and big data issues of software systems •Metrics and evaluation, data mining of software development resources •Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.