{"title":"Best practices for evaluating IRFL approaches","authors":"Thomas Hirsch, Birgit Hofer","doi":"10.1016/j.jss.2025.112342","DOIUrl":null,"url":null,"abstract":"<div><div>Information retrieval fault localization (IRFL) is a popular research field and many IRFL approaches have been proposed recently. Unfortunately, the evaluation of some of these IRFL approaches is often too simplistic, which can cause an overestimation of performance of these approaches. In this paper, we discuss evaluation pitfalls and problems. Furthermore, we propose best practices to avoid them. In detail, we discuss evaluation strategies such as parameter tuning and temporal dependencies in the data, dataset issues, metrics, statistical significance testing, and the unavailability of supplemental material. To support our claim of the poor status quo of current evaluation practices in some research papers, we have performed a literature survey on 135 papers. We hope that this paper will help researchers to avoid the described pitfalls in their evaluation of IRFL approaches.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"222 ","pages":"Article 112342"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016412122500010X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
Abstract
Information retrieval fault localization (IRFL) is a popular research field, and many IRFL approaches have been proposed recently. Unfortunately, the evaluation of some of these approaches is too simplistic, which can lead to an overestimation of their performance. In this paper, we discuss evaluation pitfalls and problems, and we propose best practices to avoid them. In particular, we discuss evaluation strategies such as parameter tuning and temporal dependencies in the data, dataset issues, metrics, statistical significance testing, and the unavailability of supplemental material. To support our claim about the poor status quo of evaluation practices in some research papers, we performed a literature survey of 135 papers. We hope that this paper will help researchers avoid the described pitfalls when evaluating IRFL approaches.
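Two of the pitfalls the abstract names lend themselves to a short code illustration: temporal dependencies in the data and statistical significance testing. The following Python sketch is not taken from the paper; the bug-report fields, cutoff date, and score values are all hypothetical, invented only to show the shape of these two practices.

```python
# Minimal sketch (not from the paper): a chronological train/test split
# that respects temporal dependencies between bug reports, and a paired
# significance test when comparing two IRFL approaches. All data below
# is hypothetical.
from datetime import datetime
from statistics import mean

from scipy.stats import wilcoxon

# Hypothetical bug reports with their filing dates.
bug_reports = [
    {"id": "BUG-1", "filed": datetime(2023, 1, 10)},
    {"id": "BUG-2", "filed": datetime(2023, 3, 2)},
    {"id": "BUG-3", "filed": datetime(2023, 6, 21)},
    {"id": "BUG-4", "filed": datetime(2023, 9, 5)},
]

# Temporal split: tune parameters only on reports filed before the cutoff
# and evaluate on the later ones. A random split would leak future
# knowledge (e.g., files that did not yet exist) into the tuning phase.
cutoff = datetime(2023, 6, 1)
tuning_set = [r for r in bug_reports if r["filed"] < cutoff]
test_set = [r for r in bug_reports if r["filed"] >= cutoff]

# Invented per-bug average-precision scores of two approaches over a
# larger hypothetical test set; scoring the same bugs with both
# approaches is what justifies a paired test.
ap_approach_a = [0.50, 0.33, 1.00, 0.20, 0.25, 0.50]
ap_approach_b = [0.25, 0.20, 0.50, 0.10, 0.33, 0.25]

# Report the metric (here MAP) together with a paired significance test,
# rather than comparing mean scores alone.
print(f"MAP A: {mean(ap_approach_a):.3f}, MAP B: {mean(ap_approach_b):.3f}")
statistic, p_value = wilcoxon(ap_approach_a, ap_approach_b)
print(f"Wilcoxon signed-rank p-value: {p_value:.3f}")
```

The chronological split mirrors deployment: an IRFL tool only ever ranks files for bug reports filed after the data its parameters were tuned on.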
Journal Introduction:
The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to:
•Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution
•Agile, model-driven, service-oriented, open source and global software development
•Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems
•Human factors and management concerns of software development
•Data management and big data issues of software systems
•Metrics and evaluation, data mining of software development resources
•Business and economic aspects of software development processes
The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.