{"title":"Fairness on a budget, across the board: A cost-effective evaluation of fairness-aware practices across contexts, tasks, and sensitive attributes","authors":"Alessandra Parziale , Gianmario Voria , Giammaria Giordano , Gemma Catolino , Gregorio Robles , Fabio Palomba","doi":"10.1016/j.infsof.2025.107858","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><div>Machine Learning (ML) is widely used in critical domains like finance, healthcare, and criminal justice, where unfair predictions can lead to harmful outcomes. Although bias mitigation techniques have been developed by the Software Engineering (SE) community, their practical adoption is limited due to complexity and integration issues. As a simpler alternative, fairness-aware practices, namely conventional ML engineering techniques adapted to promote fairness, e.g., MinMax Scaling, which normalizes feature values to prevent attributes linked to sensitive groups from disproportionately influencing predictions, have recently been proposed, yet their actual impact is still unexplored.</div></div><div><h3>Objective:</h3><div>Building on our prior work that explored fairness-aware practices in different contexts, this paper extends the investigation through a large-scale empirical study assessing their effectiveness across diverse ML tasks, sensitive attributes, and datasets belonging to specific application domains.</div></div><div><h3>Methods:</h3><div>We conduct 5940 experiments, evaluating fairness-aware practices from two perspectives: <em>contextual bias mitigation</em> and <em>cost-effectiveness</em>. Contextual evaluation examines fairness improvements across different ML models, sensitive attributes, and datasets. Cost-effectiveness analysis considers the trade-off between fairness gains and performance costs.</div></div><div><h3>Results:</h3><div>Findings reveal that the effectiveness of fairness-aware practices depends on specific contexts’ datasets and configurations, while cost-effectiveness analysis highlights those that best balance ethical gains and efficiency.</div></div><div><h3>Conclusion:</h3><div>These insights guide practitioners in choosing fairness-enhancing practices with minimal performance impact, supporting ethical ML development.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"188 ","pages":"Article 107858"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950584925001971","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Citations: 0
Abstract
Context:
Machine Learning (ML) is widely used in critical domains such as finance, healthcare, and criminal justice, where unfair predictions can lead to harmful outcomes. Although the Software Engineering (SE) community has developed bias mitigation techniques, their practical adoption remains limited due to complexity and integration issues. Fairness-aware practices have recently been proposed as a simpler alternative: conventional ML engineering techniques adapted to promote fairness, such as MinMax Scaling, which normalizes feature values so that attributes linked to sensitive groups do not disproportionately influence predictions. Their actual impact, however, is still unexplored.
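As a minimal sketch of such a practice, the snippet below applies MinMax Scaling with scikit-learn; the column names and toy data are hypothetical illustrations, not the datasets or pipeline used in the study.

```python
# Minimal sketch of MinMax Scaling as a fairness-aware practice.
# Column names and data are hypothetical; assumes scikit-learn and pandas.
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Toy dataset: 'income' spans a much larger numeric range than 'age',
# so it could dominate distance- or weight-based models.
df = pd.DataFrame({
    "age": [22, 35, 58, 41],
    "income": [18_000, 95_000, 40_000, 120_000],
})

# Rescale every feature to [0, 1] so that no single attribute
# disproportionately influences the model's predictions.
scaler = MinMaxScaler()
df[["age", "income"]] = scaler.fit_transform(df[["age", "income"]])
print(df)
```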
Objective:
Building on our prior work that explored fairness-aware practices in different contexts, this paper extends that investigation through a large-scale empirical study assessing their effectiveness across diverse ML tasks, sensitive attributes, and datasets drawn from specific application domains.
Methods:
We conduct 5940 experiments, evaluating fairness-aware practices from two perspectives: contextual bias mitigation and cost-effectiveness. Contextual evaluation examines fairness improvements across different ML models, sensitive attributes, and datasets. Cost-effectiveness analysis considers the trade-off between fairness gains and performance costs.
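As a rough illustration of the cost-effectiveness perspective, the sketch below contrasts a common fairness metric (statistical parity difference) with accuracy before and after a practice is applied. The metric choice and the simple gain-to-cost ratio are our assumptions for illustration, not necessarily the measures used in the paper.

```python
# Hedged sketch of a fairness-vs-performance trade-off check.
# Statistical parity difference (SPD) and the gain/cost ratio below are
# illustrative choices, not necessarily the paper's exact metrics.
import numpy as np

def statistical_parity_difference(y_pred, sensitive):
    """P(y=1 | unprivileged) - P(y=1 | privileged); 0 means parity."""
    y_pred, sensitive = np.asarray(y_pred), np.asarray(sensitive)
    return y_pred[sensitive == 0].mean() - y_pred[sensitive == 1].mean()

def cost_effectiveness(spd_before, spd_after, acc_before, acc_after, eps=1e-9):
    """Fairness improvement per unit of accuracy lost (hypothetical ratio)."""
    gain = abs(spd_before) - abs(spd_after)   # closer to 0 is fairer
    cost = max(acc_before - acc_after, 0.0)   # accuracy sacrificed
    return gain / (cost + eps)

# Toy numbers: predictions and sensitive-group membership before and after
# applying a fairness-aware practice.
spd_before = statistical_parity_difference([1, 0, 1, 1], [1, 1, 0, 0])
spd_after = statistical_parity_difference([1, 0, 1, 0], [1, 1, 0, 0])
print(cost_effectiveness(spd_before, spd_after, acc_before=0.90, acc_after=0.88))
```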
Results:
Findings reveal that the effectiveness of fairness-aware practices depends on the dataset and configuration of each specific context, while the cost-effectiveness analysis highlights the practices that best balance ethical gains and efficiency.
Conclusion:
These insights guide practitioners in choosing fairness-enhancing practices with minimal performance impact, supporting ethical ML development.
About the journal:
Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal's scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include:
• Software management, quality and metrics
• Software processes
• Software architecture, modelling, specification, design and programming
• Functional and non-functional software requirements
• Software testing and verification & validation
• Empirical studies of all aspects of engineering and managing software development
Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "negative" results, and more. Read the Guide for Authors for more information.
The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premier outlet for systematic literature studies in software engineering.