Mitigation Techniques to Overcome Data Harm in Model Building for ML

A. Arslan
{"title":"Mitigation Techniques to Overcome Data Harm in Model Building for ML","authors":"A. Arslan","doi":"10.5121/csit.2021.111916","DOIUrl":null,"url":null,"abstract":"Given the impact of Machine Learning (ML) on individuals and the society, understanding how harm might be occur throughout the ML life cycle becomes critical more than ever. By offering a framework to determine distinct potential sources of downstream harm in ML pipeline, the paper demonstrates the importance of choices throughout distinct phases of data collection, development, and deployment that extend far beyond just model training. Relevant mitigation techniques are also suggested for being used instead of merely relying on generic notions of what counts as fairness.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NLP Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2021.111916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Given the impact of Machine Learning (ML) on individuals and society, understanding how harm might occur throughout the ML life cycle has become more critical than ever. By offering a framework for identifying distinct potential sources of downstream harm in the ML pipeline, the paper demonstrates the importance of choices made throughout the distinct phases of data collection, development, and deployment, which extend far beyond model training alone. Relevant mitigation techniques are also suggested, rather than relying merely on generic notions of what counts as fairness.
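The abstract does not spell out the mitigation techniques themselves, but as an illustration of the data-stage interventions it alludes to, below is a minimal sketch of one well-known technique in this family: reweighing training examples so that a protected attribute and the label look statistically independent before model training, in the spirit of Kamiran and Calders' reweighing. The toy data, the column names (`group`, `y`), and the use of numpy/pandas are assumptions for illustration, not the paper's own method.

```python
# Hypothetical sketch of a data-stage mitigation: reweighing examples so
# that the protected attribute `group` is statistically independent of
# the label `y` under the induced weights (Kamiran & Calders-style).
# All names and data here are illustrative assumptions.
import numpy as np
import pandas as pd

# Toy dataset: binary protected attribute and binary label.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "group": rng.integers(0, 2, size=1000),
    "y": rng.integers(0, 2, size=1000),
})

# Marginal and joint frequencies observed in the data.
p_group = df["group"].value_counts(normalize=True)
p_label = df["y"].value_counts(normalize=True)
p_joint = df.groupby(["group", "y"]).size() / len(df)

# Each example's weight is the ratio of the frequency expected under
# independence to the observed joint frequency:
#   w(g, y) = P(group=g) * P(y=y) / P(group=g, y=y)
weights = df.apply(
    lambda row: p_group[row["group"]] * p_label[row["y"]]
    / p_joint[(row["group"], row["y"])],
    axis=1,
)

# Under these weights, each (group, label) cell carries the mass it
# would have if group and label were independent.
print(weights.groupby([df["group"], df["y"]]).mean())
```

The resulting per-example weights can then be passed to most estimators (for example, scikit-learn's `fit(X, y, sample_weight=weights)`), so that under-represented (group, label) cells count proportionally more during training instead of the raw data being altered.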