{"title":"克服机器学习模型构建中数据危害的缓解技术","authors":"A. Arslan","doi":"10.5121/csit.2021.111916","DOIUrl":null,"url":null,"abstract":"Given the impact of Machine Learning (ML) on individuals and the society, understanding how harm might be occur throughout the ML life cycle becomes critical more than ever. By offering a framework to determine distinct potential sources of downstream harm in ML pipeline, the paper demonstrates the importance of choices throughout distinct phases of data collection, development, and deployment that extend far beyond just model training. Relevant mitigation techniques are also suggested for being used instead of merely relying on generic notions of what counts as fairness.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mitigation Techniques to Overcome Data Harm in Model Building for ML\",\"authors\":\"A. Arslan\",\"doi\":\"10.5121/csit.2021.111916\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given the impact of Machine Learning (ML) on individuals and the society, understanding how harm might be occur throughout the ML life cycle becomes critical more than ever. By offering a framework to determine distinct potential sources of downstream harm in ML pipeline, the paper demonstrates the importance of choices throughout distinct phases of data collection, development, and deployment that extend far beyond just model training. Relevant mitigation techniques are also suggested for being used instead of merely relying on generic notions of what counts as fairness.\",\"PeriodicalId\":193651,\"journal\":{\"name\":\"NLP Techniques and Applications\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NLP Techniques and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/csit.2021.111916\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NLP Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2021.111916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mitigation Techniques to Overcome Data Harm in Model Building for ML
Given the impact of Machine Learning (ML) on individuals and society, understanding how harm might occur throughout the ML life cycle has become more critical than ever. By offering a framework for identifying distinct potential sources of downstream harm in the ML pipeline, the paper demonstrates the importance of choices made throughout the phases of data collection, development, and deployment, which extend far beyond model training alone. Relevant mitigation techniques are also suggested, rather than merely relying on generic notions of what counts as fairness.
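The abstract does not spell out which mitigation techniques the paper proposes. As a generic illustration of one widely used data-level mitigation, the sketch below shows inverse-frequency sample reweighting, which counteracts representation bias introduced at data-collection time by making each group contribute equally to the training loss. All variable names and the toy dataset are hypothetical and assumed for illustration only; they are not taken from the paper.

```python
# Minimal sketch of a data-level harm mitigation: inverse-frequency reweighting.
# Assumption: 'group' marks a slice that was under-sampled during data collection.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Toy dataset: group 1 is rare (10%), so an unweighted model is dominated by group 0.
n = 1000
group = rng.choice([0, 1], size=n, p=[0.9, 0.1])
X = rng.normal(size=(n, 3)) + group[:, None]          # feature distribution shifts by group
y = (X.sum(axis=1) + rng.normal(size=n) > 1).astype(int)

# Inverse-frequency weights: each group contributes equally to the loss,
# regardless of how often it appears in the collected data.
counts = np.bincount(group)
weights = (n / (len(counts) * counts))[group]

model = LogisticRegression()
model.fit(X, y, sample_weight=weights)

# Inspect performance per group rather than relying on a single aggregate score.
print("per-group accuracy:",
      [model.score(X[group == g], y[group == g]) for g in (0, 1)])
```

Reweighting is only one point of intervention; comparable mitigations exist at other pipeline stages the abstract mentions, such as re-auditing collection protocols or monitoring deployed models for distribution shift.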