Ludovica Ilari, Giulia Rafaiani, M. Baldi, B. Giovanola
{"title":"基于机器学习的互联网通信过滤中的伦理偏见","authors":"Ludovica Ilari, Giulia Rafaiani, M. Baldi, B. Giovanola","doi":"10.1109/ETHICS57328.2023.10154975","DOIUrl":null,"url":null,"abstract":"The use of automated systems based on artificial intelligence and machine learning for filtering and moderating online communications has become commonplace. While this allows for high levels of efficiency and fine-grained control of malicious behaviors, it could also produce unintended disparities in treatment of legitimate users. In this paper, we aim at identifying some possible field-related biases in the wellknown Google Perspective API machine learning-based engine for controlling Internet communications. For this purpose, we consider communications in the fields of health, trade, finance, and defense and build a data set collecting Twitter-based online communications of the World Health Organization (WHO), World Trade Organization (WTO), International Monetary Fund (IMF) and North Atlantic Treaty Organization (NATO). Collected data are then analyzed through Perspective API to assign them an alleged likelihood of being abusive for specific emotional concepts, referred to as attributes. Upon analysis, discrimination between the considered users is identified for all attributes. 
This result, although preliminary, apparently indicates that Perspective API creates discrimination for field-related content as a result of semantic biases in the data, thus highlighting the need for an ethically sound design of these systems, following an ethics by design approach.","PeriodicalId":203527,"journal":{"name":"2023 IEEE International Symposium on Ethics in Engineering, Science, and Technology (ETHICS)","volume":"2006 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ethical Biases in Machine Learning-based Filtering of Internet Communications\",\"authors\":\"Ludovica Ilari, Giulia Rafaiani, M. Baldi, B. Giovanola\",\"doi\":\"10.1109/ETHICS57328.2023.10154975\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of automated systems based on artificial intelligence and machine learning for filtering and moderating online communications has become commonplace. While this allows for high levels of efficiency and fine-grained control of malicious behaviors, it could also produce unintended disparities in treatment of legitimate users. In this paper, we aim at identifying some possible field-related biases in the wellknown Google Perspective API machine learning-based engine for controlling Internet communications. For this purpose, we consider communications in the fields of health, trade, finance, and defense and build a data set collecting Twitter-based online communications of the World Health Organization (WHO), World Trade Organization (WTO), International Monetary Fund (IMF) and North Atlantic Treaty Organization (NATO). Collected data are then analyzed through Perspective API to assign them an alleged likelihood of being abusive for specific emotional concepts, referred to as attributes. Upon analysis, discrimination between the considered users is identified for all attributes. 
This result, although preliminary, apparently indicates that Perspective API creates discrimination for field-related content as a result of semantic biases in the data, thus highlighting the need for an ethically sound design of these systems, following an ethics by design approach.\",\"PeriodicalId\":203527,\"journal\":{\"name\":\"2023 IEEE International Symposium on Ethics in Engineering, Science, and Technology (ETHICS)\",\"volume\":\"2006 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Symposium on Ethics in Engineering, Science, and Technology (ETHICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ETHICS57328.2023.10154975\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Symposium on Ethics in Engineering, Science, and Technology (ETHICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETHICS57328.2023.10154975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ethical Biases in Machine Learning-based Filtering of Internet Communications
The use of automated systems based on artificial intelligence and machine learning to filter and moderate online communications has become commonplace. While this allows for high efficiency and fine-grained control of malicious behavior, it can also produce unintended disparities in the treatment of legitimate users. In this paper, we aim to identify possible field-related biases in the well-known Google Perspective API, a machine learning-based engine for moderating Internet communications. For this purpose, we consider communications in the fields of health, trade, finance, and defense, and build a data set of Twitter-based online communications from the World Health Organization (WHO), the World Trade Organization (WTO), the International Monetary Fund (IMF), and the North Atlantic Treaty Organization (NATO). The collected data are then analyzed through the Perspective API, which assigns each message a likelihood of being abusive with respect to specific emotional concepts, referred to as attributes. The analysis reveals discrimination between the considered users for all attributes. This result, although preliminary, indicates that the Perspective API discriminates against field-related content as a result of semantic biases in the data, highlighting the need for an ethically sound design of such systems, following an ethics-by-design approach.
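As a rough illustration of the kind of per-attribute scoring the abstract describes, the sketch below parses a Perspective-API-style response and extracts the summary score for each requested attribute. The response layout (`attributeScores` → `summaryScore` → `value`) follows the publicly documented Perspective API format, but the helper function, the attribute selection, and all score values here are hypothetical illustration values, not results from the paper.

```python
# Minimal sketch, assuming a Perspective-API-shaped response dict.
# The comments:analyze endpoint returns, per requested attribute, a
# probability-like summary score in [0, 1]; the numbers below are made up.

def summary_scores(response: dict) -> dict:
    """Extract each attribute's summary score from an analyze-style response."""
    return {
        attr: body["summaryScore"]["value"]
        for attr, body in response["attributeScores"].items()
    }

# Fabricated example response for a single message:
sample_response = {
    "attributeScores": {
        "TOXICITY": {"summaryScore": {"value": 0.12, "type": "PROBABILITY"}},
        "INSULT":   {"summaryScore": {"value": 0.07, "type": "PROBABILITY"}},
    }
}

scores = summary_scores(sample_response)
# scores maps attribute name -> likelihood, e.g. {"TOXICITY": 0.12, "INSULT": 0.07}
```

In a real pipeline, one response of this shape would be collected per tweet per organization, and the per-attribute score distributions could then be compared across WHO, WTO, IMF, and NATO accounts to probe for field-related disparities.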