Trapped in the search box: An examination of algorithmic bias in search engine autocomplete predictions

Cong Lin, Yuxin Gao, Na Ta, Kaiyu Li, Hongyao Fu

Telematics and Informatics, Vol. 85, Article 102068, November 2023
DOI: 10.1016/j.tele.2023.102068
URL: https://www.sciencedirect.com/science/article/pii/S0736585323001326
Citations: 0
Abstract
This paper examines the autocomplete algorithmic bias of leading search engines with respect to three sensitive attributes: gender, race, and sexual orientation. By simulating search query prefixes and calling search engine APIs, 106,896 autocomplete predictions were collected, and their semantic toxicity scores, used as measures of negative algorithmic bias, were computed with machine learning models. Results indicate that search engine autocomplete algorithmic bias is broadly consistent with long-standing societal discrimination: historically disadvantaged groups such as women, Black people, and homosexual people suffer higher levels of negative algorithmic bias. Moreover, the degree of algorithmic bias varies across topic categories. Implications for search engine mediatization, and for the mechanisms and consequences of autocomplete algorithmic bias, are discussed.
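The data-collection step the abstract describes (crossing group terms with query templates and requesting autocomplete predictions) can be illustrated with a minimal sketch. This is a hypothetical reconstruction, not the authors' pipeline: the paper does not specify which endpoints were called, so the example assumes Google's public suggest endpoint, whose `client=firefox` variant returns a JSON array of `[query, [suggestions, ...]]`.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint: Google's public suggest service (not confirmed by the paper).
SUGGEST_URL = "https://suggestqueries.google.com/complete/search"

def make_prefixes(groups, templates):
    """Cross group terms with query templates, e.g. 'why are women'."""
    return [t.format(group=g) for g in groups for t in templates]

def build_suggest_url(prefix):
    """Build the request URL for autocomplete predictions on one prefix."""
    return SUGGEST_URL + "?" + urlencode({"client": "firefox", "q": prefix})

def parse_suggestions(raw):
    """The firefox-client response is JSON: [query, [suggestion, ...]]."""
    payload = json.loads(raw)
    return payload[1]

# Example prefixes for a gender comparison (illustrative templates only).
prefixes = make_prefixes(["women", "men"], ["why are {group}", "{group} are"])
urls = [build_suggest_url(p) for p in prefixes]
```

Each returned suggestion would then be scored for semantic toxicity (e.g. with a trained toxicity classifier) and aggregated per group and topic category, as the abstract outlines.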
About the journal:
Telematics and Informatics is an interdisciplinary journal that publishes cutting-edge theoretical and methodological research exploring the social, economic, geographic, political, and cultural impacts of digital technologies. It covers various application areas, such as smart cities, sensors, information fusion, digital society, IoT, cyber-physical technologies, privacy, knowledge management, distributed work, emergency response, mobile communications, health informatics, social media's psychosocial effects, ICT for sustainable development, blockchain, e-commerce, and e-government.