AI voice between anthropocentrism and posthumanism: Alexa and voice cloning

Q1 Arts and Humanities

Journal of Interdisciplinary Voice Studies Pub Date : 2022-08-01 DOI:10.1386/jivs_00053_1

Domenico Napolitano

{"title":"AI voice between anthropocentrism and posthumanism: Alexa and voice cloning","authors":"Domenico Napolitano","doi":"10.1386/jivs_00053_1","DOIUrl":null,"url":null,"abstract":"This article deals with the groundbreaking phenomenon of AI voice, highlighting two possible meanings that are often not problematized: the voice embedded into AI-based devices and the voice created using AI algorithms. In order to clarify the distinctions and the intersections of these two meanings, the article uses an approach inspired by media archaeology and social constructionism. It argues that AI voice as a social phenomenon is constructed by the interaction of a discursive level of representations and a non-discursive level of material practices and operations. The interaction of these two levels results in a tension between anthropocentrism and posthumanism, which is a characteristic of AI voice. Such tension is investigated through two case studies: the commercial of the smart speaker Amazon Alexa and the phenomenon of ‘voice cloning’. While the first is an example of how at a discursive level the ‘voice in the machine’ is represented as a way to ‘personify’ AI technology, the second, which consists in the possibility of reproducing the features of an embodied and personal voice, is an example of how the materialization of that cultural idea depends on the technical possibilities and material practices required by data-driven algorithms.","PeriodicalId":36145,"journal":{"name":"Journal of Interdisciplinary Voice Studies","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Interdisciplinary Voice Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1386/jivs_00053_1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}

引用次数: 0

Abstract

This article deals with the groundbreaking phenomenon of AI voice, highlighting two possible meanings that are often not problematized: the voice embedded into AI-based devices and the voice created using AI algorithms. In order to clarify the distinctions and the intersections of these two meanings, the article uses an approach inspired by media archaeology and social constructionism. It argues that AI voice as a social phenomenon is constructed by the interaction of a discursive level of representations and a non-discursive level of material practices and operations. The interaction of these two levels results in a tension between anthropocentrism and posthumanism, which is a characteristic of AI voice. Such tension is investigated through two case studies: the commercial of the smart speaker Amazon Alexa and the phenomenon of ‘voice cloning’. While the first is an example of how at a discursive level the ‘voice in the machine’ is represented as a way to ‘personify’ AI technology, the second, which consists in the possibility of reproducing the features of an embodied and personal voice, is an example of how the materialization of that cultural idea depends on the technical possibilities and material practices required by data-driven algorithms.

查看原文本刊更多论文

人类中心主义与后人类主义之间的AI声音：Alexa与声音克隆

本文讨论了人工智能语音的开创性现象，强调了两种通常不会出现问题的可能含义：嵌入基于人工智能的设备中的语音和使用人工智能算法创建的语音。为了阐明这两种意义的区别和交叉，本文采用了一种受媒体考古学和社会建构主义启发的方法。它认为，人工智能声音作为一种社会现象是由表征的话语水平和物质实践和操作的非话语水平的相互作用构建的。这两个层面的互动导致了人类中心主义和后人类主义之间的紧张关系，这是人工智能声音的一个特征。通过两个案例研究来调查这种紧张关系：智能扬声器亚马逊Alexa的商业化和“语音克隆”现象。虽然第一个例子是如何在话语层面上将“机器中的声音”表示为“人格化”人工智能技术的一种方式，但第二个例子是再现具体和个人声音的特征的可能性，是一个例子，说明了这种文化理念的具体化如何取决于数据驱动算法所需的技术可能性和物质实践。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Interdisciplinary Voice Studies Arts and Humanities-Literature and Literary Theory

CiteScore

1.20

自引率

0.00%

发文量