算法口技:人工智能语音生成器中饱受争议的语音状态

IF 5.5 1区 文学 Q1 COMMUNICATION
Ido Ramati
{"title":"算法口技:人工智能语音生成器中饱受争议的语音状态","authors":"Ido Ramati","doi":"10.1177/20563051231224401","DOIUrl":null,"url":null,"abstract":"This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.","PeriodicalId":47920,"journal":{"name":"Social Media + Society","volume":"1 7","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators\",\"authors\":\"Ido Ramati\",\"doi\":\"10.1177/20563051231224401\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.\",\"PeriodicalId\":47920,\"journal\":{\"name\":\"Social Media + Society\",\"volume\":\"1 7\",\"pages\":\"\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Media + Society\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1177/20563051231224401\",\"RegionNum\":1,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Media + Society","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1177/20563051231224401","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 0

摘要

本文探讨了文本到语音(TTS)生成器中蕴含的人机关系。文章追溯了合成语音背后的人类来源,并追踪了机器学习算法对语音的修正,认为 Siri 和 Alexa 等人工智能(AI)语音代理以及 TikTok 等其他 TTS 行为都是在表演算法口技。人工智能语音技术机械地使用专业配音艺术家的声音说话,并对这些声音进行算法处理,从而生成了角色,在具身与虚拟、特殊与一般、人类与非人类以及语音与书写之间形成了一连串相互关联的紧张关系。算法口技作为一种分析框架,将 TTS 系统的技术发声操作与其文化、经济、哲学和社会语言学困境联系在一起。最后一节讨论了算法腹语在语音领域之外的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators
This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Social Media + Society
Social Media + Society COMMUNICATION-
CiteScore
9.20
自引率
3.80%
发文量
111
审稿时长
12 weeks
期刊介绍: Social Media + Society is an open access, peer-reviewed scholarly journal that focuses on the socio-cultural, political, psychological, historical, economic, legal and policy dimensions of social media in societies past, contemporary and future. We publish interdisciplinary work that draws from the social sciences, humanities and computational social sciences, reaches out to the arts and natural sciences, and we endorse mixed methods and methodologies. The journal is open to a diversity of theoretic paradigms and methodologies. The editorial vision of Social Media + Society draws inspiration from research on social media to outline a field of study poised to reflexively grow as social technologies evolve. We foster the open access of sharing of research on the social properties of media, as they manifest themselves through the uses people make of networked platforms past and present, digital and non. The journal presents a collaborative, open, and shared space, dedicated exclusively to the study of social media and their implications for societies. It facilitates state-of-the-art research on cutting-edge trends and allows scholars to focus and track trends specific to this field of study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信