Need for automatically generated narration

David A. Evans, John B. Reichenbach
{"title":"Need for automatically generated narration","authors":"David A. Evans, John B. Reichenbach","doi":"10.1145/2390116.2390130","DOIUrl":null,"url":null,"abstract":"This paper argues that the best current text-to-speech (TTS) synthesis systems are approaching the quality necessary to provide effective automated narration of audio books. Currently, nearly all audio books and audio journals are recorded by professional voice actors at great expense with significant lead times. These cost and time constraints mean that fewer than than 2% of the new titles published each year are available in \"Talking Book\" editions, leaving the visually-impaired and print-disabled community of users with few options when seeking material in digital libraries. State-of-the-art TTS systems now can reproduce human voice prosody of sufficient quality to make listening to long narrative reading both pleasant and comprehensible. Such technology is relatively compact and inexpensive; it is time to deploy it widely as an alternative means of accessing digital texts. This would not only directly benefit the reading-disabled community, but also enable \"digital natives\" and other users to listen to texts on platforms on which reading may not be practical.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"62 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Research Advances in Large Digital Book Repositories","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2390116.2390130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

This paper argues that the best current text-to-speech (TTS) synthesis systems are approaching the quality necessary to provide effective automated narration of audio books. Currently, nearly all audio books and audio journals are recorded by professional voice actors at great expense with significant lead times. These cost and time constraints mean that fewer than than 2% of the new titles published each year are available in "Talking Book" editions, leaving the visually-impaired and print-disabled community of users with few options when seeking material in digital libraries. State-of-the-art TTS systems now can reproduce human voice prosody of sufficient quality to make listening to long narrative reading both pleasant and comprehensible. Such technology is relatively compact and inexpensive; it is time to deploy it widely as an alternative means of accessing digital texts. This would not only directly benefit the reading-disabled community, but also enable "digital natives" and other users to listen to texts on platforms on which reading may not be practical.
需要自动生成的叙述
本文认为,目前最好的文本到语音(TTS)合成系统正在接近提供有效的有声书自动叙述所需的质量。目前,几乎所有的有声书和有声杂志都是由专业的配音演员录制的,花费很大,需要很长时间。这些成本和时间限制意味着,每年出版的新书中只有不到2%是“有声书”版本,这使得视障和印刷品阅读障碍者群体在数字图书馆寻找资料时几乎没有选择。最先进的TTS系统现在可以重现足够质量的人声韵律,使听长篇叙事阅读既愉快又容易理解。这种技术相对紧凑,价格低廉;现在是时候将其作为一种获取数字文本的替代方式进行广泛部署了。这不仅会让阅读障碍群体直接受益,还能让“数字原住民”和其他用户在可能无法阅读的平台上收听文本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信