Need for automatically generated narration

Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2012-10-29 DOI:10.1145/2390116.2390130

David A. Evans, John B. Reichenbach

引用次数: 4

Abstract

This paper argues that the best current text-to-speech (TTS) synthesis systems are approaching the quality necessary to provide effective automated narration of audio books. Currently, nearly all audio books and audio journals are recorded by professional voice actors at great expense with significant lead times. These cost and time constraints mean that fewer than than 2% of the new titles published each year are available in "Talking Book" editions, leaving the visually-impaired and print-disabled community of users with few options when seeking material in digital libraries. State-of-the-art TTS systems now can reproduce human voice prosody of sufficient quality to make listening to long narrative reading both pleasant and comprehensible. Such technology is relatively compact and inexpensive; it is time to deploy it widely as an alternative means of accessing digital texts. This would not only directly benefit the reading-disabled community, but also enable "digital natives" and other users to listen to texts on platforms on which reading may not be practical.

查看原文本刊更多论文

需要自动生成的叙述

本文认为，目前最好的文本到语音(TTS)合成系统正在接近提供有效的有声书自动叙述所需的质量。目前，几乎所有的有声书和有声杂志都是由专业的配音演员录制的，花费很大，需要很长时间。这些成本和时间限制意味着，每年出版的新书中只有不到2%是“有声书”版本，这使得视障和印刷品阅读障碍者群体在数字图书馆寻找资料时几乎没有选择。最先进的TTS系统现在可以重现足够质量的人声韵律，使听长篇叙事阅读既愉快又容易理解。这种技术相对紧凑，价格低廉;现在是时候将其作为一种获取数字文本的替代方式进行广泛部署了。这不仅会让阅读障碍群体直接受益，还能让“数字原住民”和其他用户在可能无法阅读的平台上收听文本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Workshop on Research Advances in Large Digital Book Repositories

自引率

0.00%

发文量