The voice across Japan database-the Japanese language contribution to Polyphone

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 1994-04-19 DOI:10.1109/ICASSP.1994.389348

Thomas Staples, J. Picone, Nozomi Arai

引用次数: 10

Abstract

Texas Instruments' Voice Across Japan (VAJ) database, modeled after the highly successful Voice Across America project, consists of a wide range of diverse speech material including digit strings, yes/no questions, and phonetically-rich read sentences. The data is being collected using long distance telephone lines and an analog telephone interface. The target size is 14 items per speaker by 10,000 speakers. Greater emphasis is being placed on the collection of phonetically-rich read sentence data. Four randomly selected sentences are included in each session: one from the 512 sentence ATR PB set, and three from a 10,000 sentence set developed specifically for this project. This latter sentence set, designed to maximize the triphone coverage of the database, is described. The VAJ database is planned to be included in the Linguistic Data Consortium's (LDC) Polyphone (multi-language) database.<>

查看原文本刊更多论文

横跨日本的声音数据库-日语对Polyphone的贡献

德州仪器的Voice Across Japan (VAJ)数据库是仿造了非常成功的Voice Across America项目，由各种各样的语音材料组成，包括数字串、是/否问题和语音丰富的可读句子。数据是通过长途电话线和模拟电话接口收集的。目标尺寸是每10,000名演讲者14件物品。更大的重点放在收集语音丰富的阅读句子数据上。每个会话包含四个随机选择的句子:一个来自512个句子的ATR PB集，另外三个来自专门为这个项目开发的10,000个句子集。描述了后一种句子集，旨在最大限度地扩大数据库的三重电话覆盖范围。VAJ数据库计划被纳入语言数据联盟(LDC)的Polyphone(多语言)数据库。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

自引率

0.00%

发文量