Rapid language adaptation tools for multilingual speech processing

Proceedings. IEEE Workshop on Automatic Speech Recognition and Understanding Pub Date : 2009-12-01 DOI:10.1109/ASRU.2009.5373503

Tanja Schultz

{"title":"Rapid language adaptation tools for multilingual speech processing","authors":"Tanja Schultz","doi":"10.1109/ASRU.2009.5373503","DOIUrl":null,"url":null,"abstract":"The performance of speech and language processing technologies has improved dramatically over the past decade, with an increasing number of systems being deployed in a large variety of applications, such as spoken dialog systems, speech summarization and information retrieval systems, and speech translation systems. Most efforts to date were focused on a very small number of languages with large number of speakers, economic potential, and information technology needs of the population. However, speech technology has a lot to contribute even to those languages that do not fall into this category. Languages with a small number of speakers and few linguistic resources may suddenly become of interest for humanitarian and military reasons. Furthermore, a large number of languages are in danger of becoming extinct, and ongoing projects for preserving them could benefit from speech technology. With more than 6900 languages in the world and the need to support multiple input and output languages, the most important challenge today is to port speech processing systems to new languages rapidly and at reasonable costs. In my talk I will introduce state-of-the-art techniques for rapid language adaptation and present solutions to overcome the ever-existing problem of data sparseness and the gap between language and technology expertise. I will describe the building process for speech and language processing components for new unsupported languages and introduce tools to do this rapidly and at lost costs. I describe the Rapid Language Adaptation Tools (RLAT) which built on existing projects like SPICE, GlobalPhone, and FestVox and enable users to develop speech processing components, to collect appropriate speech and text data for building and improving these components, and to evaluate the results allowing for iterative improvements.","PeriodicalId":89617,"journal":{"name":"Proceedings. IEEE Workshop on Automatic Speech Recognition and Understanding","volume":"103 1","pages":"51"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE Workshop on Automatic Speech Recognition and Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2009.5373503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The performance of speech and language processing technologies has improved dramatically over the past decade, with an increasing number of systems being deployed in a large variety of applications, such as spoken dialog systems, speech summarization and information retrieval systems, and speech translation systems. Most efforts to date were focused on a very small number of languages with large number of speakers, economic potential, and information technology needs of the population. However, speech technology has a lot to contribute even to those languages that do not fall into this category. Languages with a small number of speakers and few linguistic resources may suddenly become of interest for humanitarian and military reasons. Furthermore, a large number of languages are in danger of becoming extinct, and ongoing projects for preserving them could benefit from speech technology. With more than 6900 languages in the world and the need to support multiple input and output languages, the most important challenge today is to port speech processing systems to new languages rapidly and at reasonable costs. In my talk I will introduce state-of-the-art techniques for rapid language adaptation and present solutions to overcome the ever-existing problem of data sparseness and the gap between language and technology expertise. I will describe the building process for speech and language processing components for new unsupported languages and introduce tools to do this rapidly and at lost costs. I describe the Rapid Language Adaptation Tools (RLAT) which built on existing projects like SPICE, GlobalPhone, and FestVox and enable users to develop speech processing components, to collect appropriate speech and text data for building and improving these components, and to evaluate the results allowing for iterative improvements.

查看原文本刊更多论文

用于多语言语音处理的快速语言适应工具

在过去的十年中，语音和语言处理技术的性能有了显著的提高，越来越多的系统被部署在各种各样的应用中，如语音对话系统、语音摘要和信息检索系统以及语音翻译系统。迄今为止，大多数努力都集中在极少数有大量使用者的语言、经济潜力和人口的信息技术需求上。然而，语音技术甚至对那些不属于这一类的语言也有很多贡献。由于人道主义和军事原因，使用人数少、语言资源少的语言可能突然成为人们感兴趣的语言。此外，大量语言正面临灭绝的危险，正在进行的保护这些语言的项目可以从语音技术中受益。世界上有6900多种语言，需要支持多种输入和输出语言，目前最重要的挑战是以合理的成本将语音处理系统快速移植到新语言。在我的演讲中，我将介绍最先进的快速语言适应技术，并提出解决方案，以克服数据稀疏问题以及语言和技术专业知识之间的差距。我将描述为新的不受支持的语言构建语音和语言处理组件的过程，并介绍快速完成此工作的工具。我描述了快速语言适应工具(RLAT)，它建立在SPICE、GlobalPhone和FestVox等现有项目的基础上，使用户能够开发语音处理组件，收集适当的语音和文本数据以构建和改进这些组件，并评估允许迭代改进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. IEEE Workshop on Automatic Speech Recognition and Understanding

自引率

0.00%

发文量