Romanian Spoken Language Resources and Annotation for Speaker Independent Spontaneous Speech Recognition

C. Burileanu, Andi Buzo, Cristina Sorina Petre, Diana Ghelmez-Haneş, H. Cucu
{"title":"Romanian Spoken Language Resources and Annotation for Speaker Independent Spontaneous Speech Recognition","authors":"C. Burileanu, Andi Buzo, Cristina Sorina Petre, Diana Ghelmez-Haneş, H. Cucu","doi":"10.1109/ICDT.2010.9","DOIUrl":null,"url":null,"abstract":"This paper presents studies and early results with the scope to build a robust spontaneous speech recognition system in Romanian language. We have tried to give solutions to several issues that have arisen like building a large and accurate database within a reasonable time. A short description of the database is given and some statistics are collected in order to show its evolution in several stages of the project. Embedded training technique has been used for training triphones. As a consequence, the alignment problem has been studied and a solution is proposed for it. The final purpose of these attempts is to obtain substantial results in speech recognition for Romanian language that can be used as baseline for further results.","PeriodicalId":322589,"journal":{"name":"2010 Fifth International Conference on Digital Telecommunications","volume":"01 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Fifth International Conference on Digital Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDT.2010.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

This paper presents studies and early results with the scope to build a robust spontaneous speech recognition system in Romanian language. We have tried to give solutions to several issues that have arisen like building a large and accurate database within a reasonable time. A short description of the database is given and some statistics are collected in order to show its evolution in several stages of the project. Embedded training technique has been used for training triphones. As a consequence, the alignment problem has been studied and a solution is proposed for it. The final purpose of these attempts is to obtain substantial results in speech recognition for Romanian language that can be used as baseline for further results.
罗马尼亚语口语资源和独立说话人自发语音识别的注释
本文介绍了在罗马尼亚语中建立一个鲁棒的自发语音识别系统的研究和初步结果。我们试图在合理的时间内解决一些问题,比如建立一个庞大而准确的数据库。对数据库进行了简短的描述,并收集了一些统计数据,以显示其在项目的几个阶段的演变。嵌入式培训技术已被用于培训三重奏。因此,本文对该问题进行了研究,并提出了一种解决方案。这些尝试的最终目的是在罗马尼亚语语音识别方面取得实质性成果,可以作为进一步成果的基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信