{"title":"Development of Hindi mobile communication text and speech corpus","authors":"S. Sinha, S. Agrawal, Jesper Ø. Olsen","doi":"10.1109/ICSDA.2011.6085975","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085975","url":null,"abstract":"This paper describes the collection of a text and audio corpus for mobile personal communication in Hindi. Hindi is the largest of the Indian languages and is the first language of more than 200 million people, who use it not only for spoken mobile communication but also for sending text messages to each other. The main script for Hindi is Devanagari, but it is not well supported by the current generation of mobile devices: the Devanagari alphabet is twice as large as the English alphabet, which makes it difficult to fit onto the small keypad of a mobile device. The aim of this project is to collect text and speech resources which can be used for training spoken language systems that aid text messaging on mobile devices - i.e. to train a speech recogniser for the mobile personal communication domain so that text can be input through dictation rather than by typing. In total we collected a text corpus of 2 million words of natural messages in 12 different domains, and a spoken corpus of 100 speakers who each spoke 630 phonetically rich sentences - about 4 hours of speech. The speech utterances were recorded at 16 kHz through 3 recording channels: a mobile phone, a headset and a desktop-mounted microphone. The data sets have been fully annotated and are available for the development of speech recognition and synthesis systems in the mobile domain.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132701330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Assessing the naturalness of malay emotional voice corpora","authors":"Mumtaz Begum Mustafa, R. N. Ainon, R. Zainuddin, Z. M. Don, G. Knowles","doi":"10.1109/ICSDA.2011.6086002","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6086002","url":null,"abstract":"This research reports the development and evaluation of Malay emotional voice corpora through listening evaluation, and how the number of emotion choices offered to evaluators affects the result of the evaluation. The voice corpora comprise three emotions, namely anger, sadness and happiness, expressed by two male and two female actors. The voice corpora were evaluated in two separate listening tests involving Malay native evaluators balanced for gender, age and profession. In the first listening test, evaluators were given twenty-five emotion choices to choose from; in the second test, only five. Each test was conducted separately with a different group of evaluators. The results of the two tests differ markedly, with the emotion identification rate of the first test lower than that of the second.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127928331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The influence of Shandong dialects on the acquisition of English plosives","authors":"Yuan Jia, Xia Wang, Ai-jun Li","doi":"10.1109/ICSDA.2011.6085984","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085984","url":null,"abstract":"The present study uses acoustic analysis to investigate the articulatory problems of Shandong (hereinafter SD) learners in the production of English plosives. VOT, pitch and formants were selected as the parameters for examining the manner and place features of the plosives produced by the SD learners. Results demonstrate that the SD learners pronounce the voiced stops as voiceless ones; this is due to negative transfer from the SD dialect, which has been proposed to contain no voiced stops. Further, the SD speakers also exhibit problems with aspiration and tongue position in the articulation of [d, g, t′, k′], a result attributable to the influence of the SD dialect.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128228372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Acoustic Parameter Databases of Dagur, Evenki, Oroqen nationalities","authors":"Hu He, Xuewen Zhou, Wu Ri Ge, Xi Le Tu, M. Ge, Zheng Yuling","doi":"10.1109/ICSDA.2011.6085993","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085993","url":null,"abstract":"Building unified acoustic parameter databases of minority languages in China is pioneering work: it can promote the standardization and computerization of minority-language phonetics, provide scientific evidence for speech education, speech recognition and speech synthesis, protect weak and endangered languages with modern scientific means, and ensure resource sharing and the continuance of phonetic research. After establishing databases for the Tibetan, Uigur and Yi languages, we expanded the databases with three endangered minority languages: Dagur, Evenki and Oroqen. In the process of building the acoustic parameter databases, we improved the rules and methods for measuring acoustic parameters and conducted research on features of these endangered languages, such as their prosodic patterns and frequently used flap phoneme.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124159806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The development of a database of functional and emotional intonation in Chinese","authors":"Maolin Wang, Yingjun Li, M. Lin, Ai-jun Li, Ziyu Xiong","doi":"10.1109/ICSDA.2011.6085995","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085995","url":null,"abstract":"A speech database is a very important resource for speech processing research. In this paper, the design and development of a database of functional and emotional intonation in Chinese (DFEIC) is described. The database is based on about 110 hours of conversations from movies and TV dramas. Utterances are segmented, and syllable and prosodic labeling is provided. Functions such as statements and questions, as well as emotions such as happiness and anger, are also labeled. This database will be applicable to the study of functional and emotional intonation, and it will also be useful for functional and emotional recognition.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133646736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and creation of Dysarthric Speech Database for development of QoLT software technology","authors":"Dae-Lim Choi, Bong-Wan Kim, Yeon-Whoa Kim, Yong-Ju Lee, Yongnam Um, Minhwa Chung","doi":"10.1109/ICSDA.2011.6085978","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085978","url":null,"abstract":"In this paper we introduce the creation of a speech database for developing speech technology for disabled persons, carried out as part of a national program to improve the quality of life of Korean people. We report on the creation of a speech database of a total of 160 persons - the prompting items, design, etc. - needed to develop an embedded keyword-spotting speech recognition system tailored to persons with articulation disabilities. The created database is being used by the technology development team in the national program to study the phonetic characteristics of the different types of disability, develop an automatic method to assess degrees of disability, investigate the phonetic features of the speech of the disabled, and design and implement a software prototype for personal embedded speech recognition systems adapted to disabled persons.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129776957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unsupervised phone segmentation method using delta spectral function","authors":"Dac-Thang Hoang, Hsiao-Chuan Wang","doi":"10.1109/ICSDA.2011.6085998","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085998","url":null,"abstract":"Unsupervised phone segmentation means that the phone boundaries in an utterance can be detected without prior knowledge of the text content. Usually, a spectral change in the speech signal implies the existence of a phone boundary. In this paper, the Delta Spectral Function (DSF) is defined for each frame to represent the variation of band energy in a specific band. A number of bands giving the highest DSF values in a frame are then chosen to define a measure of spectral change; the chosen bands are not fixed but are selected dynamically frame by frame. The peaks of the spectral change curve are taken as candidate boundaries, and a fine-tuning procedure is then applied to choose the peaks that become the detected boundaries. Our proposed method achieves an F-value of 75.3% under the condition of near-zero over-segmentation; in this situation the recall rate is 75.3%. This experimental result is better than many previously reported ones. Moreover, the computation is simple and the proposed method is easy to implement.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129934468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mongolian speech corpus for text-to-speech development","authors":"C. Hansakunbuntheung, A. Thangthai, N. Thatphithakkul, Altangerel Chagnaa","doi":"10.1109/ICSDA.2011.6085994","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085994","url":null,"abstract":"This paper presents a first attempt to develop a Mongolian speech corpus designed for data-driven speech synthesis in Mongolia. The aim of the speech corpus is to support the development of a high-quality Mongolian TTS for blind users to use with a screen reader. The speech corpus contains nearly 6 hours of Mongolian speech. It provides Cyrillic text transcriptions and phonetic transcriptions with stress marking. It also provides context information - including phone context, stress levels, and syntactic position in word, phrase and utterance - for modeling the acoustics and characteristics of speech for synthesis.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134443776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comparative study on accentuation implementation of Chinese EFL learners vs. American native speakers","authors":"Xia Wang, Yuan Jia, Ai-jun Li","doi":"10.1109/ICSDA.2011.6085981","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085981","url":null,"abstract":"This paper investigates how Chinese EFL (English as a foreign language) learners produce accentuation when speaking English. The study focuses on the prosody of Chinese EFL learners' English versus native English, through comparative evaluation of phonological patterns and accent-related prosodic parameters. The results show that the average length of intermediate and intonational phrases is shorter in Chinese EFL learners' English than in native English; that the better a Chinese learner's English is, the closer the partitioning of intermediate/intonational phrases and the accent pattern in his or her speech are to those of native speakers; and that Chinese speakers tend to realize accentuation through pitch range amplification rather than durational lengthening, owing to negative transfer from their native language, Chinese.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133303600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Morpheme concatenation approach in language modeling for large-vocabulary Uyghur speech recognition","authors":"Mijit Ablimit, A. Hamdulla, Tatsuya Kawahara","doi":"10.1109/ICSDA.2011.6085990","DOIUrl":"https://doi.org/10.1109/ICSDA.2011.6085990","url":null,"abstract":"For large-vocabulary continuous speech recognition (LVCSR) of highly inflected languages, selecting an appropriate recognition unit is the first important step. The morpheme-based approach is often adopted because of its high coverage and linguistic properties, but morpheme units are short, often consisting of one or two phonemes, and are thus more easily confused in ASR than word units. Word units generally provide stronger linguistic constraints but increase the vocabulary size explosively, causing OOV (out-of-vocabulary) and data sparseness problems in language modeling. In this research, we investigate approaches to selecting word entries by concatenating morpheme sequences so as to reduce the word error rate (WER). Specifically, we compare the ASR results of the word-based and morpheme-based models and extract typical patterns that reduce the WER. This method has been successfully applied to an Uyghur LVCSR system, resulting in a significant reduction of WER without a drastic increase in vocabulary size.","PeriodicalId":269402,"journal":{"name":"2011 International Conference on Speech Database and Assessments (Oriental COCOSDA)","volume":"1552 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133818206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}