2009 Oriental COCOSDA International Conference on Speech Database and Assessments最新文献

筛选
英文 中文
Introduction to China minority speech acoustic Parameter Database Platform 中国少数民族语音声学参数数据库平台简介
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278361
Xuewen Zhou, Yuling Zheng, He Hu
{"title":"Introduction to China minority speech acoustic Parameter Database Platform","authors":"Xuewen Zhou, Yuling Zheng, He Hu","doi":"10.1109/ICSDA.2009.5278361","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278361","url":null,"abstract":"This paper introduces the current primary functionalities, characteristics and usages of Unified Minority Speech Parameter Database Platform Software as well as future-expanded functions. By using the platform, we can accomplish acoustic parameter retrieval, statistics and analysis of established Tibetan, Uigur and Yi broadcasting acoustic parameter databases. After adding acoustic parameters of more minority language speech into the platform, phonological analysis and comparison of languages within same language family can be achieved. Other important goals of designing the platform are to implement phonetic resources sharing, accumulation and protection of endangered minority language speech esc.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128707664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multilingual number expansion for TTS 多语种数字扩展为TTS
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278365
Jari Alhonen
{"title":"Multilingual number expansion for TTS","authors":"Jari Alhonen","doi":"10.1109/ICSDA.2009.5278365","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278365","url":null,"abstract":"As an inherently language-dependent feature, the expansion of numbers to their full written-out forms can also be accomplished with language-independent code for multilingual TTS systems. In this paper we present a number expansion system that works with little data and language-independent code, while still able to expand numerics in dozens of languages. The paper also describes a way to determine the type of the number for correct expansion type, also with language-independent code, normalization from various scripts, and discusses issues brought up by conjugation of some morphologically rich languages.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114839836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
MASS: A Malay language LVCSR corpus resource 马来语LVCSR语料库资源
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278382
T. Tan, Xiong Xiao, E. Tang, E. Chng, Haizhou Li
{"title":"MASS: A Malay language LVCSR corpus resource","authors":"T. Tan, Xiong Xiao, E. Tang, E. Chng, Haizhou Li","doi":"10.1109/ICSDA.2009.5278382","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278382","url":null,"abstract":"This paper presents the development of the speech, text and pronunciation dictionary resources required to build a large vocabulary speech recognizer for the Malay language. This project is a collaboration project among three universities: USM, MMU from Malaysia and NTU from Singapore. The Malay speech corpus consists of read speech (speaker independent/ dependent and accent independent/ dependent) and broadcast news. To date, 90 speakers have been recorded which is equal to a total of nearly 70 hours of read speech, and 10 hours of broadcast news from local TV stations in Malaysia was transcribed. The text corpus consists of 700Mbytes of data extracted from Malaysia's local news web pages from 1998–2008 and a rule based G2P tool is develop to generate the pronunciation dictionary.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131439225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Unknown example detection for example-based spoken dialog system 基于示例的口语对话系统的未知示例检测
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278363
Shota Takeuchi, Hiromichi Kawanami, H. Saruwatari, K. Shikano
{"title":"Unknown example detection for example-based spoken dialog system","authors":"Shota Takeuchi, Hiromichi Kawanami, H. Saruwatari, K. Shikano","doi":"10.1109/ICSDA.2009.5278363","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278363","url":null,"abstract":"In a spoken dialog system, the example-based response generation method generates a response by searching a dialog example database for the example question most similar to an input user utterance. That method has the advantage of ease of system expansion. It requires, however, a number of utterance examples whose correct responses are labeled. In this paper, we propose an approach to reducing the system expansion cost. This approach employs a detection method that screens the unknown examples, the utterances to be added to the database with their correct responses. The experimental results show that the method can reduce the number of utterances required to be labeled while maintaining the system response accuracy improvement as well as full labeling.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"66 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120969578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Maximum Entropy combined FSM stemming method for Uyghur 维吾尔语最大熵组合FSM词干方法
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278378
Aishan Wumaier, Zaokere Kadeer, Parida Tursun, Shengwei Tian
{"title":"Maximum Entropy combined FSM stemming method for Uyghur","authors":"Aishan Wumaier, Zaokere Kadeer, Parida Tursun, Shengwei Tian","doi":"10.1109/ICSDA.2009.5278378","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278378","url":null,"abstract":"This paper presents the generation of Uyghur Noun Suffix DFA combined with Maximum Entropy (MaxEnt) for stemming algorithm. Because of the agglutinative nature of Uyghur language, stemming is an essential task for Uyghur language processing applications. We generate Uyghur noun inflectional suffixes finite state machines (FSMs) by using the morphotactic rules in reverse order. But there are eight suffixes which is similar to the ending part of some words. These suffixes make the FSM ambiguous. We apply the MaxEnt model to resolve ambiguity of the FSM. This paper describes the steps of generating the FSM, building the MaxEnt suffix identifying model and combination of MaxEnt with FSM.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114126844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Utilization of acoustical feature in visualization of multiple speech corpora 声学特征在多语音语料库可视化中的应用
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278359
K. Yamakawa, H. Kikuchi, T. Matsui, S. Itahashi
{"title":"Utilization of acoustical feature in visualization of multiple speech corpora","authors":"K. Yamakawa, H. Kikuchi, T. Matsui, S. Itahashi","doi":"10.1109/ICSDA.2009.5278359","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278359","url":null,"abstract":"The purpose of this study is to visualize the similarities among multiple speech corpora. In order for users to easily utilize various speech corpora, we reported a visualization method based on the corpus attribute using MDS. We had proposed the eight attributes as the speech corpus features. However, these attributes contained no acoustical feature of the speech corpus. The acoustical feature is important information in some intended use of corpus. In this paper, we propose a new attribute, the acoustical feature of speech corpora, in addition to the conventional attributes. The results of visualization indicates that the method using the new attribute can visualize better the similarities between multiple speech corpora. This will facilitate searching efficiently the specific corpus that fits a user's needs. Based on the obtained results, we built a corpus search system which corpus users can use as a benchmark of corpus selection. The outline and possibility of this system are described.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125081195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
LOTUS-BN: A Thai broadcast news corpus and its research applications LOTUS-BN:泰国广播新闻语料库及其研究应用
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278377
A. Chotimongkol, K. Saykhum, P. Chootrakool, N. Thatphithakkul, C. Wutiwiwatchai
{"title":"LOTUS-BN: A Thai broadcast news corpus and its research applications","authors":"A. Chotimongkol, K. Saykhum, P. Chootrakool, N. Thatphithakkul, C. Wutiwiwatchai","doi":"10.1109/ICSDA.2009.5278377","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278377","url":null,"abstract":"This paper describes the design and construction of the LOTUS-BN corpus, a Thai television broadcast news corpus. In addition to audio recordings and their transcription, this corpus also includes a detailed annotation of many interesting characteristics of broadcast news data such as acoustic condition, overlapping speech, news topic and named entity. The LOTUS-BN is still an ongoing project with the goal of collecting 100 hours of speech. We report initial statistics analyzed from 60 hours of speech which show that the LOTUS-BN corpus has a rich vocabulary of approximately 26,000 words with one third of them are named entities. Thus, this corpus is a good resource for developing an LVCSR system and investigating on named entity detection and recognition in addition to broadcast news related applications. Research applications on these topics are also discussed.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134516257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
A preliminary study on corpus design for computer-assisted German and Mandarin language learning 计算机辅助德语和汉语学习语料库设计的初步研究
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278357
Chia-yu Chiu, Y. Liao, Daniel Kulls, H. Mixdorff, Shing-Lung Chen
{"title":"A preliminary study on corpus design for computer-assisted German and Mandarin language learning","authors":"Chia-yu Chiu, Y. Liao, Daniel Kulls, H. Mixdorff, Shing-Lung Chen","doi":"10.1109/ICSDA.2009.5278357","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278357","url":null,"abstract":"This paper reports on the progress of a joint German-Taiwan computer assisted language learning (CALL) project. One major goal of this project is to collect a bi-lingual (both native and second language, i.e., L1 and L2) speech corpus of L2 learners of German and Mandarin across German and Taiwan. In the preparation phase of the database collection, contrastive analysis of German and Mandarin phonetic and prosodic systems is performed, and the potential pronunciation errors predicted to be made by L2 learners are hypothesized in a set of confusion tables. We expect to apply the set of confusable tables to database design. The eventual database collection will be conducted during the next three years.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134413227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Uyghur vowel weakening processing system 维吾尔语元音弱化处理系统
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278374
Tuergen Yibulayin, Parida Tursun, Aishan Wumaier, Zaokere Kadeer
{"title":"Uyghur vowel weakening processing system","authors":"Tuergen Yibulayin, Parida Tursun, Aishan Wumaier, Zaokere Kadeer","doi":"10.1109/ICSDA.2009.5278374","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278374","url":null,"abstract":"Uyghur vowel weakening has three kinds of phenomena, such as vowel dropping, inserting and neutralizing. When words added some suffixes, there always occurs some weakening take place. In Uyghur, vowel neutralizing is very common, and when we need to find the stem form of an inflected stem, it is hard to identify the neutralized vowel's original vowel without a dictionary in some cases. In Uyghur, there are eight vowels. But disregarding vowel length distinction, one identifies nine vowels in Uyghur. Actually, the neutral vowel in Uyghur represents two distinct phonemes, even though they share the same set of allophones and are orthographically alike. In this paper, we introduce our vowel weakening processing system and the Maximum Entropy (MaxEnt) based neutral vowel phoneme identification model.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116561622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of Japanese infant speech database using longitudinal recordings from birth to five years old 利用从出生到五岁的纵向录音开发日本婴儿语音数据库
2009 Oriental COCOSDA International Conference on Speech Database and Assessments Pub Date : 2009-10-02 DOI: 10.1109/ICSDA.2009.5278379
S. Amano, T. Kondo, Kazumi Kato, T. Nakatani
{"title":"Development of Japanese infant speech database using longitudinal recordings from birth to five years old","authors":"S. Amano, T. Kondo, Kazumi Kato, T. Nakatani","doi":"10.1109/ICSDA.2009.5278379","DOIUrl":"https://doi.org/10.1109/ICSDA.2009.5278379","url":null,"abstract":"Previously developed longitudinal infant speech databases are limited in terms of recording period or number of utterances. To facilitate longitudinal research on the development of speech production, an infant speech database was developed by using five years of recordings containing a large number of daily life utterances of five Japanese infants and their parents. The resulting database contains 269,467 utterances with various types of information including a transcription, a fundamental frequency, and a phoneme label. The database is useful for both acoustical and linguistic analyses of speech development.","PeriodicalId":254906,"journal":{"name":"2009 Oriental COCOSDA International Conference on Speech Database and Assessments","volume":"335 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126426169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信