2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE): Latest Papers

Context-dependent grapheme-to-phoneme evaluation corpus using flexible contexts and Categorial Matrix

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357884

C. Hansakunbuntheung, Sumonmas Thatphithakkul

Abstract: Context-dependent pronunciation, e.g. of homographs, is a difficult issue in grapheme-to-phoneme conversion (G2P). It degrades accuracy in speech synthesis and speech recognition. However, context-dependent pronunciation is rarely considered when collecting pronunciation corpora for evaluating G2P accuracy. Thus, this paper proposes a context-dependent pronunciation corpus that uses grapheme-phoneme pairs with their context information for G2P assessment. The context information includes: 1) a Categorial Matrix for representing orthographic types and usage domains of orthographic groups (OGs), designed to investigate problem categories in G2P; 2) a regular-expression-based flexible context for representing context variation; and 3) OG Classes for representing interchangeable OGs in the flexible context. The flexible context and the word classes are designed to remove redundant contexts while covering context variation with minimal sets of patterns. By using the proposed corpus, automatic context generation for G2P evaluation can be implemented.
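
The combination of regular-expression-based flexible contexts and OG Classes described in the abstract above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration only: the `<CLASS>` placeholder syntax, the class names, the English homograph "lead", and the phoneme strings are hypothetical and are not taken from the paper's corpus format.

```python
import re

# Hypothetical OG classes: each class lists interchangeable orthographic groups.
OG_CLASSES = {
    "DET": ["the", "a", "this"],
}

def expand_classes(pattern, classes):
    """Replace each <CLASS> placeholder with a non-capturing alternation."""
    def substitute(match):
        members = classes[match.group(1)]
        return "(?:" + "|".join(re.escape(m) for m in members) + ")"
    return re.sub(r"<([A-Z]+)>", substitute, pattern)

# Hypothetical corpus entries: (flexible context pattern, phoneme string).
# One pattern with a class and an optional word covers many concrete contexts.
ENTRIES = [
    (r"\b<DET> (?:\w+ )?lead\b", "L EH D"),  # noun reading, optional modifier
    (r"\bto lead\b", "L IY D"),              # verb reading
]

def lookup(sentence, entries=ENTRIES, classes=OG_CLASSES):
    """Return the phoneme string of the first entry whose context matches."""
    for pattern, phonemes in entries:
        if re.search(expand_classes(pattern, classes), sentence):
            return phonemes
    return None

print(lookup("the heavy lead pipe"))
print(lookup("she wants to lead the team"))
```

Expanding classes at match time keeps the stored pattern set small, which mirrors the stated goal of covering context variation with minimal sets of patterns.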

Citations: 2

The recognition of neutral tone across acoustic cues

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357865

Shanshan Fan, Ao Chen, Ai-jun Li

Citations: 3

Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357855

Yuichi Sato, Yosuke Kashiwagi, N. Minematsu, D. Saito, K. Hirose

Abstract: The term “World Englishes” describes the current and real state of English, and one of its main characteristics is a large diversity of pronunciation, called accents. We have developed two techniques: individual-based clustering of the diversity [1, 2] and educationally effective visualization of the diversity [3]. Accent clustering requires a technique to quantify the accent gap between any speaker pair, and visualization requires a technique for stress-free plotting of the speakers. In the above studies, however, we developed and assessed these two techniques independently. In this paper, we assess our technique of automatic accent gap prediction when it is used for our stress-free visualization. Further, since CALL applications today are not always used in a quiet environment, we introduce a feature enhancement (denoising) technique to improve the noise-robustness of accent gap prediction. Results show that our accent gap prediction has a correlation of 0.77 with IPA-based, manually defined accent gaps and that, by applying feature enhancement to noisy input utterances, our technique can predict the accent gap that would be obtained in a clean condition when the SNR is larger than 10 dB.

Citations: 1

Stress annotated Urdu speech corpus to build female voice for TTS

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357857

B. Mumtaz, Saba Urooj, S. Hussain, Wajiha Habib

Citations: 4

Information content, weighting and distribution in continuous speech prosody - A cross-genre comparison

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357868

Helen Kai-Yun Chen, Wei-te Fang, Chiu-yu Tseng

Abstract: This study explores the composition of information content in continuous speech using data from a diversity of speech genres. Our approach is to measure information weighting, distribution and correlative expressiveness through perceived prosodic prominences in continuous speech, using data of 4 different styles. This alternative perspective differs from reported studies on emotion-related prosodic expressions and is based mainly on the assumption that patterned prominences are also positively correlated with the allocation and weighted loading of information, but only by higher-level discourse units. Four speech genres, i.e., 2 styles of read speech vs. 2 of spontaneous speech, annotated with perceived prominences at 4 relative degrees, are compared. Information allocation and weighting are calculated using both frequency counts of prominence patterns and weighting scores assigned by prominence level. The most revealing results are found in the data of spontaneous conversation, which feature more varieties of emphasis patterns as a result of constant reduction. Far more significantly, the conversation data also show that while their paragraph-level prosodic units carry the least amount of information content, the discourse-level prosodic units exhibit the highest information weighting score. In other words, one major but less known distinctive feature of conversational speech is its largest amount of information content, which only surfaces when examined at the highest level of discourse-prosodic unit. We believe the results have furthered our understanding of prosodic expression in continuous speech in general and spontaneous conversation in particular, and could readily be utilized in many speech-technology-related implementations.

Citations: 5

Elicit spoken-style data from social media through a style classifier

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357856

A. Chotimongkol, Vataya Chunwijitra, Sumonmas Thatphithakkul, Nattapong Kurpukdee, C. Wutiwiwatchai

Citations: 5

A comparison study on contextual modeling for estimating functional loads of phonological contrasts

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357886

Bin Wu, Yanlu Xie, Jinsong Zhang

Abstract: Functional load (FL) is a quantitative measure of the importance of phonological contrasts, which differentiate communicative linguistic units. Correct estimation of FLs is useful for studies of speech recognition, language evolution, language teaching, etc. Conventional approaches use phonological transcriptions and unigram probabilities for the estimation and are hence weak in contextual modeling. Based on the mutual information (MI) between a text and its phonological transcription, we previously proposed a novel FL measurement that utilizes n-gram word probabilities and hence has better context modeling power. In this study, we compare the effects of different contexts on the estimation of FL: syllable, word, n-gram word model, and open data. Experimental results show that the wider the context modeling, the smaller the FL, and that FL based on MI with the trigram model achieves the best performance in modeling the context in our experiments. Compared with FL based on entropy, FL based on MI shows smaller values and is applicable to open data.
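
For orientation, the conventional entropy-based baseline mentioned in the abstract above can be sketched as follows. This is a minimal illustration assuming the commonly used definition FL(x, y) = (H(L) - H(L_xy)) / H(L), where L_xy is the language with the contrast between x and y merged, together with unigram word probabilities. It is not the authors' MI-based measure, and the toy lexicon and its counts are hypothetical.

```python
import math
from collections import Counter

def entropy(counts):
    """Shannon entropy in bits of a unigram distribution given as counts."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def functional_load(word_counts, x, y):
    """Relative entropy loss when phonemes x and y are merged.

    word_counts maps a phoneme tuple (one word's transcription) to its count.
    """
    merged = Counter()
    for phones, count in word_counts.items():
        # Rewrite every y as x; words that become identical pool their counts.
        merged[tuple(x if p == y else p for p in phones)] += count
    h = entropy(word_counts)
    return (h - entropy(merged)) / h

# Hypothetical toy lexicon with unigram counts.
toy = Counter({("p", "a"): 2, ("b", "a"): 2, ("m", "a"): 4})

print(functional_load(toy, "p", "b"))
print(functional_load(toy, "p", "z"))
```

A contrast that never distinguishes two words (here p versus the unused z) leaves the distribution unchanged and so has zero functional load under this definition.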

Citations: 1

Tonal alignment in Shanghai Chinese

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357878

Bijun Ling, Jie Liang

Citations: 2

Construction and analysis of social-affective interaction corpus in English and Indonesian

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357892

Nurul Lubis, S. Sakti, Graham Neubig, T. Toda, Satoshi Nakamura

Citations: 6

On finding word-level break-type formation rules for Mandarin read speech

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI: 10.1109/ICSDA.2015.7357864

Fu-Ja Kung, Pauline Lee, Yih-Ru Wang, Sin-Horng Chen, Chen-Yu Chiang

Citations: 1