2011 International Conference on Speech Database and Assessments (Oriental COCOSDA): Latest Papers, Page 2

The design and development of PELECAN: Pronunciation Errors from Learners of English Corpus and Annotation

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085976

A. Chotimongkol, Sumonmas Thatphithakkul, P. Chootrakool, C. Hansakunbuntheung, C. Wutiwiwatchai

Abstract: This paper describes the design and construction of PELECAN (Pronunciation Errors from Learners of English Corpus and Annotation). PELECAN was created primarily to collect pronunciation errors from Thai learners of English in order to develop a more suitable pronunciation assessment tool for Thais. A two-phase data collection process is used to balance recording effort against coverage of the acoustic phenomena of interest. The data collected in the first phase contains 1.5 hours of speech from 30 Thai learners reading 2 English passages that cover all English phones. The recorded speech was annotated with two types of error annotation: a phonetic transcription of each incorrect pronunciation and the level of correctness of each phone. A contrastive list was used to guide the error analysis. We found that many pronunciation errors are influenced by the L1 (Thai), e.g. incorrect pronunciation of suffixes and deletion of /l/ and /r/ in consonant clusters. However, some errors, such as those involving schwa, may not be predictable from contrastive analysis alone. Hence, a data-driven approach could help identify errors that may not be foreseen from a purely linguistic point of view.

Citations: 3

Emotions in Hindi speech- analysis, perception and recognition

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085972

S. Agrawal

Abstract: Human speech conveys the speaker's emotional state along with linguistic information. The meaning of a speech sample changes when it is uttered with different emotions. This paper describes several studies conducted to analyze, perceive and recognize commonly occurring emotions in Hindi speech. The emotions are classified as anger, happiness, fear, sadness and surprise, in addition to neutral. Intonation, intensity and duration patterns change with sentence type as well as with emotion. The relationship between the measured acoustic parameters and these patterns has been used to classify them. Experiments were conducted to study and recognise emotions based on phonetic as well as prosodic parameters of the speech samples. These parameters include MFCCs and their derivatives, and the prosodic parameters F0, A0 and duration. In one experiment, vowel segments taken from continuously spoken sentences were used as speech samples for machine recognition of emotions with neural net classifiers; in another, Hindi digits were used. Human perception experiments were conducted at all levels of the experiments, and their results were compared with machine recognition performance. In most cases, machine recognition was found to be better than human performance. Both phonetic and prosodic parameters play a role in the identification of emotions.

Citations: 18

The role of speech technology in service-operation estimation

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085991

Masanori Takehara, S. Tamura, Ryuhei Tenmoku, T. Kurata, S. Hayamizu

Citations: 9

Annotation of Japanese response tokens and preliminary analysis on their distribution in three-party conversations

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6086001

Yasuharu Den, Nao Yoshida, K. Takanashi, H. Koiso

Citations: 20

A question-and-answer classification technique for constructing and managing spoken dialog system

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085987

Ryosuke Inoue, Y. Kurosawa, Kazuya Mera, T. Takezawa

Citations: 2

A multimodal corpus for modeling turn management in multi-party conversations

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085996

H. Furukawa, M. Nishida, Kristiina Jokinen, S. Yamamoto

Citations: 1

Acoustic feature and variance of Uigur vowels

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6086003

Xuewen Zhou, He Hu, Wu Ri Ge, Xi Le Tu, Qi Mu Ge, Zheng Yuling

Citations: 3

Unsupervised spoken term detection with acoustic segment model

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085989

Haipeng Wang, Tan Lee, C. Leung

Citations: 33

Interactive visualization and search system for speech corpora

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085999

S. Itahashi, T. Kajiyama, K. Yamakawa, Y. Ishimoto, T. Matsui

Citations: 2

Construction of speech corpus of AESOP-SD

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) Pub Date : 2011-11-28 DOI: 10.1109/ICSDA.2011.6085977

Yuan Jia, Meng Wang, Honghua Zhai, Ai-jun Li

Citations: 2