2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)最新文献

筛选
英文 中文
Melody of Mandarin L2 English—when L1 transfer and L2 planning come together 当L1迁移和L2规划结合在一起时,普通话L2英语的旋律
Chao-yu Su, Chiu-yu Tseng
{"title":"Melody of Mandarin L2 English—when L1 transfer and L2 planning come together","authors":"Chao-yu Su, Chiu-yu Tseng","doi":"10.1109/ICSDA.2015.7357871","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357871","url":null,"abstract":"It is always more difficult for L2 speakers to produce the melody and tempo of continuous speech because it requires simultaneous planning of L2 linguistic specifications, higher level discourse associations and information placements. We assume that higher level planning requires within-phrase chunking and cross-phrase paragraph phrasing while information arrangements through emphasis weighting assignment and allocation. The above involved planning is most notably delivered through distinct global melodic modulations and patterns. The present compares the onset features of extracted Phrase Commands and their consistency with tagged discourse units and perceived emphases using speech data of L1 English, Taiwan (TW) L2 English and TW L1 Mandarin. Explicitly, we study F0 contour features to compare L1/L2 chunking units and their global patterns to pinpoint L2 features. Results of distinct TW L2 features compared with L1 English are (1) less consistent discourse chunking, (2) fewer distinct contours by prosodic words, (3) less degree of emphasis contrast in prosodic phrase and (4) more distinct contours in non-emphases. While (1) and (2) may reflect general L2 planning difficulties, our results show that (3) and (4) namely, flatter overall contour, are Mandarin inherent transferred to L2. We believe our proposed methods of extracted Phrase Command more accurately and better represent global melodic features that could be applied to other L1/L1 comparison in general; the findings could also be directly applied to CALL development of L2 prosody enhancement to improve overall intelligibility.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131898643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
District names speech corpus for Pakistani Languages 巴基斯坦语地区名称语料库
Sahar Rauf, Asima Hameed, T. Habib, S. Hussain
{"title":"District names speech corpus for Pakistani Languages","authors":"Sahar Rauf, Asima Hameed, T. Habib, S. Hussain","doi":"10.1109/ICSDA.2015.7357893","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357893","url":null,"abstract":"This paper presents a speech corpus that is developed for Urdu automatic speech recognition (ASR) system. The corpus comprises of single word utterances fixed vocabulary consisting of district names of Pakistan. The data is recorded over a telephone channel from all over Pakistan to cover six major accents; Punjabi, Urdu, Saraiki, Pashto, Sindhi, and Balochi. The data was collected in challenging acoustic environments; the major issues were silence, background noise and alternate pronunciations, which can affect the performance of the system. In order to address these issues, comprehensive data verification and cleaning guidelines are presented. The proposed process serves as a data preprocessing step for the development of ASR, which is successfully integrated in an Urdu dialog system to provide weather information of Pakistan.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133230919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Corpus design and development of an annotated speech database for Punjabi 旁遮普语注释语音数据库的语料库设计与开发
S. Bansal, S. Sharan, S. Agrawal
{"title":"Corpus design and development of an annotated speech database for Punjabi","authors":"S. Bansal, S. Sharan, S. Agrawal","doi":"10.1109/ICSDA.2015.7357860","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357860","url":null,"abstract":"Punjabi is an important Indo-Aryan languages spoken in India and in some other countries especially Pakistan. It is a tonal language and its phonetic and phonological aspects have not been studied very much. The present paper reports development of phonemically annotated speech database of Malwai dialect of Punjabi. A phonetically rich text database of 1500 words and 300 sentences from a corpus of about 300,000 words was created. These were recorded by 25 male and 25 female speaker format with sampling rate of 16 kHz and 16 bit. The recordings were made in the native places of speakers possessing the original version the Malwai dialect of Punjabi. The recorded data was segmented and labeled phonemically to get the phonemic and sub-phonemic elements of each phoneme and the tonemes of Punjabi language. The annotated database can be useful for phonetic studies and to develop Punjabi speech synthesis system.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122543562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Tonal phoneme based model for Vietnamese LVCSR 基于声调音素的越南语LVCSR模型
V. Nguyen, C. Luong, T. Vu
{"title":"Tonal phoneme based model for Vietnamese LVCSR","authors":"V. Nguyen, C. Luong, T. Vu","doi":"10.1109/ICSDA.2015.7357876","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357876","url":null,"abstract":"This paper proposes an algorithm that is first known as a grapheme-to-phoneme method to transform any Vietnamese word to a tonal phoneme-based pronunciation. The tonal phoneme set produced by this algorithm is further used to develop some acoustic models which integrated tone information and tonal feature. The processes using the Kaldi toolkit to develop a LVCSR system and extract a bottleneck feature which is calculated from a trained deep neural network for Vietnamese are also presented. The results showed that the use of tonal phoneme improved by 1.54% of word error rate (WER) compared to the system using the nontonal phoneme, the use of tonal feature information improved by 4.65% of WER, and of the bottleneck feature gave the best WER with about 10% improvement.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116138285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Automatic word segmentation for spoken Cantonese 粤语口语自动分词
R. Fung, B. Bigi
{"title":"Automatic word segmentation for spoken Cantonese","authors":"R. Fung, B. Bigi","doi":"10.1109/ICSDA.2015.7357891","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357891","url":null,"abstract":"Though Cantonese is the most influential variety of Chinese other than Mandarin, there are only a limited number of Cantonese corpora available for linguistic studies. Among the essential steps of building a corpus, word segmentation is a necessary but highly challenging task due to the lack of clear word boundary in Cantonese. This paper reports the construction and evaluation of an open-source automatic Cantonese word segmenter developed for Cantonese. The tool is a component of the multilingual SPPAS program designed to be used directly by linguists. It is a free software distributed under a GPL license. The effectiveness of the tool was evaluated by comparing the result of segmenting some samples of a spoken Cantonese corpus manually and automatically using the tool developed. High precision and recall were found in our study. Upon completion, the tool would definitely promote the development of more Cantonese corpora for language related studies.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130506507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An empirical study of phonetic transfer in English monophthong learning by Tibetan (Lhasa) speakers 西藏(拉萨)人英语单音节学习中语音迁移的实证研究
Feng Hui, Zhao Lu, Dang Jianwu
{"title":"An empirical study of phonetic transfer in English monophthong learning by Tibetan (Lhasa) speakers","authors":"Feng Hui, Zhao Lu, Dang Jianwu","doi":"10.1109/ICSDA.2015.7357888","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357888","url":null,"abstract":"English teaching has been carried out in every region of China. However, the teaching of English phonetics in minority regions faces the problems of poor teaching quality and low intelligibility of phonetic production. Under the framework of Speech Learning Model (SLM), the current research adopts the experimental phonetic approach to select 10 Lhasa students (5 male and 5 female) from the Corpus of Chinese, English, Tibetan by Tibetan Speakers (CETTS) and examine the characteristics of the vowel space of their English in continuous speech, with a purpose to explore minority students' phonetic transfer of L1 and L2 on L3 vowel system. Euclidean distances between Tibetan speakers' English and Tibetan, and between Tibetan speakers' English and Chinese, reveal the degree of phonetic transfer of Tibetan (L1) and Chinese (L2) on English (L3). Results show that the nonstandard English vowel production by Lhasa speakers are more influenced by the similar vowels in L1 or L2. Due to equivalence classification, Lhasa students' English vowels /a:/, /i:/ and /i/ are more influenced by Tibetan similar vowels, and their English vowels /α/ and /u:/ are more influenced by Chinese similar vowels. In addition, Tibetan female speakers' /α:/ is more influenced by the similar vowel in Chinese. The study has revealed that when Tibetan speakers learn L3, the interference seems to be partly from their L1 and partly from their L2. Therefore, in minority students' L3 vowel learning, both L1 and L2 have influence on their L3 vowel system.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114649260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Acoustic analysis of English lexical stress produced by native (L1) Bengali speakers compared to native (L1) English speakers 母语为孟加拉语的人与母语为英语的人所产生的英语词汇重音的声学分析
Shambhu Nath Saha, Shyamal Kr. Das Mandal
{"title":"Acoustic analysis of English lexical stress produced by native (L1) Bengali speakers compared to native (L1) English speakers","authors":"Shambhu Nath Saha, Shyamal Kr. Das Mandal","doi":"10.1109/ICSDA.2015.7357874","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357874","url":null,"abstract":"English lexical stress is acoustically related to combination of fundamental frequency (F0), duration, intensity and vowel quality. Current study compares the use of these correlates by 10 L1 English and 20 L1 Bengali speakers to find out which correlates are most difficult for Bengali speakers to acquire. Results showed that English and Bengali speakers used the acoustic correlates of vowel duration, intensity and F0 in similar manner, but Bengali speakers produced significantly less English like stress patterns. English speakers reduced vowel duration significantly more in the unstressed vowels compared to Bengali speakers and degree of intensity and F0 increase in stressed vowels by English speakers was higher than that by Bengali speakers. Moreover Bengali speakers produced English like vowel quality in certain unstressed syllables, but in other cases there were significant differences in vowel quality across groups. This study supports the idea of interference from L1 to L2 (nonnative) phonology.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121788810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An articulatory analysis of apical syllables in Standard Chinese 普通话尖音节的发音分析
Yu Chen, Jin Zhang, Yanting Chen, Yu Chen, Licheng Liu, Jianguo Wei, J. Dang
{"title":"An articulatory analysis of apical syllables in Standard Chinese","authors":"Yu Chen, Jin Zhang, Yanting Chen, Yu Chen, Licheng Liu, Jianguo Wei, J. Dang","doi":"10.1109/ICSDA.2015.7357877","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357877","url":null,"abstract":"In the present paper, the synchronous Ultrasonic and EMA data of two participants were collected to observe the Tongue and Jaw movements during the production of zi and zhi in Mandarin Chinese. Results of data analysis partly verified the viewpoint that the performance of articulators in producing the two apical syllables are different, which in turn suggests that the vowels of the two apical syllables are totally different in nature, with the vowel after z as the voiced extension of that alveolar apical consonant, and the vowel after the post-alveolar consonant zh being a real apical vowel in Standard Chinese.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122791174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Prosodic processing in developmental dyslexia: A case study in Standard Chinese 发展性阅读障碍的韵律加工:以普通话为例
Yi Yuan, Ai-jun Li, Yuan Jia, Jianhua Hu, Balazs Surany
{"title":"Prosodic processing in developmental dyslexia: A case study in Standard Chinese","authors":"Yi Yuan, Ai-jun Li, Yuan Jia, Jianhua Hu, Balazs Surany","doi":"10.1109/ICSDA.2015.7357866","DOIUrl":"https://doi.org/10.1109/ICSDA.2015.7357866","url":null,"abstract":"The present case study investigated the speech production and perception of developmental dyslexia through prosodic cues in Standard Chinese. As a non-alphabetic tonal language, there is a complex neurobiological mechanism between visual picture processing and semantic priming, perception and production of prosodic patterns. The experimental materials employed in the study were the syntactic structure of Double Object construction with various information structures. The results show that the dyslexic participant, a native Chinese adult, i.e. M6, had an impaired perceptual ability to discriminate boundaries and sentence accents in different information structures. While on the aspect of production, both sentence accents and boundaries produced by M6 could be perceived by other normal participants, however, every syntactic structure has a consistent pattern in all the focus conditions. The results implied that in normal participants, prosody processing (i.e., sentence accent and boundary perception) was interacted with higher-level information of language, such as syntactic and information structures, however, the dyslexic shows no such interactions.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128285740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
[Various country reports] [各国报告]
Hong Kong, Tan Lee
{"title":"[Various country reports]","authors":"Hong Kong, Tan Lee","doi":"10.1109/icsda.2015.7357895","DOIUrl":"https://doi.org/10.1109/icsda.2015.7357895","url":null,"abstract":"AnnotCltion on discourse level InionnaLion structure/Rhetoric structure an a which have Lhe [unctions including: ('3 Phonetic transcription for dialect recording (\".3 xRecorder: a corpus recording, anlysis and management tools (\".3 Word pronunciation presented in both ancient and present times (\".3 Grammar materials for 14 dialects ('3 syllable materials and their SOllllds of 70 speakers from Beijing dialect (\".3 http://9zhou.phonetics.org.cn","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123070747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信