{"title":"On finding word-level break-type formation rules for mandarin read speech","authors":"Fu-Ja Kung, Pauline Lee, Yih-Ru Wang, Sin-Horng Chen, Chen-Yu Chiang","doi":"10.1109/ICSDA.2015.7357864","DOIUrl":null,"url":null,"abstract":"This paper presents a study on exploring word-level break-type formation rules for Mandarin read speech. A 4-layer hierarchical structure with seven break types is adopted to represent the prosody of utterance. The work is based on the break-type tags labeled on a large read-speech database by the prosody labeling and modeling algorithm (PLM) proposed previously. Occurrence frequencies of seven break types for pre- and post-boundaries of several types of function words are calculated and taken as the inferred statistical break-type formation rules. Linguistic interpretations of the most likely break types occurred at pre- and post-boundaries of each function word are discussed. Some exceptions that deviate from the most likely break types are also examined.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2015.7357864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a study on exploring word-level break-type formation rules for Mandarin read speech. A 4-layer hierarchical structure with seven break types is adopted to represent the prosody of utterance. The work is based on the break-type tags labeled on a large read-speech database by the prosody labeling and modeling algorithm (PLM) proposed previously. Occurrence frequencies of seven break types for pre- and post-boundaries of several types of function words are calculated and taken as the inferred statistical break-type formation rules. Linguistic interpretations of the most likely break types occurred at pre- and post-boundaries of each function word are discussed. Some exceptions that deviate from the most likely break types are also examined.