{"title":"Entropy-Based Analysis of the Prosodic Features of Chinese Dialects","authors":"Raymond W. M. Ng, Tan Lee","doi":"10.1109/CHINSL.2008.ECP.28","DOIUrl":null,"url":null,"abstract":"In this paper, a novel approach is proposed to analyze prosodic features of four Chinese dialects: Wu, Cantonese, Min and Mandarin. The ultimate goal is to exploit these features in the task of automatic spoken language identification. Two entropy-based evaluation metrics are formulated to address the problems of data sparseness and lack of speakers. Different prosody-related acoustic features and their combinations are evaluated. FO, FO gradient and intensity are found to contain the most language-related information. Maximum language-related information are observed in multi-dimensional N-gram features with FO, FO gradient and syllable position in sentence. There are also some uncertain results that reveal the limitations of the proposed metrics.","PeriodicalId":291958,"journal":{"name":"2008 6th International Symposium on Chinese Spoken Language Processing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 6th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2008.ECP.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper, a novel approach is proposed to analyze prosodic features of four Chinese dialects: Wu, Cantonese, Min and Mandarin. The ultimate goal is to exploit these features in the task of automatic spoken language identification. Two entropy-based evaluation metrics are formulated to address the problems of data sparseness and lack of speakers. Different prosody-related acoustic features and their combinations are evaluated. FO, FO gradient and intensity are found to contain the most language-related information. Maximum language-related information are observed in multi-dimensional N-gram features with FO, FO gradient and syllable position in sentence. There are also some uncertain results that reveal the limitations of the proposed metrics.