{"title":"A novel method for blind segmentation of Thai continuous speech","authors":"S. Potisuk","doi":"10.1109/DSP-SPE.2015.7369590","DOIUrl":null,"url":null,"abstract":"This paper describes an acoustical investigation on Thai speech segmentation using a combination of average level crossing rate (ALCR) and root-mean-square (RMS) energy. Simple and easy to compute, ALCR information alone was successfully used in an automatic speech segmentation system for English. However, ALCR has never been applied to Thai. As a result, the objective of the study is to apply ALCR information to ascertain its usefulness in detecting significant temporal changes in continuous Thai Speech. An experiment was conducted on a small speech corpus containing 21 sentences. Preliminary results suggest that ALCR and RMS energy can be used to detect the phonetic boundary between obstruent initial consonant and preceding/following vowel. In addition, it can also be used to detect boundary between final consonant of the preceding syllable and initial consonant of the following syllable except for the case involving two successive non-identical nasals. The overall accuracy is around 81% for data from four speakers.","PeriodicalId":91992,"journal":{"name":"2015 IEEE Signal Processing and Signal Processing Education Workshop (SP/SPE)","volume":"104 1","pages":"415-420"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE Signal Processing and Signal Processing Education Workshop (SP/SPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSP-SPE.2015.7369590","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper describes an acoustical investigation on Thai speech segmentation using a combination of average level crossing rate (ALCR) and root-mean-square (RMS) energy. Simple and easy to compute, ALCR information alone was successfully used in an automatic speech segmentation system for English. However, ALCR has never been applied to Thai. As a result, the objective of the study is to apply ALCR information to ascertain its usefulness in detecting significant temporal changes in continuous Thai Speech. An experiment was conducted on a small speech corpus containing 21 sentences. Preliminary results suggest that ALCR and RMS energy can be used to detect the phonetic boundary between obstruent initial consonant and preceding/following vowel. In addition, it can also be used to detect boundary between final consonant of the preceding syllable and initial consonant of the following syllable except for the case involving two successive non-identical nasals. The overall accuracy is around 81% for data from four speakers.