{"title":"基于估计-最大化算法的随机分段建模(语音识别)","authors":"Salim Roukos, Mari Ostendorf, H. Gish, A. Derr","doi":"10.1109/ICASSP.1988.196528","DOIUrl":null,"url":null,"abstract":"A probabilistic model called the stochastic segment model is introduced that describes the statistical dependence of all the frames of a speech segment. The model uses a time-warping transformation to map the sequence of observed frames to the appropriate frames of the segment model. The joint density of the observed frames is then given by the joint density of the selected model frames. The automatic training and recognition algorithms are discussed and a few preliminary recognition results are presented.<<ETX>>","PeriodicalId":448544,"journal":{"name":"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"Stochastic segment modelling using the estimate-maximize algorithm (speech recognition)\",\"authors\":\"Salim Roukos, Mari Ostendorf, H. Gish, A. Derr\",\"doi\":\"10.1109/ICASSP.1988.196528\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A probabilistic model called the stochastic segment model is introduced that describes the statistical dependence of all the frames of a speech segment. The model uses a time-warping transformation to map the sequence of observed frames to the appropriate frames of the segment model. The joint density of the observed frames is then given by the joint density of the selected model frames. The automatic training and recognition algorithms are discussed and a few preliminary recognition results are presented.<<ETX>>\",\"PeriodicalId\":448544,\"journal\":{\"name\":\"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1988-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1988.196528\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1988.196528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Stochastic segment modelling using the estimate-maximize algorithm (speech recognition)
A probabilistic model called the stochastic segment model is introduced that describes the statistical dependence of all the frames of a speech segment. The model uses a time-warping transformation to map the sequence of observed frames to the appropriate frames of the segment model. The joint density of the observed frames is then given by the joint density of the selected model frames. The automatic training and recognition algorithms are discussed and a few preliminary recognition results are presented.<>