M. O'Kane, Judy Gillis, Philip Rose, Michael Wagner
{"title":"Deciphering speech waveforms","authors":"M. O'Kane, Judy Gillis, Philip Rose, Michael Wagner","doi":"10.1109/ICASSP.1986.1168540","DOIUrl":null,"url":null,"abstract":"Many phoneticians are remarkably expert at 'reading' speech waveforms. This paper describes an attempt to capture this knowledge for use as a segmentation and early labelling knowledge source for a continuous speech recognition system. As well as deriving information from the waveform directly, the decisions made by the waveform deciphering knowledge source are based on a related series of functions derived from the waveform. These functions, which relate to both valley-to-peak and zero crossing measures, are computationally very efficient and it would seem that the frequency analogues of these functions could provide an alternative means of deriving a certain amount of the spectral information more usually obtained through spectrograms.","PeriodicalId":242072,"journal":{"name":"ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1986-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1986.1168540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Many phoneticians are remarkably expert at 'reading' speech waveforms. This paper describes an attempt to capture this knowledge for use as a segmentation and early labelling knowledge source for a continuous speech recognition system. As well as deriving information from the waveform directly, the decisions made by the waveform deciphering knowledge source are based on a related series of functions derived from the waveform. These functions, which relate to both valley-to-peak and zero crossing measures, are computationally very efficient and it would seem that the frequency analogues of these functions could provide an alternative means of deriving a certain amount of the spectral information more usually obtained through spectrograms.