{"title":"Recognition Using Classification and Segmentation Scoring","authors":"O. Kimball, Mari Ostendorf, J. R. Rohlicek","doi":"10.3115/1075527.1075570","DOIUrl":"https://doi.org/10.3115/1075527.1075570","url":null,"abstract":"Traditional statistical speech recognition systems typically make strong assumptions about the independence of observation frames and generally do not make use of segmental information. In contrast, when the segmentation is known, existing classifiers can readily accommodate segmental information in the decision process. We describe an approach to connected word recognition that allows the use of segmental information through an explicit decomposition of the recognition criterion into classification and segmentation scoring. Preliminary experiments are presented, demonstrating that the proposed framework, using fixed length sequences of cepstral feature vectors for classification of individual phonemes, performs comparably to more traditional recognition approaches that use the entire observation sequence. We expect that performance gain can be obtained using this structure with additional, more general features.","PeriodicalId":215441,"journal":{"name":"Proceedings of the workshop on Speech and Natural Language - HLT '91","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129388103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Human-Machine Problem Solving Using Spoken Language Systems (SLS): Factors Affecting Performance and User Satisfaction","authors":"Elizabeth Shriberg, Elizabeth Wade, P. Price","doi":"10.3115/1075527.1075538","DOIUrl":"https://doi.org/10.3115/1075527.1075538","url":null,"abstract":"We have analyzed three factors affecting user satisfaction and system performance using an SLS implemented in the ATIS domain. We have found that: (1) trade-offs between speed and accuracy have different implications for user satisfaction; (2) recognition performance improves over time, at least in part because of a reduction in sentence perplexity; and (3) hyperarticulation increases recognition errors, and while instructions can reduce this behavior, they do not result in improved recognition performance. We conclude that while users may adapt to some aspects of an SLS, certain types of user behavior may require technological solutions.","PeriodicalId":215441,"journal":{"name":"Proceedings of the workshop on Speech and Natural Language - HLT '91","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124671868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Seneff, James R. Glass, D. Goddeau, David Goodine, L. Hirschman, H. Leung, M. S. Phillips, J. Polifroni, V. Zue
{"title":"Development and Preliminary Evaluation of the MIT ATIS System","authors":"S. Seneff, James R. Glass, D. Goddeau, David Goodine, L. Hirschman, H. Leung, M. S. Phillips, J. Polifroni, V. Zue","doi":"10.3115/112405.112417","DOIUrl":"https://doi.org/10.3115/112405.112417","url":null,"abstract":"This paper represents a status report on the MIT ATIS system. The most significant new achievement is that we now have a speech-input mode. It is based on the MIT SUMMIT system using context independent phone models, and includes a word-pair grammar with perplexity 92 (on the June-90 test set). In addition, we have completely redesigned the back-end component, in order to emphasize portability and extensibility. The parser now produces an intermediate semantic frame representation, which serves as the focal point for all back-end operations, such as history management, text generation, and SQL query generation. Most of those aspects of the system that are tied to a particular domain are now entered through a set of tables associated with a small artificial language for decoding them. We have also improved the display of the database table, making it considerably easier for a subject to comprehend the information given. We report here on the results of the official DARPA February-91 evaluation, as well as on results of an evaluation on data collected at MIT, for both speech input and text input.","PeriodicalId":215441,"journal":{"name":"Proceedings of the workshop on Speech and Natural Language - HLT '91","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115661626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluating the Use of Prosodic Information in Speech Recognition and Understanding","authors":"Mari Ostendorf, P. Price, S. S. Hufnagel","doi":"10.3115/112405.1138645","DOIUrl":"https://doi.org/10.3115/112405.1138645","url":null,"abstract":"","PeriodicalId":215441,"journal":{"name":"Proceedings of the workshop on Speech and Natural Language - HLT '91","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116356009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}