{"title":"SLHAR: A supervised learning approach for homophone ambiguity reduction from speech recognition system","authors":"P. Ghosh, T. S. Chingtham, M. Ghose","doi":"10.1109/ICRCICN.2016.7813543","DOIUrl":null,"url":null,"abstract":"An automatic Speech to Text (STT) conversion technology has been developed for making a visual text layout of the Speech Input for advancement of Science and Technology. This technology enables people an alternative way to understand voice communication, and pursue instruction using their visual ability. The visual ability becomes more powerful than the listening ability some time more than even in remote communication, and STT conversion plays a role as an important tool in such cases. The system faces many kind of ambiguity during STT. The research focuses on the Homophone Ambiguity and with the help of the Classified Supervised learning it tries to improve it partially. In the proposed Supervised learning based Homophone Ambiguity Reduction (SLHAR), a large dataset are taken as homophones and Homophone Sets are assembled by Hierarchical Clustering Method. The proposed system communicates with the user in case of Homophones and converts them to the text format.","PeriodicalId":254393,"journal":{"name":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2016.7813543","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
An automatic Speech to Text (STT) conversion technology has been developed for making a visual text layout of the Speech Input for advancement of Science and Technology. This technology enables people an alternative way to understand voice communication, and pursue instruction using their visual ability. The visual ability becomes more powerful than the listening ability some time more than even in remote communication, and STT conversion plays a role as an important tool in such cases. The system faces many kind of ambiguity during STT. The research focuses on the Homophone Ambiguity and with the help of the Classified Supervised learning it tries to improve it partially. In the proposed Supervised learning based Homophone Ambiguity Reduction (SLHAR), a large dataset are taken as homophones and Homophone Sets are assembled by Hierarchical Clustering Method. The proposed system communicates with the user in case of Homophones and converts them to the text format.