Lyndainês Santos, Nícolas de Araújo Moreira, Robson Sampaio, Raizielle Lima, Francisco Carlos Mattos Brito Oliveira
{"title":"Automatic Speech Recognition: Comparisons Between Convolutional Neural Networks, Hidden Markov Model and Hybrid Architecture","authors":"Lyndainês Santos, Nícolas de Araújo Moreira, Robson Sampaio, Raizielle Lima, Francisco Carlos Mattos Brito Oliveira","doi":"10.1111/exsy.70032","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Automatic Speech Recognition (ASR) systems have been widely used as a practical method of interaction between humans and devices. They are typically employed to enhance the accessibility of devices and to improve the security of systems, among other purposes. However, the design of speech-based systems imposes many challenges due to their particularities. Currently, the majority of ASR systems is based on the Hidden Markov Model (HMM), and, more recently, on Convolutional Neural Networks (CNN). The present research evaluates the performance of Hidden Markov Model (HMM) and Convolutional Neural Network (CNN) algorithms in speech recognition and proposes a novel hybrid approach that combines both methods. The study assesses various performance metrics, including accuracy, precision, recall, F1-score, response time, and computational cost. The experimental tests show that the integration between HMM and CNN increased the accuracy by 6% and 8% when compared to HMM and CNN isolated, respectively, in accordance with results presented in previous papers. However, the results of the ANOVA test revealed that the difference in question is not statistically significant, and the HMM-only approach still being an interesting option for embedded systems due to its lesser demanded computational effort.</p>\n </div>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 5","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70032","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic Speech Recognition (ASR) systems have been widely used as a practical method of interaction between humans and devices. They are typically employed to enhance the accessibility of devices and to improve the security of systems, among other purposes. However, the design of speech-based systems imposes many challenges due to their particularities. Currently, the majority of ASR systems is based on the Hidden Markov Model (HMM), and, more recently, on Convolutional Neural Networks (CNN). The present research evaluates the performance of Hidden Markov Model (HMM) and Convolutional Neural Network (CNN) algorithms in speech recognition and proposes a novel hybrid approach that combines both methods. The study assesses various performance metrics, including accuracy, precision, recall, F1-score, response time, and computational cost. The experimental tests show that the integration between HMM and CNN increased the accuracy by 6% and 8% when compared to HMM and CNN isolated, respectively, in accordance with results presented in previous papers. However, the results of the ANOVA test revealed that the difference in question is not statistically significant, and the HMM-only approach still being an interesting option for embedded systems due to its lesser demanded computational effort.
期刊介绍:
Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper.
As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.