{"title":"TFA-CLSTMNN:基于声音诊断的新型卷积网络","authors":"Yuhao He, Xianwei Zheng, Qing Miao","doi":"10.1142/s0219691322500588","DOIUrl":null,"url":null,"abstract":"The outbreak of the global COVID-19 pandemic has become a public crisis and is threatening human life in every country. Recently, researchers have developed testing methods via patients cough recordings. In order to improve the testing accuracy, in this paper, we establish a novel COVID-19 sound-based diagnosis framework, i.e. TFA-CLSTMNN, which integrates time-frequency domain features of the recorded cough with the Attention-Convolution Long Short-Term Memory Neural Network. Specifically, we calculate the Mel-frequency cepstrum coefficient (MFCC) of the cough data to extract the time-frequency domain features. We then apply the convolutional neural network and the attentional mechanism on the time-frequency features, which is followed by the long short-term memory neural network to analyze the MFCC features of the data. The recognition and classification can be then carried out to evaluate the positiveness or negativeness of the tested samples. Experimental results show that the proposed TFA-CLSTMNN framework outperforms the baseline neural networks in sound-based COVID-19 diagnosis and derives an accuracy over 0.95 on the public real-world datasets.","PeriodicalId":158567,"journal":{"name":"Int. J. Wavelets Multiresolution Inf. Process.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"TFA-CLSTMNN: Novel convolutional network for sound-based diagnosis of COVID-19\",\"authors\":\"Yuhao He, Xianwei Zheng, Qing Miao\",\"doi\":\"10.1142/s0219691322500588\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The outbreak of the global COVID-19 pandemic has become a public crisis and is threatening human life in every country. Recently, researchers have developed testing methods via patients cough recordings. In order to improve the testing accuracy, in this paper, we establish a novel COVID-19 sound-based diagnosis framework, i.e. TFA-CLSTMNN, which integrates time-frequency domain features of the recorded cough with the Attention-Convolution Long Short-Term Memory Neural Network. Specifically, we calculate the Mel-frequency cepstrum coefficient (MFCC) of the cough data to extract the time-frequency domain features. We then apply the convolutional neural network and the attentional mechanism on the time-frequency features, which is followed by the long short-term memory neural network to analyze the MFCC features of the data. The recognition and classification can be then carried out to evaluate the positiveness or negativeness of the tested samples. Experimental results show that the proposed TFA-CLSTMNN framework outperforms the baseline neural networks in sound-based COVID-19 diagnosis and derives an accuracy over 0.95 on the public real-world datasets.\",\"PeriodicalId\":158567,\"journal\":{\"name\":\"Int. J. Wavelets Multiresolution Inf. Process.\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Wavelets Multiresolution Inf. Process.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219691322500588\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Wavelets Multiresolution Inf. Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219691322500588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
TFA-CLSTMNN: Novel convolutional network for sound-based diagnosis of COVID-19
The outbreak of the global COVID-19 pandemic has become a public crisis and is threatening human life in every country. Recently, researchers have developed testing methods via patients cough recordings. In order to improve the testing accuracy, in this paper, we establish a novel COVID-19 sound-based diagnosis framework, i.e. TFA-CLSTMNN, which integrates time-frequency domain features of the recorded cough with the Attention-Convolution Long Short-Term Memory Neural Network. Specifically, we calculate the Mel-frequency cepstrum coefficient (MFCC) of the cough data to extract the time-frequency domain features. We then apply the convolutional neural network and the attentional mechanism on the time-frequency features, which is followed by the long short-term memory neural network to analyze the MFCC features of the data. The recognition and classification can be then carried out to evaluate the positiveness or negativeness of the tested samples. Experimental results show that the proposed TFA-CLSTMNN framework outperforms the baseline neural networks in sound-based COVID-19 diagnosis and derives an accuracy over 0.95 on the public real-world datasets.