{"title":"基于DNN和迁移学习ResNet50的基于声音生物标志物的COVID-19检测","authors":"Aditya Raj, Ramesh K. Bhukya","doi":"10.1109/UPCON56432.2022.9986454","DOIUrl":null,"url":null,"abstract":"The aim of this study is to automate the detection of COVID-19 patients by analysing the acoustic information embedded in cough samples. COVID-19 is a respiratory disease having cough acoustics as a common symptom and indicator. The primary focus is classification of generated deep features from analytical and mathematical representation of cough acoustics using signal processing techniques Mel-frequency cepstral coefficients (MFCCs) and Mel-spectrogram. MFCCs provides feature vector representation of cough signal and is used as an input for deep neural network (DNN) to generate deep features. Transfer Learning ResNet-50 based Convolutional Neural Network (CNN) model is used to generate deep features from image representation of cough in the form of Mel Spectrogram. Dataset labelling is done with two categories of COVID-19 and Non-COVID-19 classes. Among them, we have used 70% of the dataset for training and 30% for testing purposes. The deep features generated from MFCCs and Mel Spectrograms are concatenated along with a feature value output from a DNN having Metadata as input. The final concatenated feature vector is sent for Softmax based classification. By completing the whole process, we obtained the training AUC (Area Under Curve) (ROC) 95.39%, validation AUC as 88.19% and testing AUC as 88.76%. The analysis of final AUC with epoch curve shows constant increase in training AUC and convergence of validation and testing AUC at certain value representing model training as perfectly fit and no overfitting-underfitting problem.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vocal Biomarker Based COVID-19 Detection Using DNN and Transfer Learning ResNet50\",\"authors\":\"Aditya Raj, Ramesh K. Bhukya\",\"doi\":\"10.1109/UPCON56432.2022.9986454\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this study is to automate the detection of COVID-19 patients by analysing the acoustic information embedded in cough samples. COVID-19 is a respiratory disease having cough acoustics as a common symptom and indicator. The primary focus is classification of generated deep features from analytical and mathematical representation of cough acoustics using signal processing techniques Mel-frequency cepstral coefficients (MFCCs) and Mel-spectrogram. MFCCs provides feature vector representation of cough signal and is used as an input for deep neural network (DNN) to generate deep features. Transfer Learning ResNet-50 based Convolutional Neural Network (CNN) model is used to generate deep features from image representation of cough in the form of Mel Spectrogram. Dataset labelling is done with two categories of COVID-19 and Non-COVID-19 classes. Among them, we have used 70% of the dataset for training and 30% for testing purposes. The deep features generated from MFCCs and Mel Spectrograms are concatenated along with a feature value output from a DNN having Metadata as input. The final concatenated feature vector is sent for Softmax based classification. By completing the whole process, we obtained the training AUC (Area Under Curve) (ROC) 95.39%, validation AUC as 88.19% and testing AUC as 88.76%. The analysis of final AUC with epoch curve shows constant increase in training AUC and convergence of validation and testing AUC at certain value representing model training as perfectly fit and no overfitting-underfitting problem.\",\"PeriodicalId\":185782,\"journal\":{\"name\":\"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UPCON56432.2022.9986454\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986454","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Vocal Biomarker Based COVID-19 Detection Using DNN and Transfer Learning ResNet50
The aim of this study is to automate the detection of COVID-19 patients by analysing the acoustic information embedded in cough samples. COVID-19 is a respiratory disease having cough acoustics as a common symptom and indicator. The primary focus is classification of generated deep features from analytical and mathematical representation of cough acoustics using signal processing techniques Mel-frequency cepstral coefficients (MFCCs) and Mel-spectrogram. MFCCs provides feature vector representation of cough signal and is used as an input for deep neural network (DNN) to generate deep features. Transfer Learning ResNet-50 based Convolutional Neural Network (CNN) model is used to generate deep features from image representation of cough in the form of Mel Spectrogram. Dataset labelling is done with two categories of COVID-19 and Non-COVID-19 classes. Among them, we have used 70% of the dataset for training and 30% for testing purposes. The deep features generated from MFCCs and Mel Spectrograms are concatenated along with a feature value output from a DNN having Metadata as input. The final concatenated feature vector is sent for Softmax based classification. By completing the whole process, we obtained the training AUC (Area Under Curve) (ROC) 95.39%, validation AUC as 88.19% and testing AUC as 88.76%. The analysis of final AUC with epoch curve shows constant increase in training AUC and convergence of validation and testing AUC at certain value representing model training as perfectly fit and no overfitting-underfitting problem.