{"title":"Grouping Intrinsic Mode Functions and Residue for Pathological Classifications via Electroglottograms","authors":"G. Liao, B.W.-K. Ling, K.-G. Pang","doi":"10.1016/j.irbm.2022.11.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><p>The electroglottogram<span> (EGG) is a signal used for measuring the change of the relative contact area in the vocal cord during the throat production. In the recent years, the low cost and the non-invasive applications have been derived. Hence, the EGG has been applied in various science, engineering and medical fields such as in the basic voice science including the phonetics, the singing and the hearing as well as in the speech and the language therapy and the related clinical works including the voice production physiology, the swallowing and the psychology. However, the pathological classifications using the EGGs usually yield the poor performances. This is because the EGGs are required to decompose into the various components for extracting the features for performing the classifications. Nevertheless, the total numbers of the components decomposed by some time frequency representation such as the empirical mode decomposition (EMD) for different EGGs are different. Hence, the dimension of the feature vectors extracted from different EGGs is different. This introduces to the difficulty for building a machine learning model for performing the classification. This paper is to address this issue.</span></p></div><div><h3>Material and methods</h3><p>This paper proposes a method for grouping the intrinsic mode functions<span><span> (IMFs) and the residue obtained by applying the EMD to the EGGs for classifying between the healthy subjects and the pathological subjects. More precisely, this paper proposes a clustering based method to group the IMFs and the residue so that the total numbers of the grouped IMFs of different EGGs are the same. First, the IMFs and the residue of the EGGs are categorized into a desired number of groups based on their correlation coefficients. Second, the IMFs or the residue in each group are summed together to obtain the grouped IMF. Third, the mean frequency and the first formant of each grouped IMF are computed. Finally, a random forest is employed for performing the classification. To our best knowledge, this joint EMD and clustering based method is firstly proposed to preform the pathological voice detection. The </span>computer numerical simulations are conducted using the online available Saarbrücken voice database.</span></p></div><div><h3>Results</h3><p>Here, five cross validations have been performed. The mean accuracy, the mean specificity and the mean sensitivity among these five validations are 86.98, 79.92 and 91.57, respectively. The standard deviation of the accuracy, the specificity and the sensitivity among these five validations are ±2.00%, ±3.71% and ±2.13%, respectively. The simulation results show that our proposed method outperforms the common EGG or speech processing based methods.</p></div><div><h3>Conclusion</h3><p><span>This paper proposes a clustering based method for grouping the IMFs and the residue for performing the pathological classifications via the EGGs. The grouping criterion is based on the correlation coefficients. It is found that our proposed method can achieve the highest classifications for the majority signal to noise ratios compared to the </span>state of the arts methods.</p></div>","PeriodicalId":14605,"journal":{"name":"Irbm","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Irbm","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1959031822001166","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
The electroglottogram (EGG) is a signal used for measuring the change of the relative contact area in the vocal cord during the throat production. In the recent years, the low cost and the non-invasive applications have been derived. Hence, the EGG has been applied in various science, engineering and medical fields such as in the basic voice science including the phonetics, the singing and the hearing as well as in the speech and the language therapy and the related clinical works including the voice production physiology, the swallowing and the psychology. However, the pathological classifications using the EGGs usually yield the poor performances. This is because the EGGs are required to decompose into the various components for extracting the features for performing the classifications. Nevertheless, the total numbers of the components decomposed by some time frequency representation such as the empirical mode decomposition (EMD) for different EGGs are different. Hence, the dimension of the feature vectors extracted from different EGGs is different. This introduces to the difficulty for building a machine learning model for performing the classification. This paper is to address this issue.
Material and methods
This paper proposes a method for grouping the intrinsic mode functions (IMFs) and the residue obtained by applying the EMD to the EGGs for classifying between the healthy subjects and the pathological subjects. More precisely, this paper proposes a clustering based method to group the IMFs and the residue so that the total numbers of the grouped IMFs of different EGGs are the same. First, the IMFs and the residue of the EGGs are categorized into a desired number of groups based on their correlation coefficients. Second, the IMFs or the residue in each group are summed together to obtain the grouped IMF. Third, the mean frequency and the first formant of each grouped IMF are computed. Finally, a random forest is employed for performing the classification. To our best knowledge, this joint EMD and clustering based method is firstly proposed to preform the pathological voice detection. The computer numerical simulations are conducted using the online available Saarbrücken voice database.
Results
Here, five cross validations have been performed. The mean accuracy, the mean specificity and the mean sensitivity among these five validations are 86.98, 79.92 and 91.57, respectively. The standard deviation of the accuracy, the specificity and the sensitivity among these five validations are ±2.00%, ±3.71% and ±2.13%, respectively. The simulation results show that our proposed method outperforms the common EGG or speech processing based methods.
Conclusion
This paper proposes a clustering based method for grouping the IMFs and the residue for performing the pathological classifications via the EGGs. The grouping criterion is based on the correlation coefficients. It is found that our proposed method can achieve the highest classifications for the majority signal to noise ratios compared to the state of the arts methods.
期刊介绍:
IRBM is the journal of the AGBM (Alliance for engineering in Biology an Medicine / Alliance pour le génie biologique et médical) and the SFGBM (BioMedical Engineering French Society / Société française de génie biologique médical) and the AFIB (French Association of Biomedical Engineers / Association française des ingénieurs biomédicaux).
As a vehicle of information and knowledge in the field of biomedical technologies, IRBM is devoted to fundamental as well as clinical research. Biomedical engineering and use of new technologies are the cornerstones of IRBM, providing authors and users with the latest information. Its six issues per year propose reviews (state-of-the-art and current knowledge), original articles directed at fundamental research and articles focusing on biomedical engineering. All articles are submitted to peer reviewers acting as guarantors for IRBM''s scientific and medical content. The field covered by IRBM includes all the discipline of Biomedical engineering. Thereby, the type of papers published include those that cover the technological and methodological development in:
-Physiological and Biological Signal processing (EEG, MEG, ECG…)-
Medical Image processing-
Biomechanics-
Biomaterials-
Medical Physics-
Biophysics-
Physiological and Biological Sensors-
Information technologies in healthcare-
Disability research-
Computational physiology-
…