Naouar Ouattassi, Mustapha Maaroufi, Hajar Slaoui, Taha Benateya Andaloussi, Arsalane Zarghili, Mohamed Nouredine El Amine El Alami
{"title":"Middle ear-acquired cholesteatoma diagnosis based on CT scan image mining using supervised machine learning models","authors":"Naouar Ouattassi, Mustapha Maaroufi, Hajar Slaoui, Taha Benateya Andaloussi, Arsalane Zarghili, Mohamed Nouredine El Amine El Alami","doi":"10.1186/s43088-024-00534-5","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Distinguishing between middle ear cholesteatoma and chronic suppurative otitis media (CSOM) is an ongoing challenge. While temporal bone computed tomography (CT) scan is highly accurate for diagnosing middle ear conditions, its specificity in discerning between cholesteatoma and CSOM is only moderate. To address this issue, we utilized trained machine learning models to enhance the specificity of temporal bone CT scan in diagnosing middle ear cholesteatoma. Our database consisted of temporal bone CT scan native images from 122 patients diagnosed with middle ear cholesteatoma and a control group of 115 patients diagnosed with CSOM, with both groups labeled based on surgical findings. We preprocessed the native images to isolate the region of interest and then utilized the Inception V3 convolutional neural network for image embedding into data vectors. Classification was performed using machine learning models including support vector machine (SVM), k-nearest neighbors (k-NN), random forest, and neural network. Statistical metrics employed to interpret the results included classification accuracy, precision, recall, F1 score, confusion matrix, area under the receiver operating characteristic curve (AUC), and FreeViz diagram.</p><h3>Results</h3><p>Our training dataset comprised 5390 images, and the testing dataset included 125 different images. The neural network, k-NN, and SVM models demonstrated significantly higher relevance in terms of classification accuracy, precision, and recall compared to the random forest model. For instance, the F1 scores were 0.974, 0.987, and 0.897, respectively, for the former three models, in contrast to 0.661 for the random forest model.</p><h3>Conclusion</h3><p>The performance metrics of the presented trained machine learning models hold promising prospects as potentially clinically useful aids.</p></div>","PeriodicalId":481,"journal":{"name":"Beni-Suef University Journal of Basic and Applied Sciences","volume":"13 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bjbas.springeropen.com/counter/pdf/10.1186/s43088-024-00534-5","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Beni-Suef University Journal of Basic and Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/s43088-024-00534-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Distinguishing between middle ear cholesteatoma and chronic suppurative otitis media (CSOM) is an ongoing challenge. While temporal bone computed tomography (CT) scan is highly accurate for diagnosing middle ear conditions, its specificity in discerning between cholesteatoma and CSOM is only moderate. To address this issue, we utilized trained machine learning models to enhance the specificity of temporal bone CT scan in diagnosing middle ear cholesteatoma. Our database consisted of temporal bone CT scan native images from 122 patients diagnosed with middle ear cholesteatoma and a control group of 115 patients diagnosed with CSOM, with both groups labeled based on surgical findings. We preprocessed the native images to isolate the region of interest and then utilized the Inception V3 convolutional neural network for image embedding into data vectors. Classification was performed using machine learning models including support vector machine (SVM), k-nearest neighbors (k-NN), random forest, and neural network. Statistical metrics employed to interpret the results included classification accuracy, precision, recall, F1 score, confusion matrix, area under the receiver operating characteristic curve (AUC), and FreeViz diagram.
Results
Our training dataset comprised 5390 images, and the testing dataset included 125 different images. The neural network, k-NN, and SVM models demonstrated significantly higher relevance in terms of classification accuracy, precision, and recall compared to the random forest model. For instance, the F1 scores were 0.974, 0.987, and 0.897, respectively, for the former three models, in contrast to 0.661 for the random forest model.
Conclusion
The performance metrics of the presented trained machine learning models hold promising prospects as potentially clinically useful aids.
期刊介绍:
Beni-Suef University Journal of Basic and Applied Sciences (BJBAS) is a peer-reviewed, open-access journal. This journal welcomes submissions of original research, literature reviews, and editorials in its respected fields of fundamental science, applied science (with a particular focus on the fields of applied nanotechnology and biotechnology), medical sciences, pharmaceutical sciences, and engineering. The multidisciplinary aspects of the journal encourage global collaboration between researchers in multiple fields and provide cross-disciplinary dissemination of findings.