Identifying Laryngeal Neoplasms in Laryngoscope Images via Deep Learning Based Object Detection: A Case Study on an Extremely Small Data Set

IF 5.6 4区医学 Q1 ENGINEERING, BIOMEDICAL

Irbm Pub Date : 2023-09-07 DOI:10.1016/j.irbm.2023.100799

Shijie Fang , Jia Fu , Chen Du , Tong Lin , Yan Yan

{"title":"Identifying Laryngeal Neoplasms in Laryngoscope Images via Deep Learning Based Object Detection: A Case Study on an Extremely Small Data Set","authors":"Shijie Fang , Jia Fu , Chen Du , Tong Lin , Yan Yan","doi":"10.1016/j.irbm.2023.100799","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><p><span><span>Laryngoscopy is a medical procedure for obtaining a view of the human </span>larynx. It is challenging for clinicians to distinguish </span>laryngeal neoplasms<span> by human visual observation. Recent deep learning methods can assist clinicians in improving the accuracy of distinguishing. However, existed methods are often trained on large-scale private datasets, while other researchers and hospitals can neither access these private datasets nor afford to build such large-scale datasets. In this paper, we focus on identifying laryngeal neoplasms under the “small data” regime, which is more important for many small hospitals to investigate deep learning models for diagnosis.</span></p></div><div><h3>Material and methods</h3><p>We build an extremely small dataset consisting of 279 laryngoscopic images of different categories. We found that traditional deep learning models for image classification<span> cannot achieve satisfactory performance for small data, due to the great variability of recording laryngoscopic images and the small area of the neoplasms. To address these difficulties, we propose to employ object detection methods for this small data problem. Concretely, a Faster R-CNN is implemented here, which combines the DropBlock regularization technique to alleviate overfitting additionally.</span></p></div><div><h3>Results</h3><p>Compared to previous methods, our model is more robust to overfitting and can predict the location and category of detected neoplasms simultaneously. Our method achieves 73.00% overall accuracy, which is higher than the average of clinicians (65.05%) and the recent state-of-the-art classification method (65.00%).</p></div><div><h3>Conclusion</h3><p>The proposed method shows great ability to detect both the category and location of neoplasms and can be served as a screening tool to help the final decisions of clinicians.</p></div>","PeriodicalId":14605,"journal":{"name":"Irbm","volume":"44 6","pages":"Article 100799"},"PeriodicalIF":5.6000,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Irbm","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1959031823000489","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives

Laryngoscopy is a medical procedure for obtaining a view of the human larynx. It is challenging for clinicians to distinguish laryngeal neoplasms by human visual observation. Recent deep learning methods can assist clinicians in improving the accuracy of distinguishing. However, existed methods are often trained on large-scale private datasets, while other researchers and hospitals can neither access these private datasets nor afford to build such large-scale datasets. In this paper, we focus on identifying laryngeal neoplasms under the “small data” regime, which is more important for many small hospitals to investigate deep learning models for diagnosis.

Material and methods

We build an extremely small dataset consisting of 279 laryngoscopic images of different categories. We found that traditional deep learning models for image classification cannot achieve satisfactory performance for small data, due to the great variability of recording laryngoscopic images and the small area of the neoplasms. To address these difficulties, we propose to employ object detection methods for this small data problem. Concretely, a Faster R-CNN is implemented here, which combines the DropBlock regularization technique to alleviate overfitting additionally.

Results

Compared to previous methods, our model is more robust to overfitting and can predict the location and category of detected neoplasms simultaneously. Our method achieves 73.00% overall accuracy, which is higher than the average of clinicians (65.05%) and the recent state-of-the-art classification method (65.00%).

Conclusion

The proposed method shows great ability to detect both the category and location of neoplasms and can be served as a screening tool to help the final decisions of clinicians.

Abstract Image

查看原文本刊更多论文

基于深度学习的目标检测在喉镜图像中识别喉肿瘤:一个极小数据集的案例研究

目的喉镜检查是一种获取人类喉部视野的医学程序。对于临床医生来说，通过人类视觉观察来区分喉部肿瘤是一项具有挑战性的工作。最近的深度学习方法可以帮助临床医生提高区分的准确性。然而，现有的方法通常是在大规模的私人数据集上进行训练的，而其他研究人员和医院既无法访问这些私人数据集，也无力构建这样的大规模数据集。在本文中，我们专注于在“小数据”机制下识别喉部肿瘤，这对许多小型医院研究诊断的深度学习模型更为重要。材料和方法我们建立了一个极小的数据集，由279张不同类别的喉镜图像组成。我们发现，由于记录喉镜图像的可变性大，肿瘤面积小，传统的图像分类深度学习模型无法在小数据下实现令人满意的性能。为了解决这些困难，我们建议对这个小数据问题使用对象检测方法。具体来说，这里实现了一个更快的R-CNN，它结合了DropBlock正则化技术来额外缓解过拟合。结果与以往的方法相比，我们的模型对过拟合更具鲁棒性，可以同时预测检测到的肿瘤的位置和类别。我们的方法总体准确率达到73.00%，高于临床医生的平均水平（65.05%）和最新的分类方法（65.00%）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Irbm ENGINEERING, BIOMEDICAL-

CiteScore

10.30

自引率

4.20%

发文量

审稿时长

57 days

期刊介绍： IRBM is the journal of the AGBM (Alliance for engineering in Biology an Medicine / Alliance pour le génie biologique et médical) and the SFGBM (BioMedical Engineering French Society / Société française de génie biologique médical) and the AFIB (French Association of Biomedical Engineers / Association française des ingénieurs biomédicaux). As a vehicle of information and knowledge in the field of biomedical technologies, IRBM is devoted to fundamental as well as clinical research. Biomedical engineering and use of new technologies are the cornerstones of IRBM, providing authors and users with the latest information. Its six issues per year propose reviews (state-of-the-art and current knowledge), original articles directed at fundamental research and articles focusing on biomedical engineering. All articles are submitted to peer reviewers acting as guarantors for IRBM''s scientific and medical content. The field covered by IRBM includes all the discipline of Biomedical engineering. Thereby, the type of papers published include those that cover the technological and methodological development in: -Physiological and Biological Signal processing (EEG, MEG, ECG…)- Medical Image processing- Biomechanics- Biomaterials- Medical Physics- Biophysics- Physiological and Biological Sensors- Information technologies in healthcare- Disability research- Computational physiology- …