Taibur Rahman, Lipi B. Mahanta, Anup Kumar Das, Gazi Naseem Ahmed
{"title":"An Enhanced U-Net Model for Precise Oral Epithelial Layer Segmentation Using Patch-Based Training","authors":"Taibur Rahman, Lipi B. Mahanta, Anup Kumar Das, Gazi Naseem Ahmed","doi":"10.1002/ima.70136","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The oral epithelial layer is crucial for detecting oral dysplasia and cancer from histopathology images. Accurate segmentation of the oral epithelial layer in biopsy slide images is essential for early detection and effective treatment planning of conditions like Oral Epithelial Dysplasia, where abnormal changes increase the risk of oral cancer. This study investigates using a Deep Learning model to precisely identify and segment areas of the Oral Epithelial Layer in biopsy images of the oral cavity, aiming to enhance early diagnosis and treatment strategies. The study is conducted with an indigenously collected and benchmarked dataset of 300 histopathology images of the oral cavity, representing 64 patients. We propose a Deep Learning-based modified U-Net model for segmenting oral cavity histopathology images. Various patch sizes and batch size combinations were tested and implemented for comparison. The performance of the optimal patch and batch size combination is further compared with relevant state-of-the-art models. The modified U-Net model utilizing the patch generation technique demonstrated superior performance in oral cavity epithelium segmentation, achieving an IoU of 98.06, precision of 99.66, recall of 99.13, and F1-score of 99.00. Our research underscores the efficacy of deep learning-based segmentation with the patch generation technique in improving oral health diagnostics, outperforming several state-of-the-art models in segmenting the epithelial layer. This research enhances segmentation, a key step in Computer-Aided Diagnosis systems, ensuring accurate analysis, efficient processing, and reliable medical image interpretation for improved patient outcomes.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70136","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The oral epithelial layer is crucial for detecting oral dysplasia and cancer from histopathology images. Accurate segmentation of the oral epithelial layer in biopsy slide images is essential for early detection and effective treatment planning of conditions like Oral Epithelial Dysplasia, where abnormal changes increase the risk of oral cancer. This study investigates using a Deep Learning model to precisely identify and segment areas of the Oral Epithelial Layer in biopsy images of the oral cavity, aiming to enhance early diagnosis and treatment strategies. The study is conducted with an indigenously collected and benchmarked dataset of 300 histopathology images of the oral cavity, representing 64 patients. We propose a Deep Learning-based modified U-Net model for segmenting oral cavity histopathology images. Various patch sizes and batch size combinations were tested and implemented for comparison. The performance of the optimal patch and batch size combination is further compared with relevant state-of-the-art models. The modified U-Net model utilizing the patch generation technique demonstrated superior performance in oral cavity epithelium segmentation, achieving an IoU of 98.06, precision of 99.66, recall of 99.13, and F1-score of 99.00. Our research underscores the efficacy of deep learning-based segmentation with the patch generation technique in improving oral health diagnostics, outperforming several state-of-the-art models in segmenting the epithelial layer. This research enhances segmentation, a key step in Computer-Aided Diagnosis systems, ensuring accurate analysis, efficient processing, and reliable medical image interpretation for improved patient outcomes.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.