推进青光眼诊断：采用置信度校准标签平滑损失进行模型校准

IF 3.2 Q1 OPHTHALMOLOGY

Ophthalmology science Pub Date : 2024-06-22 DOI:10.1016/j.xops.2024.100555

Midhula Vijayan PhD, Deepthi Keshav Prasad PhD, Venkatakrishnan Srinivasan MTech

{"title":"推进青光眼诊断：采用置信度校准标签平滑损失进行模型校准","authors":"Midhula Vijayan PhD, Deepthi Keshav Prasad PhD, Venkatakrishnan Srinivasan MTech","doi":"10.1016/j.xops.2024.100555","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>The aim of our research is to enhance the calibration of machine learning models for glaucoma classification through a specialized loss function named Confidence-Calibrated Label Smoothing (CC-LS) loss. This approach is specifically designed to refine model calibration without compromising accuracy by integrating label smoothing and confidence penalty techniques, tailored to the specifics of glaucoma detection.</p></div><div><h3>Design</h3><p>This study focuses on the development and evaluation of a calibrated deep learning model.</p></div><div><h3>Participants</h3><p>The study employs fundus images from both external datasets—the Online Retinal Fundus Image Database for Glaucoma Analysis and Research (482 normal, 168 glaucoma) and the Retinal Fundus Glaucoma Challenge (720 normal, 80 glaucoma)—and an extensive internal dataset (4639 images per category), aiming to bolster the model's generalizability. The model's clinical performance is validated using a comprehensive test set (47 913 normal, 1629 glaucoma) from the internal dataset.</p></div><div><h3>Methods</h3><p>The CC-LS loss function seamlessly integrates label smoothing, which tempers extreme predictions to avoid overfitting, with confidence-based penalties. These penalties deter the model from expressing undue confidence in incorrect classifications. Our study aims at training models using the CC-LS and comparing their performance with those trained using conventional loss functions.</p></div><div><h3>Main Outcome Measures</h3><p>The model's precision is evaluated using metrics like the Brier score, sensitivity, specificity, and the false positive rate, alongside qualitative heatmap analyses for a holistic accuracy assessment.</p></div><div><h3>Results</h3><p>Preliminary findings reveal that models employing the CC-LS mechanism exhibit superior calibration metrics, as evidenced by a Brier score of 0.098, along with notable accuracy measures: sensitivity of 81%, specificity of 80%, and weighted accuracy of 80%. Importantly, these enhancements in calibration are achieved without sacrificing classification accuracy.</p></div><div><h3>Conclusions</h3><p>The CC-LS loss function presents a significant advancement in the pursuit of deploying machine learning models for glaucoma diagnosis. By improving calibration, the CC-LS ensures that clinicians can interpret and trust the predictive probabilities, making artificial intelligence-driven diagnostic tools more clinically viable. From a clinical standpoint, this heightened trust and interpretability can potentially lead to more timely and appropriate interventions, thereby optimizing patient outcomes and safety.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":"4 6","pages":"Article 100555"},"PeriodicalIF":3.2000,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666914524000915/pdfft?md5=427dfb03f669c0bff9531a39938549c2&pid=1-s2.0-S2666914524000915-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Advancing Glaucoma Diagnosis: Employing Confidence-Calibrated Label Smoothing Loss for Model Calibration\",\"authors\":\"Midhula Vijayan PhD, Deepthi Keshav Prasad PhD, Venkatakrishnan Srinivasan MTech\",\"doi\":\"10.1016/j.xops.2024.100555\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><p>The aim of our research is to enhance the calibration of machine learning models for glaucoma classification through a specialized loss function named Confidence-Calibrated Label Smoothing (CC-LS) loss. This approach is specifically designed to refine model calibration without compromising accuracy by integrating label smoothing and confidence penalty techniques, tailored to the specifics of glaucoma detection.</p></div><div><h3>Design</h3><p>This study focuses on the development and evaluation of a calibrated deep learning model.</p></div><div><h3>Participants</h3><p>The study employs fundus images from both external datasets—the Online Retinal Fundus Image Database for Glaucoma Analysis and Research (482 normal, 168 glaucoma) and the Retinal Fundus Glaucoma Challenge (720 normal, 80 glaucoma)—and an extensive internal dataset (4639 images per category), aiming to bolster the model's generalizability. The model's clinical performance is validated using a comprehensive test set (47 913 normal, 1629 glaucoma) from the internal dataset.</p></div><div><h3>Methods</h3><p>The CC-LS loss function seamlessly integrates label smoothing, which tempers extreme predictions to avoid overfitting, with confidence-based penalties. These penalties deter the model from expressing undue confidence in incorrect classifications. Our study aims at training models using the CC-LS and comparing their performance with those trained using conventional loss functions.</p></div><div><h3>Main Outcome Measures</h3><p>The model's precision is evaluated using metrics like the Brier score, sensitivity, specificity, and the false positive rate, alongside qualitative heatmap analyses for a holistic accuracy assessment.</p></div><div><h3>Results</h3><p>Preliminary findings reveal that models employing the CC-LS mechanism exhibit superior calibration metrics, as evidenced by a Brier score of 0.098, along with notable accuracy measures: sensitivity of 81%, specificity of 80%, and weighted accuracy of 80%. Importantly, these enhancements in calibration are achieved without sacrificing classification accuracy.</p></div><div><h3>Conclusions</h3><p>The CC-LS loss function presents a significant advancement in the pursuit of deploying machine learning models for glaucoma diagnosis. By improving calibration, the CC-LS ensures that clinicians can interpret and trust the predictive probabilities, making artificial intelligence-driven diagnostic tools more clinically viable. From a clinical standpoint, this heightened trust and interpretability can potentially lead to more timely and appropriate interventions, thereby optimizing patient outcomes and safety.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>\",\"PeriodicalId\":74363,\"journal\":{\"name\":\"Ophthalmology science\",\"volume\":\"4 6\",\"pages\":\"Article 100555\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666914524000915/pdfft?md5=427dfb03f669c0bff9531a39938549c2&pid=1-s2.0-S2666914524000915-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ophthalmology science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666914524000915\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPHTHALMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914524000915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

目标我们的研究旨在通过一种名为 "置信度校准标签平滑（CC-LS）损失 "的专门损失函数，加强青光眼分类机器学习模型的校准。这种方法通过整合标签平滑和置信度惩罚技术，专门针对青光眼检测的具体情况，在不影响准确性的前提下完善模型校准。设计本研究重点关注校准深度学习模型的开发和评估。参与者该研究采用了来自外部数据集--用于青光眼分析和研究的在线视网膜眼底图像数据库（482 张正常图像，168 张青光眼图像）和视网膜眼底青光眼挑战赛（720 张正常图像，80 张青光眼图像）--以及广泛的内部数据集（每个类别 4639 张图像）的眼底图像，旨在增强模型的通用性。该模型的临床性能通过内部数据集的综合测试集（47 913 张正常图像、1629 张青光眼图像）进行了验证。方法CC-LS 损失函数将标签平滑与基于置信度的惩罚无缝整合在一起，标签平滑可以缓和极端预测以避免过度拟合。这些惩罚措施可防止模型对不正确的分类表现出过度的信心。我们的研究旨在使用 CC-LS 训练模型，并将它们的性能与使用传统损失函数训练的模型进行比较。主要结果测量使用 Brier 分数、灵敏度、特异性和假阳性率等指标评估模型的精确度，同时进行定性热图分析，以全面评估精确度。结果初步研究结果表明，采用 CC-LS 机制的模型显示出更优越的校准指标，具体表现为布赖尔评分为 0.098，以及显著的准确性指标：灵敏度为 81%，特异性为 80%，加权准确性为 80%。重要的是，这些校准方面的改进是在不牺牲分类准确性的前提下实现的。结论CC-LS 损失函数在为青光眼诊断部署机器学习模型方面取得了重大进展。通过改进校准，CC-LS 可确保临床医生能够解释并信任预测概率，从而使人工智能驱动的诊断工具在临床上更加可行。从临床角度来看，这种信任度和可解释性的提高有可能带来更及时、更适当的干预，从而优化患者的治疗效果和安全性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Advancing Glaucoma Diagnosis: Employing Confidence-Calibrated Label Smoothing Loss for Model Calibration

Objective

The aim of our research is to enhance the calibration of machine learning models for glaucoma classification through a specialized loss function named Confidence-Calibrated Label Smoothing (CC-LS) loss. This approach is specifically designed to refine model calibration without compromising accuracy by integrating label smoothing and confidence penalty techniques, tailored to the specifics of glaucoma detection.

Design

This study focuses on the development and evaluation of a calibrated deep learning model.

Participants

The study employs fundus images from both external datasets—the Online Retinal Fundus Image Database for Glaucoma Analysis and Research (482 normal, 168 glaucoma) and the Retinal Fundus Glaucoma Challenge (720 normal, 80 glaucoma)—and an extensive internal dataset (4639 images per category), aiming to bolster the model's generalizability. The model's clinical performance is validated using a comprehensive test set (47 913 normal, 1629 glaucoma) from the internal dataset.

Methods

The CC-LS loss function seamlessly integrates label smoothing, which tempers extreme predictions to avoid overfitting, with confidence-based penalties. These penalties deter the model from expressing undue confidence in incorrect classifications. Our study aims at training models using the CC-LS and comparing their performance with those trained using conventional loss functions.

Main Outcome Measures

The model's precision is evaluated using metrics like the Brier score, sensitivity, specificity, and the false positive rate, alongside qualitative heatmap analyses for a holistic accuracy assessment.

Results

Preliminary findings reveal that models employing the CC-LS mechanism exhibit superior calibration metrics, as evidenced by a Brier score of 0.098, along with notable accuracy measures: sensitivity of 81%, specificity of 80%, and weighted accuracy of 80%. Importantly, these enhancements in calibration are achieved without sacrificing classification accuracy.

Conclusions

The CC-LS loss function presents a significant advancement in the pursuit of deploying machine learning models for glaucoma diagnosis. By improving calibration, the CC-LS ensures that clinicians can interpret and trust the predictive probabilities, making artificial intelligence-driven diagnostic tools more clinically viable. From a clinical standpoint, this heightened trust and interpretability can potentially lead to more timely and appropriate interventions, thereby optimizing patient outcomes and safety.