通过自我监督学习从肾小球图像中提取特征并用最少的注释进行疾病分类

IF 10.3 1区 医学 Q1 UROLOGY & NEPHROLOGY
Masatoshi Abe, Hirohiko Niioka, Ayumi Matsumoto, Yusuke Katsuma, Atsuhiro Imai, Hiroki Okushima, Shingo Ozaki, Naohiko Fujii, Kazumasa Oka, Yusuke Sakaguchi, Kazunori Inoue, Yoshitaka Isaka, Isao Matsui
{"title":"通过自我监督学习从肾小球图像中提取特征并用最少的注释进行疾病分类","authors":"Masatoshi Abe, Hirohiko Niioka, Ayumi Matsumoto, Yusuke Katsuma, Atsuhiro Imai, Hiroki Okushima, Shingo Ozaki, Naohiko Fujii, Kazumasa Oka, Yusuke Sakaguchi, Kazunori Inoue, Yoshitaka Isaka, Isao Matsui","doi":"10.1681/ASN.0000000514","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Deep learning has great potential in digital kidney pathology. However, its effectiveness depends heavily on the availability of extensively labeled datasets, which are often limited due to the specialized knowledge and time required for their creation. This limitation hinders the widespread application of deep learning for the analysis of kidney biopsy images.</p><p><strong>Methods: </strong>We applied self-distillation with no labels (DINO), a self-supervised learning method, to a dataset of 10,423 glomerular images obtained from 384 PAS-stained kidney biopsy slides. Glomerular features extracted from the DINO-pretrained backbone were visualized using principal component analysis (PCA). We then performed classification tasks by adding either k-nearest neighbor (kNN) classifiers or linear head layers to the DINO-pretrained or ImageNet-pretrained backbones. These models were trained on our labeled classification dataset. Performance was evaluated using metrics such as the area under the receiver operating characteristic curve (ROC-AUC). The classification tasks encompassed four disease categories (minimal change disease, mesangial proliferative glomerulonephritis, membranous nephropathy, and diabetic nephropathy) as well as clinical parameters such as hypertension, proteinuria, and hematuria.</p><p><strong>Results: </strong>PCA visualization revealed distinct principal components corresponding to different glomerular structures, demonstrating the capability of the DINO-pretrained backbone to capture morphological features. In disease classification, the DINO-pretrained transferred model (ROC-AUC = 0.93) outperformed the ImageNet-pretrained fine-tuned model (ROC-AUC = 0.89). When the labeled data were limited, the ImageNet-pretrained fine-tuned model's ROC-AUC dropped to 0.76 (95% confidence interval [CI], 0.72-0.80), whereas the DINO-pretrained transferred model maintained superior performance (ROC-AUC 0.88, 95% CI 0.86-0.90). The DINO-pretrained transferred model also exhibited higher AUCs for the classification of several clinical parameters. External validation using two independent datasets confirmed DINO pre-training's superiority, particularly when labeled data were limited.</p><p><strong>Conclusions: </strong>The application of DINO to unlabeled PAS-stained glomerular images facilitated the extraction of histological features that can be effectively utilized for disease classification.</p>","PeriodicalId":17217,"journal":{"name":"Journal of The American Society of Nephrology","volume":" ","pages":""},"PeriodicalIF":10.3000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-Supervised Learning for Feature Extraction from Glomerular Images and Disease Classification with Minimal Annotations.\",\"authors\":\"Masatoshi Abe, Hirohiko Niioka, Ayumi Matsumoto, Yusuke Katsuma, Atsuhiro Imai, Hiroki Okushima, Shingo Ozaki, Naohiko Fujii, Kazumasa Oka, Yusuke Sakaguchi, Kazunori Inoue, Yoshitaka Isaka, Isao Matsui\",\"doi\":\"10.1681/ASN.0000000514\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Deep learning has great potential in digital kidney pathology. However, its effectiveness depends heavily on the availability of extensively labeled datasets, which are often limited due to the specialized knowledge and time required for their creation. This limitation hinders the widespread application of deep learning for the analysis of kidney biopsy images.</p><p><strong>Methods: </strong>We applied self-distillation with no labels (DINO), a self-supervised learning method, to a dataset of 10,423 glomerular images obtained from 384 PAS-stained kidney biopsy slides. Glomerular features extracted from the DINO-pretrained backbone were visualized using principal component analysis (PCA). We then performed classification tasks by adding either k-nearest neighbor (kNN) classifiers or linear head layers to the DINO-pretrained or ImageNet-pretrained backbones. These models were trained on our labeled classification dataset. Performance was evaluated using metrics such as the area under the receiver operating characteristic curve (ROC-AUC). The classification tasks encompassed four disease categories (minimal change disease, mesangial proliferative glomerulonephritis, membranous nephropathy, and diabetic nephropathy) as well as clinical parameters such as hypertension, proteinuria, and hematuria.</p><p><strong>Results: </strong>PCA visualization revealed distinct principal components corresponding to different glomerular structures, demonstrating the capability of the DINO-pretrained backbone to capture morphological features. In disease classification, the DINO-pretrained transferred model (ROC-AUC = 0.93) outperformed the ImageNet-pretrained fine-tuned model (ROC-AUC = 0.89). When the labeled data were limited, the ImageNet-pretrained fine-tuned model's ROC-AUC dropped to 0.76 (95% confidence interval [CI], 0.72-0.80), whereas the DINO-pretrained transferred model maintained superior performance (ROC-AUC 0.88, 95% CI 0.86-0.90). The DINO-pretrained transferred model also exhibited higher AUCs for the classification of several clinical parameters. External validation using two independent datasets confirmed DINO pre-training's superiority, particularly when labeled data were limited.</p><p><strong>Conclusions: </strong>The application of DINO to unlabeled PAS-stained glomerular images facilitated the extraction of histological features that can be effectively utilized for disease classification.</p>\",\"PeriodicalId\":17217,\"journal\":{\"name\":\"Journal of The American Society of Nephrology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":10.3000,\"publicationDate\":\"2024-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of The American Society of Nephrology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1681/ASN.0000000514\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"UROLOGY & NEPHROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The American Society of Nephrology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1681/ASN.0000000514","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:深度学习在数字肾脏病理学领域具有巨大潜力。然而,其有效性在很大程度上取决于是否有广泛标记的数据集,而由于创建数据集需要专业知识和时间,这些数据集往往是有限的。这一限制阻碍了深度学习在肾活检图像分析中的广泛应用:我们对从 384 张 PAS 染色的肾活检切片中获得的 10,423 张肾小球图像数据集应用了一种自监督学习方法--无标签自蒸馏(DINO)。使用主成分分析(PCA)对从 DINO 训练的骨干中提取的肾小球特征进行了可视化。然后,我们在 DINO 训练或 ImageNet 训练的骨干上添加了 k 近邻(kNN)分类器或线性头层,从而完成了分类任务。这些模型是在我们的标注分类数据集上训练的。使用接收者操作特征曲线下面积(ROC-AUC)等指标对性能进行评估。分类任务包括四个疾病类别(微小病变、间质增生性肾小球肾炎、膜性肾病和糖尿病肾病)以及高血压、蛋白尿和血尿等临床参数:PCA可视化显示了与不同肾小球结构相对应的不同主成分,证明了DINO训练骨架捕捉形态特征的能力。在疾病分类中,DINO 预训练的转移模型(ROC-AUC = 0.93)优于 ImageNet 预训练的微调模型(ROC-AUC = 0.89)。当标注数据有限时,ImageNet 训练的微调模型的 ROC-AUC 下降到 0.76(95% 置信区间 [CI],0.72-0.80),而 DINO 训练的转移模型则保持了优异的性能(ROC-AUC 0.88,95% CI 0.86-0.90)。经过 DINO 训练的转移模型在多个临床参数的分类中也表现出更高的 AUC。使用两个独立数据集进行的外部验证证实了 DINO 预训练的优越性,尤其是在标记数据有限的情况下:结论:将 DINO 应用于未标记的 PAS 染色肾小球图像有助于提取组织学特征,可有效用于疾病分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Self-Supervised Learning for Feature Extraction from Glomerular Images and Disease Classification with Minimal Annotations.

Background: Deep learning has great potential in digital kidney pathology. However, its effectiveness depends heavily on the availability of extensively labeled datasets, which are often limited due to the specialized knowledge and time required for their creation. This limitation hinders the widespread application of deep learning for the analysis of kidney biopsy images.

Methods: We applied self-distillation with no labels (DINO), a self-supervised learning method, to a dataset of 10,423 glomerular images obtained from 384 PAS-stained kidney biopsy slides. Glomerular features extracted from the DINO-pretrained backbone were visualized using principal component analysis (PCA). We then performed classification tasks by adding either k-nearest neighbor (kNN) classifiers or linear head layers to the DINO-pretrained or ImageNet-pretrained backbones. These models were trained on our labeled classification dataset. Performance was evaluated using metrics such as the area under the receiver operating characteristic curve (ROC-AUC). The classification tasks encompassed four disease categories (minimal change disease, mesangial proliferative glomerulonephritis, membranous nephropathy, and diabetic nephropathy) as well as clinical parameters such as hypertension, proteinuria, and hematuria.

Results: PCA visualization revealed distinct principal components corresponding to different glomerular structures, demonstrating the capability of the DINO-pretrained backbone to capture morphological features. In disease classification, the DINO-pretrained transferred model (ROC-AUC = 0.93) outperformed the ImageNet-pretrained fine-tuned model (ROC-AUC = 0.89). When the labeled data were limited, the ImageNet-pretrained fine-tuned model's ROC-AUC dropped to 0.76 (95% confidence interval [CI], 0.72-0.80), whereas the DINO-pretrained transferred model maintained superior performance (ROC-AUC 0.88, 95% CI 0.86-0.90). The DINO-pretrained transferred model also exhibited higher AUCs for the classification of several clinical parameters. External validation using two independent datasets confirmed DINO pre-training's superiority, particularly when labeled data were limited.

Conclusions: The application of DINO to unlabeled PAS-stained glomerular images facilitated the extraction of histological features that can be effectively utilized for disease classification.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of The American Society of Nephrology
Journal of The American Society of Nephrology 医学-泌尿学与肾脏学
CiteScore
22.40
自引率
2.90%
发文量
492
审稿时长
3-8 weeks
期刊介绍: The Journal of the American Society of Nephrology (JASN) stands as the preeminent kidney journal globally, offering an exceptional synthesis of cutting-edge basic research, clinical epidemiology, meta-analysis, and relevant editorial content. Representing a comprehensive resource, JASN encompasses clinical research, editorials distilling key findings, perspectives, and timely reviews. Editorials are skillfully crafted to elucidate the essential insights of the parent article, while JASN actively encourages the submission of Letters to the Editor discussing recently published articles. The reviews featured in JASN are consistently erudite and comprehensive, providing thorough coverage of respective fields. Since its inception in July 1990, JASN has been a monthly publication. JASN publishes original research reports and editorial content across a spectrum of basic and clinical science relevant to the broad discipline of nephrology. Topics covered include renal cell biology, developmental biology of the kidney, genetics of kidney disease, cell and transport physiology, hemodynamics and vascular regulation, mechanisms of blood pressure regulation, renal immunology, kidney pathology, pathophysiology of kidney diseases, nephrolithiasis, clinical nephrology (including dialysis and transplantation), and hypertension. Furthermore, articles addressing healthcare policy and care delivery issues relevant to nephrology are warmly welcomed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信