Vision transformer introduces a new vitality to the classification of renal pathology.

IF 2.2 4区医学 Q2 UROLOGY & NEPHROLOGY

BMC Nephrology Pub Date : 2024-10-09 DOI:10.1186/s12882-024-03800-x

Ji Zhang, Jia Dan Lu, Bo Chen, ShuFang Pan, LingWei Jin, Yu Zheng, Min Pan

{"title":"Vision transformer introduces a new vitality to the classification of renal pathology.","authors":"Ji Zhang, Jia Dan Lu, Bo Chen, ShuFang Pan, LingWei Jin, Yu Zheng, Min Pan","doi":"10.1186/s12882-024-03800-x","DOIUrl":null,"url":null,"abstract":"<p><p>Recent advancements in computer vision within the field of artificial intelligence (AI) have made significant inroads into the medical domain. However, the application of AI for classifying renal pathology remains challenging due to the subtle variations in multiple renal pathological classifications. Vision Transformers (ViT), an adaptation of the Transformer model for image recognition, have demonstrated superior capabilities in capturing global features and providing greater explainability. In our study, we developed a ViT model using a diverse set of stained renal histopathology images to evaluate its effectiveness in classifying renal pathology. A total of 1861 whole slide images (WSI) stained with HE, MASSON, PAS, and PASM were collected from 635 patients. Renal tissue images were then extracted, tiled, and categorized into 14 classes on the basis of renal pathology. We employed the classic ViT model from the Timm library, utilizing images sized 384 × 384 pixels with 16 × 16 pixel patches, to train the classification model. A comparative analysis was conducted to evaluate the performance of the ViT model against traditional convolutional neural network (CNN) models. The results indicated that the ViT model demonstrated superior recognition ability (accuracy: 0.96-0.99). Furthermore, we visualized the identification process of the ViT models to investigate potentially significant pathological ultrastructures. Our study demonstrated that ViT models outperformed CNN models in accurately classifying renal pathology. Additionally, ViT models are able to focus on specific, significant structures within renal histopathology, which could be crucial for identifying novel and meaningful pathological features in the diagnosis and treatment of renal disease.</p>","PeriodicalId":9089,"journal":{"name":"BMC Nephrology","volume":"25 1","pages":"337"},"PeriodicalIF":2.2000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11465538/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Nephrology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12882-024-03800-x","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Recent advancements in computer vision within the field of artificial intelligence (AI) have made significant inroads into the medical domain. However, the application of AI for classifying renal pathology remains challenging due to the subtle variations in multiple renal pathological classifications. Vision Transformers (ViT), an adaptation of the Transformer model for image recognition, have demonstrated superior capabilities in capturing global features and providing greater explainability. In our study, we developed a ViT model using a diverse set of stained renal histopathology images to evaluate its effectiveness in classifying renal pathology. A total of 1861 whole slide images (WSI) stained with HE, MASSON, PAS, and PASM were collected from 635 patients. Renal tissue images were then extracted, tiled, and categorized into 14 classes on the basis of renal pathology. We employed the classic ViT model from the Timm library, utilizing images sized 384 × 384 pixels with 16 × 16 pixel patches, to train the classification model. A comparative analysis was conducted to evaluate the performance of the ViT model against traditional convolutional neural network (CNN) models. The results indicated that the ViT model demonstrated superior recognition ability (accuracy: 0.96-0.99). Furthermore, we visualized the identification process of the ViT models to investigate potentially significant pathological ultrastructures. Our study demonstrated that ViT models outperformed CNN models in accurately classifying renal pathology. Additionally, ViT models are able to focus on specific, significant structures within renal histopathology, which could be crucial for identifying novel and meaningful pathological features in the diagnosis and treatment of renal disease.

查看原文本刊更多论文

视觉转换器为肾脏病理分类注入了新的活力。

近年来，人工智能（AI）领域的计算机视觉技术取得了长足进步，在医疗领域也取得了重大进展。然而，由于多种肾脏病理分类存在细微差别，将人工智能应用于肾脏病理分类仍具有挑战性。视觉变换器（ViT）是图像识别变换器模型的一种改良，在捕捉全局特征和提供更高的可解释性方面表现出了卓越的能力。在我们的研究中，我们使用一组不同的染色肾组织病理学图像开发了一个 ViT 模型，以评估其在肾病理学分类中的有效性。我们从 635 名患者中收集了 1861 张用 HE、MASSON、PAS 和 PASM 染色的全切片图像（WSI）。然后提取、平铺肾组织图像，并根据肾脏病理分为 14 类。我们采用了 Timm 库中的经典 ViT 模型，利用大小为 384 × 384 像素的图像和 16 × 16 像素的斑块来训练分类模型。我们进行了对比分析，以评估 ViT 模型与传统卷积神经网络 (CNN) 模型的性能。结果表明，ViT 模型的识别能力更强（准确率：0.96-0.99）。此外，我们还将 ViT 模型的识别过程可视化，以研究潜在的重要病理超微结构。我们的研究表明，在对肾脏病理进行准确分类方面，ViT 模型优于 CNN 模型。此外，ViT 模型还能关注肾脏组织病理学中特定的、重要的结构，这对于在诊断和治疗肾脏疾病时识别新的、有意义的病理特征至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Nephrology UROLOGY & NEPHROLOGY-

CiteScore

4.30

自引率

0.00%

发文量

375

审稿时长

3-8 weeks

期刊介绍： BMC Nephrology is an open access journal publishing original peer-reviewed research articles in all aspects of the prevention, diagnosis and management of kidney and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.