Hrafn Weishaupt, Justinas Besusparis, Cleo-Aron Weis, Stefan Porubsky, Arvydas Laurinavicius, Sabine Leh
{"title":"Unsupervised learning for labeling global glomerulosclerosis","authors":"Hrafn Weishaupt, Justinas Besusparis, Cleo-Aron Weis, Stefan Porubsky, Arvydas Laurinavicius, Sabine Leh","doi":"10.1101/2024.09.01.610244","DOIUrl":null,"url":null,"abstract":"Current deep learning models for classifying glomeruli in nephropathology are trained almost exclusively in a supervised manner, requiring expert-labeled images. Very little is known about the potential for unsupervised learning to overcome this bottleneck. To address this open question in a proof-of-concept, the project focused on the most fundamental classification task: globally sclerosed versus non-globally sclerosed glomeruli. The performance of clustering between the two classes was extensively studied across a variety of labeled datasets with diverse compositions and histological stains, and across the feature embeddings produced by 34 different pre-trained CNN models. As demonstrated by the study, clustering of globally and non-globally sclerosed glomeruli is generally highly feasible, yielding accuracies of over 95% in most datasets. Further work will be required to expand these experiments towards the clustering of additional glomerular lesion categories. We are convinced that these efforts (i) will open up opportunities for semi-automatic labeling approaches, thus alleviating the need for labor-intensive manual labeling, and (ii) illustrate that glomerular classification models can potentially be trained even in the absence of expert-derived class labels.","PeriodicalId":501471,"journal":{"name":"bioRxiv - Pathology","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Pathology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.01.610244","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Current deep learning models for classifying glomeruli in nephropathology are trained almost exclusively in a supervised manner, requiring expert-labeled images. Very little is known about the potential for unsupervised learning to overcome this bottleneck. To address this open question in a proof-of-concept, the project focused on the most fundamental classification task: globally sclerosed versus non-globally sclerosed glomeruli. The performance of clustering between the two classes was extensively studied across a variety of labeled datasets with diverse compositions and histological stains, and across the feature embeddings produced by 34 different pre-trained CNN models. As demonstrated by the study, clustering of globally and non-globally sclerosed glomeruli is generally highly feasible, yielding accuracies of over 95% in most datasets. Further work will be required to expand these experiments towards the clustering of additional glomerular lesion categories. We are convinced that these efforts (i) will open up opportunities for semi-automatic labeling approaches, thus alleviating the need for labor-intensive manual labeling, and (ii) illustrate that glomerular classification models can potentially be trained even in the absence of expert-derived class labels.