Spencer Ellis, Steven Song, Derek Reiman, Xuan Hui, Renyu Zhang, Mohammad Hasan Shahriar, Maria Argos, Mohammed Kamal, Christopher R Shea, Robert L Grossman, Aly A Khan, Habibul Ahsan
{"title":"Improved Diagnosis of Non-Melanoma Skin Cancer in Resource-Limited Settings.","authors":"Spencer Ellis, Steven Song, Derek Reiman, Xuan Hui, Renyu Zhang, Mohammad Hasan Shahriar, Maria Argos, Mohammed Kamal, Christopher R Shea, Robert L Grossman, Aly A Khan, Habibul Ahsan","doi":"10.1158/1055-9965.EPI-25-0132","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Early and precise diagnosis is vital to improving patient outcomes and reducing morbidity. In resource-limited settings, cancer diagnosis is often challenging due to shortages of expert pathologists. We assess the effectiveness of general-purpose pathology foundation models (FMs) for the diagnosis and annotation of nonmelanoma skin cancer (NMSC) in resource-limited settings.</p><p><strong>Methods: </strong>We evaluated three pathology FMs (UNI, PRISM, and Prov-GigaPath) using de-identified NMSC histology images from the Bangladesh Vitamin E and Selenium Trial to predict cancer subtype based on zero-shot whole slide embeddings. In addition, we evaluated tile aggregation methods and machine learning models for prediction. Lastly, we employed few-shot learning of PRISM tile embeddings to perform whole slide annotation.</p><p><strong>Results: </strong>We found that the best model used PRISM's aggregated tile embeddings to train a multi-layer perceptron model (MLP) to predict NMSC subtype (mean AUROC=0.925; p<0.001). Within the other FMs, we found that using attention-based multi-instance learning to aggregate tile embeddings to train an MLP model was optimal (UNI: mean AUROC=0.913; p<0.001; Prov-GigaPath: mean AUROC=0.908, p<0.001). We finally exemplify the utility of few-shot annotation in computation- and expertise-limited settings.</p><p><strong>Conclusions: </strong>Our study highlights the important role FMs may play in confronting public health challenges and exhibits a real-world potential for machine learning aided cancer diagnosis.</p><p><strong>Impact: </strong>Pathology foundation models offer a promising pathway to improve early and precise NMSC diagnosis, especially in resource-limited environments. These tools could also facilitate patient stratification and recruitment for prospective clinical trials aimed at improving NMSC management.</p>","PeriodicalId":9458,"journal":{"name":"Cancer Epidemiology Biomarkers & Prevention","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Epidemiology Biomarkers & Prevention","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1158/1055-9965.EPI-25-0132","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Early and precise diagnosis is vital to improving patient outcomes and reducing morbidity. In resource-limited settings, cancer diagnosis is often challenging due to shortages of expert pathologists. We assess the effectiveness of general-purpose pathology foundation models (FMs) for the diagnosis and annotation of nonmelanoma skin cancer (NMSC) in resource-limited settings.
Methods: We evaluated three pathology FMs (UNI, PRISM, and Prov-GigaPath) using de-identified NMSC histology images from the Bangladesh Vitamin E and Selenium Trial to predict cancer subtype based on zero-shot whole slide embeddings. In addition, we evaluated tile aggregation methods and machine learning models for prediction. Lastly, we employed few-shot learning of PRISM tile embeddings to perform whole slide annotation.
Results: We found that the best model used PRISM's aggregated tile embeddings to train a multi-layer perceptron model (MLP) to predict NMSC subtype (mean AUROC=0.925; p<0.001). Within the other FMs, we found that using attention-based multi-instance learning to aggregate tile embeddings to train an MLP model was optimal (UNI: mean AUROC=0.913; p<0.001; Prov-GigaPath: mean AUROC=0.908, p<0.001). We finally exemplify the utility of few-shot annotation in computation- and expertise-limited settings.
Conclusions: Our study highlights the important role FMs may play in confronting public health challenges and exhibits a real-world potential for machine learning aided cancer diagnosis.
Impact: Pathology foundation models offer a promising pathway to improve early and precise NMSC diagnosis, especially in resource-limited environments. These tools could also facilitate patient stratification and recruitment for prospective clinical trials aimed at improving NMSC management.
期刊介绍:
Cancer Epidemiology, Biomarkers & Prevention publishes original peer-reviewed, population-based research on cancer etiology, prevention, surveillance, and survivorship. The following topics are of special interest: descriptive, analytical, and molecular epidemiology; biomarkers including assay development, validation, and application; chemoprevention and other types of prevention research in the context of descriptive and observational studies; the role of behavioral factors in cancer etiology and prevention; survivorship studies; risk factors; implementation science and cancer care delivery; and the science of cancer health disparities. Besides welcoming manuscripts that address individual subjects in any of the relevant disciplines, CEBP editors encourage the submission of manuscripts with a transdisciplinary approach.