Jack McMahon, Naofumi Tomita, Elizabeth S Tatishev, Adrienne A Workman, Cristina R Costales, Niaz Banaei, Isabella W Martin, Saeed Hassanpour
{"title":"A novel framework for the automated characterization of Gram-stained blood culture slides using a large-scale vision transformer.","authors":"Jack McMahon, Naofumi Tomita, Elizabeth S Tatishev, Adrienne A Workman, Cristina R Costales, Niaz Banaei, Isabella W Martin, Saeed Hassanpour","doi":"10.1128/jcm.01514-24","DOIUrl":null,"url":null,"abstract":"<p><p>This study introduces a new framework for the artificial intelligence-based characterization of Gram-stained whole-slide images (WSIs). As a test for the diagnosis of bloodstream infections, Gram stains provide critical early data to inform patient treatment in conjunction with data from rapid molecular tests. In this work, we developed a novel transformer-based model for Gram-stained WSI classification, which is more scalable to large data sets than previous convolutional neural network-based methods as it does not require patch-level manual annotations. We also introduce a large Gram stain data set from Dartmouth-Hitchcock Medical Center (Lebanon, New Hampshire, USA) to evaluate our model, exploring the classification of five major categories of Gram-stained WSIs: gram-positive cocci in clusters, gram-positive cocci in pairs/chains, gram-positive rods, gram-negative rods, and slides with no bacteria. Our model achieves a classification accuracy of 0.858 (95% CI: 0.805, 0.905) and an area under the receiver operating characteristic curve (AUC) of 0.952 (95% CI: 0.922, 0.976) using fivefold nested cross-validation on our 475-slide data set, demonstrating the potential of large-scale transformer models for Gram stain classification. Results were measured against the final clinical laboratory Gram stain report after growth of organism in culture. We further demonstrate the generalizability of our trained model by applying it without additional fine-tuning on a second 27-slide external data set from Stanford Health (Palo Alto, California, USA) where it achieves a binary classification accuracy of 0.926 (95% CI: 0.885, 0.960) and an AUC of 0.8651 (95% CI: 0.6337, 0.9917) while distinguishing gram-positive from gram-negative bacteria.</p><p><strong>Importance: </strong>This study introduces a scalable transformer-based deep learning model for automating Gram-stained whole-slide image classification. It surpasses previous methods by eliminating the need for manual annotations and demonstrates high accuracy and generalizability across multiple data sets, enhancing the speed and reliability of Gram stain analysis.</p>","PeriodicalId":15511,"journal":{"name":"Journal of Clinical Microbiology","volume":" ","pages":"e0151424"},"PeriodicalIF":6.1000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Microbiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1128/jcm.01514-24","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study introduces a new framework for the artificial intelligence-based characterization of Gram-stained whole-slide images (WSIs). As a test for the diagnosis of bloodstream infections, Gram stains provide critical early data to inform patient treatment in conjunction with data from rapid molecular tests. In this work, we developed a novel transformer-based model for Gram-stained WSI classification, which is more scalable to large data sets than previous convolutional neural network-based methods as it does not require patch-level manual annotations. We also introduce a large Gram stain data set from Dartmouth-Hitchcock Medical Center (Lebanon, New Hampshire, USA) to evaluate our model, exploring the classification of five major categories of Gram-stained WSIs: gram-positive cocci in clusters, gram-positive cocci in pairs/chains, gram-positive rods, gram-negative rods, and slides with no bacteria. Our model achieves a classification accuracy of 0.858 (95% CI: 0.805, 0.905) and an area under the receiver operating characteristic curve (AUC) of 0.952 (95% CI: 0.922, 0.976) using fivefold nested cross-validation on our 475-slide data set, demonstrating the potential of large-scale transformer models for Gram stain classification. Results were measured against the final clinical laboratory Gram stain report after growth of organism in culture. We further demonstrate the generalizability of our trained model by applying it without additional fine-tuning on a second 27-slide external data set from Stanford Health (Palo Alto, California, USA) where it achieves a binary classification accuracy of 0.926 (95% CI: 0.885, 0.960) and an AUC of 0.8651 (95% CI: 0.6337, 0.9917) while distinguishing gram-positive from gram-negative bacteria.
Importance: This study introduces a scalable transformer-based deep learning model for automating Gram-stained whole-slide image classification. It surpasses previous methods by eliminating the need for manual annotations and demonstrates high accuracy and generalizability across multiple data sets, enhancing the speed and reliability of Gram stain analysis.
期刊介绍:
The Journal of Clinical Microbiology® disseminates the latest research concerning the laboratory diagnosis of human and animal infections, along with the laboratory's role in epidemiology and the management of infectious diseases.