In Jae Jeong, Jin-Kyung Hong, Young Jun Bae, Tea Kwon Lee
{"title":"Enhancing Bacterial Phenotype Classification Through the Integration of Autogating and Automated Machine Learning in Flow Cytometric Analysis","authors":"In Jae Jeong, Jin-Kyung Hong, Young Jun Bae, Tea Kwon Lee","doi":"10.1002/cyto.a.24923","DOIUrl":null,"url":null,"abstract":"<p>Although flow cytometry produces reliable results, the data processing from gating to fingerprinting is prone to subjective bias. Here, we integrated autogating with Automated Machine Learning in flow cytometry to enhance the classification of bacterial phenotypes. We analyzed six bacterial strains prevalent in the soil and groundwater—\n <i>Bacillus subtilis</i>\n , \n <i>Burkholderia thailandensis</i>\n , \n <i>Corynebacterium glutamicum</i>\n , \n <i>Escherichia coli</i>\n , \n <i>Pseudomonas putida</i>\n , and \n <i>Pseudomonas stutzeri</i>\n . Using the H2O-AutoML framework, we applied gradient-boosting machine (GBM) models to classify bacteria across different metabolic phases. Our results demonstrated an overall classification accuracy of 82.34% for GBM. Notably, accuracy varied across metabolic phases, with the highest observed during the late log (88.06%), lag (88.43%), and early log phases (89.37%), whereas the stationary phase showed a slightly lower accuracy of 80.73%. \n <i>P. stutzeri</i>\n exhibited consistently high sensitivity and specificity across all the phases, which indicated that it was the most distinctly identifiable strain. In contrast, \n <i>E. coli</i>\n showed low sensitivity, particularly in the stationary phase, which indicated challenges in its classification. Overall, this study with incorporating autogating and the AutoML framework, substantially reduces subjective biases and enhances the reproducibility and accuracy of microbial classification. Our methodology offers a robust framework for microbial classification in flow cytometric analysis, paving the way for more precise and comprehensive analyses of microbial ecology.</p>","PeriodicalId":11068,"journal":{"name":"Cytometry Part A","volume":"107 3","pages":"203-213"},"PeriodicalIF":2.5000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cyto.a.24923","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cytometry Part A","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cyto.a.24923","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Although flow cytometry produces reliable results, the data processing from gating to fingerprinting is prone to subjective bias. Here, we integrated autogating with Automated Machine Learning in flow cytometry to enhance the classification of bacterial phenotypes. We analyzed six bacterial strains prevalent in the soil and groundwater—
Bacillus subtilis
,
Burkholderia thailandensis
,
Corynebacterium glutamicum
,
Escherichia coli
,
Pseudomonas putida
, and
Pseudomonas stutzeri
. Using the H2O-AutoML framework, we applied gradient-boosting machine (GBM) models to classify bacteria across different metabolic phases. Our results demonstrated an overall classification accuracy of 82.34% for GBM. Notably, accuracy varied across metabolic phases, with the highest observed during the late log (88.06%), lag (88.43%), and early log phases (89.37%), whereas the stationary phase showed a slightly lower accuracy of 80.73%.
P. stutzeri
exhibited consistently high sensitivity and specificity across all the phases, which indicated that it was the most distinctly identifiable strain. In contrast,
E. coli
showed low sensitivity, particularly in the stationary phase, which indicated challenges in its classification. Overall, this study with incorporating autogating and the AutoML framework, substantially reduces subjective biases and enhances the reproducibility and accuracy of microbial classification. Our methodology offers a robust framework for microbial classification in flow cytometric analysis, paving the way for more precise and comprehensive analyses of microbial ecology.
期刊介绍:
Cytometry Part A, the journal of quantitative single-cell analysis, features original research reports and reviews of innovative scientific studies employing quantitative single-cell measurement, separation, manipulation, and modeling techniques, as well as original articles on mechanisms of molecular and cellular functions obtained by cytometry techniques.
The journal welcomes submissions from multiple research fields that fully embrace the study of the cytome:
Biomedical Instrumentation Engineering
Biophotonics
Bioinformatics
Cell Biology
Computational Biology
Data Science
Immunology
Parasitology
Microbiology
Neuroscience
Cancer
Stem Cells
Tissue Regeneration.