Kwanghoon Lee, Jaemin Jeon, Jin Woo Park, Suwan Yu, Jae-Kyung Won, Kwangsoo Kim, Chul-Kee Park, Sung-Hye Park
{"title":"SNUH methylation classifier for CNS tumors.","authors":"Kwanghoon Lee, Jaemin Jeon, Jin Woo Park, Suwan Yu, Jae-Kyung Won, Kwangsoo Kim, Chul-Kee Park, Sung-Hye Park","doi":"10.1186/s13148-025-01824-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Methylation profiling of central nervous system (CNS) tumors, pioneered by the German Cancer Research Center, has significantly improved diagnostic accuracy. This study aimed to further enhance the performance of methylation classifiers by leveraging publicly available data and innovative machine-learning techniques.</p><p><strong>Results: </strong>Seoul National University Hospital Methylation Classifier (SNUH-MC) addressed data imbalance using the Synthetic Minority Over-sampling Technique (SMOTE) algorithm and incorporated OpenMax within a Multi-Layer Perceptron to prevent labeling errors in low-confidence diagnoses. Compared to two published CNS tumor methylation classification models (DKFZ-MC: Deutsches Krebsforschungszentrum Methylation Classifier v11b4: RandomForest, 767-MC: Multi-Layer Perceptron), our SNUH-MC showed improved performance in F1-score. For 'Filtered Test Data Set 1,' the SNUH-MC achieved higher F1-micro (0.932) and F1-macro (0.919) scores compared to DKFZ-MC v11b4 (F1-micro: 0.907, F1-macro: 0.627). We evaluated the performance of three classifiers; SNUH-MC, DKFZ-MC v11b4, and DKFZ-MC v12.5, using specific criteria. We set established 'Decisions' categories based on histopathology, clinical information, and next-generation sequencing to assess the classification results. When applied to 193 unknown SNUH methylation data samples, SNUH-MC notably improved diagnosis compared to DKFZ-MC v11b4. Specifically, 17 cases were reclassified as 'Match' and 34 cases as 'Likely Match' when transitioning from DKFZ-MC v11b4 to SNUH-MC. Additionally, SNUH-MC demonstrated similar results to DKFZ-MC v12.5 for 23 cases that were unclassified by v11b4.</p><p><strong>Conclusions: </strong>This study presents SNUH-MC, an innovative methylation-based classification tool that significantly advances the field of neuropathology and bioinformatics. Our classifier incorporates cutting-edge techniques such as the SMOTE and OpenMax resulting in improved diagnostic accuracy and robustness, particularly when dealing with unknown or noisy data.</p>","PeriodicalId":10366,"journal":{"name":"Clinical Epigenetics","volume":"17 1","pages":"47"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905536/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epigenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13148-025-01824-0","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Methylation profiling of central nervous system (CNS) tumors, pioneered by the German Cancer Research Center, has significantly improved diagnostic accuracy. This study aimed to further enhance the performance of methylation classifiers by leveraging publicly available data and innovative machine-learning techniques.
Results: Seoul National University Hospital Methylation Classifier (SNUH-MC) addressed data imbalance using the Synthetic Minority Over-sampling Technique (SMOTE) algorithm and incorporated OpenMax within a Multi-Layer Perceptron to prevent labeling errors in low-confidence diagnoses. Compared to two published CNS tumor methylation classification models (DKFZ-MC: Deutsches Krebsforschungszentrum Methylation Classifier v11b4: RandomForest, 767-MC: Multi-Layer Perceptron), our SNUH-MC showed improved performance in F1-score. For 'Filtered Test Data Set 1,' the SNUH-MC achieved higher F1-micro (0.932) and F1-macro (0.919) scores compared to DKFZ-MC v11b4 (F1-micro: 0.907, F1-macro: 0.627). We evaluated the performance of three classifiers; SNUH-MC, DKFZ-MC v11b4, and DKFZ-MC v12.5, using specific criteria. We set established 'Decisions' categories based on histopathology, clinical information, and next-generation sequencing to assess the classification results. When applied to 193 unknown SNUH methylation data samples, SNUH-MC notably improved diagnosis compared to DKFZ-MC v11b4. Specifically, 17 cases were reclassified as 'Match' and 34 cases as 'Likely Match' when transitioning from DKFZ-MC v11b4 to SNUH-MC. Additionally, SNUH-MC demonstrated similar results to DKFZ-MC v12.5 for 23 cases that were unclassified by v11b4.
Conclusions: This study presents SNUH-MC, an innovative methylation-based classification tool that significantly advances the field of neuropathology and bioinformatics. Our classifier incorporates cutting-edge techniques such as the SMOTE and OpenMax resulting in improved diagnostic accuracy and robustness, particularly when dealing with unknown or noisy data.
期刊介绍:
Clinical Epigenetics, the official journal of the Clinical Epigenetics Society, is an open access, peer-reviewed journal that encompasses all aspects of epigenetic principles and mechanisms in relation to human disease, diagnosis and therapy. Clinical trials and research in disease model organisms are particularly welcome.