{"title":"Correspondence: Accuracy Is Not Enough: Stability‐Aware Feature Selection for Reproducible Biomarker Discovery","authors":"Yoshiyasu Takefuji","doi":"10.1111/all.70075","DOIUrl":null,"url":null,"abstract":"Random forest (RF) models can achieve high predictive accuracy, yet their model‐specific feature importances may be unstable and misleading. Using an allergy benchmark dataset (10,000 instances, 11 features), we compared five selection strategies—RF, logistic regression, feature agglomeration (FA), highly variable gene selection (HVGS), and Spearman correlation—evaluating cross‐validated accuracy with the top five features and after removing the top two (reselecting the top three). RF attained 0.9999 accuracy with the top five but fell to 0.8836 and showed unstable rankings; logistic regression maintained 0.9116 but was also unstable. FA, HVGS, and Spearman achieved near‐perfect accuracy (0.9999) with the top five and modest declines (0.9076–0.9116) with stable rankings. Results underscore that accuracy does not imply reliable importance; stability‐aware, model‐agnostic, or unsupervised methods better support reproducible biomarker discovery.","PeriodicalId":122,"journal":{"name":"Allergy","volume":"29 1","pages":""},"PeriodicalIF":12.0000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Allergy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/all.70075","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ALLERGY","Score":null,"Total":0}
引用次数: 0
Abstract
Random forest (RF) models can achieve high predictive accuracy, yet their model‐specific feature importances may be unstable and misleading. Using an allergy benchmark dataset (10,000 instances, 11 features), we compared five selection strategies—RF, logistic regression, feature agglomeration (FA), highly variable gene selection (HVGS), and Spearman correlation—evaluating cross‐validated accuracy with the top five features and after removing the top two (reselecting the top three). RF attained 0.9999 accuracy with the top five but fell to 0.8836 and showed unstable rankings; logistic regression maintained 0.9116 but was also unstable. FA, HVGS, and Spearman achieved near‐perfect accuracy (0.9999) with the top five and modest declines (0.9076–0.9116) with stable rankings. Results underscore that accuracy does not imply reliable importance; stability‐aware, model‐agnostic, or unsupervised methods better support reproducible biomarker discovery.
期刊介绍:
Allergy is an international and multidisciplinary journal that aims to advance, impact, and communicate all aspects of the discipline of Allergy/Immunology. It publishes original articles, reviews, position papers, guidelines, editorials, news and commentaries, letters to the editors, and correspondences. The journal accepts articles based on their scientific merit and quality.
Allergy seeks to maintain contact between basic and clinical Allergy/Immunology and encourages contributions from contributors and readers from all countries. In addition to its publication, Allergy also provides abstracting and indexing information. Some of the databases that include Allergy abstracts are Abstracts on Hygiene & Communicable Disease, Academic Search Alumni Edition, AgBiotech News & Information, AGRICOLA Database, Biological Abstracts, PubMed Dietary Supplement Subset, and Global Health, among others.