SCARS-LOGISTIC: A novel variable selection approach for binary classification model to identify the significant determinants of sexually transmitted infections.
{"title":"SCARS-LOGISTIC: A novel variable selection approach for binary classification model to identify the significant determinants of sexually transmitted infections.","authors":"Maryam Sadiq, Nasser A Alsadhan, Ramla Shah, Sidra Younas, Zahid Rasheed","doi":"10.1371/journal.pone.0324395","DOIUrl":null,"url":null,"abstract":"<p><p>Variable selection methods are very popular, especially in the field of big data with large predictors. These procedures improve the accuracy and performance of the model by eliminating irrelevant and redundant variables. The main contribution of this study is to couple a logit model with a novel variable selection approach, \"Stability Competitive Adaptive Re-weighted Sampling\" to address binary response. The efficiency of the proposed method is compared with the traditional logistic regression model based on eight model assessment criteria over real data from sexually transmitted infections in Indian men. Due to higher stability, the proposed method outperformed having a lower Akaike information criterion, and the Bayesian information criterion, as well as higher R-squared measures. The finally selected proposed model identified essential information regarding sexually transmitted infections in India for policymakers.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 6","pages":"e0324395"},"PeriodicalIF":2.6000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12148077/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0324395","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Variable selection methods are very popular, especially in the field of big data with large predictors. These procedures improve the accuracy and performance of the model by eliminating irrelevant and redundant variables. The main contribution of this study is to couple a logit model with a novel variable selection approach, "Stability Competitive Adaptive Re-weighted Sampling" to address binary response. The efficiency of the proposed method is compared with the traditional logistic regression model based on eight model assessment criteria over real data from sexually transmitted infections in Indian men. Due to higher stability, the proposed method outperformed having a lower Akaike information criterion, and the Bayesian information criterion, as well as higher R-squared measures. The finally selected proposed model identified essential information regarding sexually transmitted infections in India for policymakers.
期刊介绍:
PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides:
* Open-access—freely accessible online, authors retain copyright
* Fast publication times
* Peer review by expert, practicing researchers
* Post-publication tools to indicate quality and impact
* Community-based dialogue on articles
* Worldwide media coverage