稀疏正则化通过逻辑回归增强基因选择和白血病亚型分类能力

IF 2.1 4区 医学 Q3 HEMATOLOGY
Nozad Hussein Mahmood , Dler Hussein Kadir
{"title":"稀疏正则化通过逻辑回归增强基因选择和白血病亚型分类能力","authors":"Nozad Hussein Mahmood ,&nbsp;Dler Hussein Kadir","doi":"10.1016/j.leukres.2025.107663","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigated the application of sparsity regularization methods to improve the classification of leukemia subtypes using high-dimensional gene expression data. Multinomial logistic regression models with the sparsity methods of Ridge, Lasso, and Elastic Net regularizations were employed to address overfitting and dimensionality issues while enhancing model interpretability. The study used a leukemia cancer dataset from the Curated Microarray Database (CuMiDa), which included gene expression data for 16,383 genes across 281 samples representing seven different types of leukemia. The statistical metrics of Accuracy, Kappa statistics, AUC, and F1-score were used to measure the models' implementation. Besides, the effectiveness and ability of each method in gene selection and dimensional reduction of the models were discussed. Elastic Net regularization was a better technique than the Ridge and Lasso based on overall classification performance; it also reached the highest accuracy along with Kappa values. On the other hand, both Lasso and Elastic Net were making more effective feature selections, creating sparse models that could efficiently discriminate leukemia subtypes. In this way, the results highlighted that the inclusion of sparsity regularization could enhance knowledge and accuracy in such a challenging task of subclass leukemia classification, enabling much more tailored treatments.</div></div>","PeriodicalId":18051,"journal":{"name":"Leukemia research","volume":"150 ","pages":"Article 107663"},"PeriodicalIF":2.1000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sparsity regularization enhances gene selection and leukemia subtype classification via logistic regression\",\"authors\":\"Nozad Hussein Mahmood ,&nbsp;Dler Hussein Kadir\",\"doi\":\"10.1016/j.leukres.2025.107663\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study investigated the application of sparsity regularization methods to improve the classification of leukemia subtypes using high-dimensional gene expression data. Multinomial logistic regression models with the sparsity methods of Ridge, Lasso, and Elastic Net regularizations were employed to address overfitting and dimensionality issues while enhancing model interpretability. The study used a leukemia cancer dataset from the Curated Microarray Database (CuMiDa), which included gene expression data for 16,383 genes across 281 samples representing seven different types of leukemia. The statistical metrics of Accuracy, Kappa statistics, AUC, and F1-score were used to measure the models' implementation. Besides, the effectiveness and ability of each method in gene selection and dimensional reduction of the models were discussed. Elastic Net regularization was a better technique than the Ridge and Lasso based on overall classification performance; it also reached the highest accuracy along with Kappa values. On the other hand, both Lasso and Elastic Net were making more effective feature selections, creating sparse models that could efficiently discriminate leukemia subtypes. In this way, the results highlighted that the inclusion of sparsity regularization could enhance knowledge and accuracy in such a challenging task of subclass leukemia classification, enabling much more tailored treatments.</div></div>\",\"PeriodicalId\":18051,\"journal\":{\"name\":\"Leukemia research\",\"volume\":\"150 \",\"pages\":\"Article 107663\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-02-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Leukemia research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0145212625000232\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"HEMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Leukemia research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0145212625000232","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEMATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sparsity regularization enhances gene selection and leukemia subtype classification via logistic regression
This study investigated the application of sparsity regularization methods to improve the classification of leukemia subtypes using high-dimensional gene expression data. Multinomial logistic regression models with the sparsity methods of Ridge, Lasso, and Elastic Net regularizations were employed to address overfitting and dimensionality issues while enhancing model interpretability. The study used a leukemia cancer dataset from the Curated Microarray Database (CuMiDa), which included gene expression data for 16,383 genes across 281 samples representing seven different types of leukemia. The statistical metrics of Accuracy, Kappa statistics, AUC, and F1-score were used to measure the models' implementation. Besides, the effectiveness and ability of each method in gene selection and dimensional reduction of the models were discussed. Elastic Net regularization was a better technique than the Ridge and Lasso based on overall classification performance; it also reached the highest accuracy along with Kappa values. On the other hand, both Lasso and Elastic Net were making more effective feature selections, creating sparse models that could efficiently discriminate leukemia subtypes. In this way, the results highlighted that the inclusion of sparsity regularization could enhance knowledge and accuracy in such a challenging task of subclass leukemia classification, enabling much more tailored treatments.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Leukemia research
Leukemia research 医学-血液学
CiteScore
4.00
自引率
3.70%
发文量
259
审稿时长
1 months
期刊介绍: Leukemia Research an international journal which brings comprehensive and current information to all health care professionals involved in basic and applied clinical research in hematological malignancies. The editors encourage the submission of articles relevant to hematological malignancies. The Journal scope includes reporting studies of cellular and molecular biology, genetics, immunology, epidemiology, clinical evaluation, and therapy of these diseases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信