Utilizing Feature Selection Techniques for AI-Driven Tumor Subtype Classification: Enhancing Precision in Cancer Diagnostics.

IF 4.8 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Biomolecules Pub Date : 2025-01-08 DOI:10.3390/biom15010081

Jihan Wang, Zhengxiang Zhang, Yangyang Wang

{"title":"Utilizing Feature Selection Techniques for AI-Driven Tumor Subtype Classification: Enhancing Precision in Cancer Diagnostics.","authors":"Jihan Wang, Zhengxiang Zhang, Yangyang Wang","doi":"10.3390/biom15010081","DOIUrl":null,"url":null,"abstract":"<p><p>Cancer's heterogeneity presents significant challenges in accurate diagnosis and effective treatment, including the complexity of identifying tumor subtypes and their diverse biological behaviors. This review examines how feature selection techniques address these challenges by improving the interpretability and performance of machine learning (ML) models in high-dimensional datasets. Feature selection methods-such as filter, wrapper, and embedded techniques-play a critical role in enhancing the precision of cancer diagnostics by identifying relevant biomarkers. The integration of multi-omics data and ML algorithms facilitates a more comprehensive understanding of tumor heterogeneity, advancing both diagnostics and personalized therapies. However, challenges such as ensuring data quality, mitigating overfitting, and addressing scalability remain critical limitations of these methods. Artificial intelligence (AI)-powered feature selection offers promising solutions to these issues by automating and refining the feature extraction process. This review highlights the transformative potential of these approaches while emphasizing future directions, including the incorporation of deep learning (DL) models and integrative multi-omics strategies for more robust and reproducible findings.</p>","PeriodicalId":8943,"journal":{"name":"Biomolecules","volume":"15 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11763904/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomolecules","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3390/biom15010081","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Cancer's heterogeneity presents significant challenges in accurate diagnosis and effective treatment, including the complexity of identifying tumor subtypes and their diverse biological behaviors. This review examines how feature selection techniques address these challenges by improving the interpretability and performance of machine learning (ML) models in high-dimensional datasets. Feature selection methods-such as filter, wrapper, and embedded techniques-play a critical role in enhancing the precision of cancer diagnostics by identifying relevant biomarkers. The integration of multi-omics data and ML algorithms facilitates a more comprehensive understanding of tumor heterogeneity, advancing both diagnostics and personalized therapies. However, challenges such as ensuring data quality, mitigating overfitting, and addressing scalability remain critical limitations of these methods. Artificial intelligence (AI)-powered feature selection offers promising solutions to these issues by automating and refining the feature extraction process. This review highlights the transformative potential of these approaches while emphasizing future directions, including the incorporation of deep learning (DL) models and integrative multi-omics strategies for more robust and reproducible findings.

查看原文本刊更多论文

癌症的异质性给准确诊断和有效治疗带来了巨大挑战，包括识别肿瘤亚型及其不同生物学行为的复杂性。本综述探讨了特征选择技术如何通过提高机器学习（ML）模型在高维数据集中的可解释性和性能来应对这些挑战。特征选择方法--如过滤器、包装器和嵌入式技术--在通过识别相关生物标记物提高癌症诊断精确度方面发挥着至关重要的作用。多组学数据与 ML 算法的整合有助于更全面地了解肿瘤的异质性，从而促进诊断和个性化治疗。然而，确保数据质量、减少过拟合和解决可扩展性等挑战仍然是这些方法的关键局限。人工智能（AI）驱动的特征选择通过自动化和完善特征提取过程，为解决这些问题提供了前景广阔的解决方案。这篇综述强调了这些方法的变革潜力，同时强调了未来的发展方向，包括纳入深度学习（DL）模型和综合多组学策略，以获得更可靠、更可重复的研究结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomolecules Biochemistry, Genetics and Molecular Biology-Molecular Biology

CiteScore

9.40

自引率

3.60%

发文量

1640

审稿时长

18.28 days

期刊介绍： Biomolecules (ISSN 2218-273X) is an international, peer-reviewed open access journal focusing on biogenic substances and their biological functions, structures, interactions with other molecules, and their microenvironment as well as biological systems. Biomolecules publishes reviews, regular research papers and short communications. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced.