Hyein Seo , Jae-Ho Park , Jangho Lee , Byung Chang Chung
{"title":"Explainable AI based feature selection in cancer RNA-seq","authors":"Hyein Seo , Jae-Ho Park , Jangho Lee , Byung Chang Chung","doi":"10.1016/j.icte.2025.05.004","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying informative features in bioinformatics is challenging due to their small proportion within large datasets. We propose a scalable and interpretable feature selection framework for cancer RNA-seq by transforming non-image bio-data into 2D formats and applying convolutional neural networks (CNNs) with transfer learning for efficient classification. Explainable artificial intelligence (XAI) techniques identify and prioritize important features, while principal component analysis (PCA) determines the optimal number of selected features, ensuring transparency and reliability. Comparative analysis of CNN and XAI highlights the effectiveness of our approach, providing a robust framework for high-dimensional genomic data analysis with applications in cancer diagnosis and prognosis.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"11 4","pages":"Pages 603-610"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICT Express","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405959525000669","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying informative features in bioinformatics is challenging due to their small proportion within large datasets. We propose a scalable and interpretable feature selection framework for cancer RNA-seq by transforming non-image bio-data into 2D formats and applying convolutional neural networks (CNNs) with transfer learning for efficient classification. Explainable artificial intelligence (XAI) techniques identify and prioritize important features, while principal component analysis (PCA) determines the optimal number of selected features, ensuring transparency and reliability. Comparative analysis of CNN and XAI highlights the effectiveness of our approach, providing a robust framework for high-dimensional genomic data analysis with applications in cancer diagnosis and prognosis.
期刊介绍:
The ICT Express journal published by the Korean Institute of Communications and Information Sciences (KICS) is an international, peer-reviewed research publication covering all aspects of information and communication technology. The journal aims to publish research that helps advance the theoretical and practical understanding of ICT convergence, platform technologies, communication networks, and device technologies. The technology advancement in information and communication technology (ICT) sector enables portable devices to be always connected while supporting high data rate, resulting in the recent popularity of smartphones that have a considerable impact in economic and social development.