{"title":"人工智能驱动的肿瘤学多组学数据分析的挑战:解决维度、稀疏性、透明度和伦理考虑","authors":"Maryem Ouhmouk , Shakuntala Baichoo , Mounia Abik","doi":"10.1016/j.imu.2025.101679","DOIUrl":null,"url":null,"abstract":"<div><div>Artificial intelligence, particularly deep learning, is becoming increasingly prominent in multi-omics research, especially since traditional statistical models struggle to handle the complexity and high dimensionality of such data. By effectively combining different types of omics data, AI techniques can unveil hidden connections, detect biomarkers, and improve disease prediction through the integration of multi-omics layers and modalities, which can lead to significant advancements in precision medicine. In this review, we gathered published methods of deep learning-based multi-omics integration specialized in oncology since 2020. We concentrated exclusively on studies utilizing cancer omics data mainly sourced from The Cancer Genome Atlas (TCGA) database. As a result, we identified 32 articles that generally fulfilled the criteria. We studied their techniques and their ability to handle challenges in analyzing multi-omics data, particularly regarding missing data, dimensionality, and processing workflows. We also discuss how well these methods consider explainability, interpretability, and ethical aspects in developing solutions that treat private medical and sensitive information.</div><div>From the 32 studies, we can divide deep learning-based multi-omics integration methods into two types: non-generative and generative models. Non-generative approaches, such as feedforward neural networks (FFNs), graph convolutional networks (GCNs), and autoencoders, are designed to extract features and perform classification directly. On the other hand, generative methods such as variational autoencoders (VAEs), generative adversarial networks (GANs), and generative pretrained transformers (GPTs) focus on creating adaptable representations that can be shared across multiple modalities. These methods have advanced the handling of missing data and dimensionality, outperforming traditional approaches. However, most reviewed models remain at the proof-of-concept stage, with limited clinical validation or real-world deployment.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"57 ","pages":"Article 101679"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Challenges in AI-driven multi-omics data analysis for Oncology: Addressing dimensionality, sparsity, transparency and ethical considerations\",\"authors\":\"Maryem Ouhmouk , Shakuntala Baichoo , Mounia Abik\",\"doi\":\"10.1016/j.imu.2025.101679\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Artificial intelligence, particularly deep learning, is becoming increasingly prominent in multi-omics research, especially since traditional statistical models struggle to handle the complexity and high dimensionality of such data. By effectively combining different types of omics data, AI techniques can unveil hidden connections, detect biomarkers, and improve disease prediction through the integration of multi-omics layers and modalities, which can lead to significant advancements in precision medicine. In this review, we gathered published methods of deep learning-based multi-omics integration specialized in oncology since 2020. We concentrated exclusively on studies utilizing cancer omics data mainly sourced from The Cancer Genome Atlas (TCGA) database. As a result, we identified 32 articles that generally fulfilled the criteria. We studied their techniques and their ability to handle challenges in analyzing multi-omics data, particularly regarding missing data, dimensionality, and processing workflows. We also discuss how well these methods consider explainability, interpretability, and ethical aspects in developing solutions that treat private medical and sensitive information.</div><div>From the 32 studies, we can divide deep learning-based multi-omics integration methods into two types: non-generative and generative models. Non-generative approaches, such as feedforward neural networks (FFNs), graph convolutional networks (GCNs), and autoencoders, are designed to extract features and perform classification directly. On the other hand, generative methods such as variational autoencoders (VAEs), generative adversarial networks (GANs), and generative pretrained transformers (GPTs) focus on creating adaptable representations that can be shared across multiple modalities. These methods have advanced the handling of missing data and dimensionality, outperforming traditional approaches. However, most reviewed models remain at the proof-of-concept stage, with limited clinical validation or real-world deployment.</div></div>\",\"PeriodicalId\":13953,\"journal\":{\"name\":\"Informatics in Medicine Unlocked\",\"volume\":\"57 \",\"pages\":\"Article 101679\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatics in Medicine Unlocked\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352914825000681\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000681","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
Challenges in AI-driven multi-omics data analysis for Oncology: Addressing dimensionality, sparsity, transparency and ethical considerations
Artificial intelligence, particularly deep learning, is becoming increasingly prominent in multi-omics research, especially since traditional statistical models struggle to handle the complexity and high dimensionality of such data. By effectively combining different types of omics data, AI techniques can unveil hidden connections, detect biomarkers, and improve disease prediction through the integration of multi-omics layers and modalities, which can lead to significant advancements in precision medicine. In this review, we gathered published methods of deep learning-based multi-omics integration specialized in oncology since 2020. We concentrated exclusively on studies utilizing cancer omics data mainly sourced from The Cancer Genome Atlas (TCGA) database. As a result, we identified 32 articles that generally fulfilled the criteria. We studied their techniques and their ability to handle challenges in analyzing multi-omics data, particularly regarding missing data, dimensionality, and processing workflows. We also discuss how well these methods consider explainability, interpretability, and ethical aspects in developing solutions that treat private medical and sensitive information.
From the 32 studies, we can divide deep learning-based multi-omics integration methods into two types: non-generative and generative models. Non-generative approaches, such as feedforward neural networks (FFNs), graph convolutional networks (GCNs), and autoencoders, are designed to extract features and perform classification directly. On the other hand, generative methods such as variational autoencoders (VAEs), generative adversarial networks (GANs), and generative pretrained transformers (GPTs) focus on creating adaptable representations that can be shared across multiple modalities. These methods have advanced the handling of missing data and dimensionality, outperforming traditional approaches. However, most reviewed models remain at the proof-of-concept stage, with limited clinical validation or real-world deployment.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.