Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang
{"title":"基于机器学习的整合素在癌症和转移中的表达模式研究。","authors":"Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang","doi":"10.1038/s41598-025-89497-w","DOIUrl":null,"url":null,"abstract":"<p><p>Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"5270"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11821851/pdf/","citationCount":"0","resultStr":"{\"title\":\"A machine learning-based investigation of integrin expression patterns in cancer and metastasis.\",\"authors\":\"Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang\",\"doi\":\"10.1038/s41598-025-89497-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"5270\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11821851/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-89497-w\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-89497-w","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A machine learning-based investigation of integrin expression patterns in cancer and metastasis.
Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.
期刊介绍:
We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections.
Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021).
•Engineering
Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live.
•Physical sciences
Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics.
•Earth and environmental sciences
Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems.
•Biological sciences
Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants.
•Health sciences
The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.