基于机器学习的整合素在癌症和转移中的表达模式研究。

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang
{"title":"基于机器学习的整合素在癌症和转移中的表达模式研究。","authors":"Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang","doi":"10.1038/s41598-025-89497-w","DOIUrl":null,"url":null,"abstract":"<p><p>Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"5270"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11821851/pdf/","citationCount":"0","resultStr":"{\"title\":\"A machine learning-based investigation of integrin expression patterns in cancer and metastasis.\",\"authors\":\"Hossain Shadman, Saghar Gomrok, Christopher Litle, Qianyi Cheng, Yu Jiang, Xiaohua Huang, Jesse D Ziebarth, Yongmei Wang\",\"doi\":\"10.1038/s41598-025-89497-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"5270\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11821851/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-89497-w\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-89497-w","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

整合素是一个跨膜受体蛋白家族,在癌症的发生和转移中发挥着重要作用。然而,由于特定整合素、癌症类型和癌症进展阶段之间的复杂关系,对这些作用的全面理解尚未实现。来自基因型组织表达(GTEx)和癌症基因组图谱(TCGA)项目的可公开访问的存储库为使用机器学习(ML)探索这些关系提供了丰富的数据集。本研究选取了GTEx中~ 8个健康组织和TCGA中相应肿瘤的整合素RNA-Seq表达数据。利用整合素表达来训练ML模型,以区分不同的健康组织、实体瘤,以及来自同一组织类型的正常和肿瘤样本。这些ML模型可以根据组织来源或疾病状态对样本进行高精度分类,并确定了这些分类器所必需的整合素。在某些情况下,仅需要一种或两种整合素的表达就可以区分组织类型、肿瘤类型或疾病状态,准确率为> 0.9。例如,ITGA7的单独表达可以区分健康和癌性乳腺组织。此外,我们还比较了整合素在健康和癌变乳腺组织中的共表达网络,发现从健康到癌变,整合素的共表达网络发生了显著变化,这表明整合素在癌变过程中的功能参与发生了变化。利用AURORA项目对转移性乳腺癌(MBC)的数据进一步检测了整合素在转移性肿瘤中的表达,发现一些整合素如ITGAD、ITGA4、ITGAL和ITGA11在转移性肿瘤中的表达明显低于原发肿瘤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A machine learning-based investigation of integrin expression patterns in cancer and metastasis.

Integrins, a family of transmembrane receptor proteins, are well known to play important roles in cancer development and metastasis. However, a comprehensive understanding of these roles has not been achieved due to the complex relationships between specific integrins, cancer types, and the stages of cancer progression. Publicly accessible repositories from the Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) projects provide rich datasets for exploring these relationships using machine learning (ML). In this study, integrin RNA-Seq expression data of ~ 8 healthy tissues in GTEx and corresponding tumors in TCGA were selected. Integrin expression was used to train ML models to distinguish between different healthy tissues, solid tumors, as well as normal and tumor samples from the same tissue type. These ML models can classify samples by tissue origin or disease status with high accuracy, and the integrins essential to these classifiers were identified. In some cases, the expression of only one or two integrins was needed to classify tissue type, tumor type or disease status with accuracy > 0.9. For example, expression of ITGA7 alone can distinguish healthy and cancerous breast tissue. Additionally, integrin co-expression networks in healthy and cancerous breast tissues were compared and were found to change significantly from healthy to cancer, indicating changes in functional involvement of integrins due to cancer. Integrin expression in metastatic tumors were further examined using data from the AURORA project for Metastatic Breast Cancer (MBC), and several integrins such as ITGAD, ITGA4, ITGAL, and ITGA11 were found to have significantly lower expression in metastases than in primary tumors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信