利用机器学习模型鉴定影响猪肌内脂肪沉积的关键基因。

IF 2.8 3区 生物学 Q2 GENETICS & HEREDITY
Frontiers in Genetics Pub Date : 2025-01-06 eCollection Date: 2024-01-01 DOI:10.3389/fgene.2024.1503148
Yumei Shi, Xini Wang, Shaokang Chen, Yanhui Zhao, Yan Wang, Xihui Sheng, Xiaolong Qi, Lei Zhou, Yu Feng, Jianfeng Liu, Chuduan Wang, Kai Xing
{"title":"利用机器学习模型鉴定影响猪肌内脂肪沉积的关键基因。","authors":"Yumei Shi, Xini Wang, Shaokang Chen, Yanhui Zhao, Yan Wang, Xihui Sheng, Xiaolong Qi, Lei Zhou, Yu Feng, Jianfeng Liu, Chuduan Wang, Kai Xing","doi":"10.3389/fgene.2024.1503148","DOIUrl":null,"url":null,"abstract":"<p><p>Intramuscular fat (IMF) is an important indicator for evaluating meat quality. Transcriptome sequencing (RNA-seq) is widely used for the study of IMF deposition. Machine learning (ML) is a new big data fitting method that can effectively fit complex data, accurately identify samples and genes, and it plays an important role in omics research. Therefore, this study aimed to analyze RNA-seq data by ML method to identify differentially expressed genes (DEGs) affecting IMF deposition in pigs. In this study, a total of 74 RNA-seq data from muscle tissue samples were used. A total of 155 DEGs were identified using a limma package between the two groups. 100 and 11 significant genes were identified by support vector machine recursive feature elimination (SVM-RFE) and random forest (RF) models, respectively. A total of six intersecting genes were in both models. KEGG pathway enrichment analysis of the intersecting genes revealed that these genes were enriched in pathways associated with lipid deposition. These pathways include α-linolenic acid metabolism, linoleic acid metabolism, ether lipid metabolism, arachidonic acid metabolism, and glycerophospholipid metabolism. Four key genes affecting intramuscular fat deposition, <i>PLA2G6, MPV17, NUDT2</i>, and <i>ND4L</i>, were identified based on significant pathways. The results of this study are important for the elucidation of the molecular regulatory mechanism of intramuscular fat deposition and the effective improvement of IMF content in pigs.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"15 ","pages":"1503148"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11743517/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of key genes affecting intramuscular fat deposition in pigs using machine learning models.\",\"authors\":\"Yumei Shi, Xini Wang, Shaokang Chen, Yanhui Zhao, Yan Wang, Xihui Sheng, Xiaolong Qi, Lei Zhou, Yu Feng, Jianfeng Liu, Chuduan Wang, Kai Xing\",\"doi\":\"10.3389/fgene.2024.1503148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Intramuscular fat (IMF) is an important indicator for evaluating meat quality. Transcriptome sequencing (RNA-seq) is widely used for the study of IMF deposition. Machine learning (ML) is a new big data fitting method that can effectively fit complex data, accurately identify samples and genes, and it plays an important role in omics research. Therefore, this study aimed to analyze RNA-seq data by ML method to identify differentially expressed genes (DEGs) affecting IMF deposition in pigs. In this study, a total of 74 RNA-seq data from muscle tissue samples were used. A total of 155 DEGs were identified using a limma package between the two groups. 100 and 11 significant genes were identified by support vector machine recursive feature elimination (SVM-RFE) and random forest (RF) models, respectively. A total of six intersecting genes were in both models. KEGG pathway enrichment analysis of the intersecting genes revealed that these genes were enriched in pathways associated with lipid deposition. These pathways include α-linolenic acid metabolism, linoleic acid metabolism, ether lipid metabolism, arachidonic acid metabolism, and glycerophospholipid metabolism. Four key genes affecting intramuscular fat deposition, <i>PLA2G6, MPV17, NUDT2</i>, and <i>ND4L</i>, were identified based on significant pathways. The results of this study are important for the elucidation of the molecular regulatory mechanism of intramuscular fat deposition and the effective improvement of IMF content in pigs.</p>\",\"PeriodicalId\":12750,\"journal\":{\"name\":\"Frontiers in Genetics\",\"volume\":\"15 \",\"pages\":\"1503148\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11743517/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fgene.2024.1503148\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fgene.2024.1503148","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

肌内脂肪(IMF)是评价肉品质的重要指标。转录组测序(RNA-seq)被广泛用于研究IMF沉积。机器学习(ML)是一种新的大数据拟合方法,可以有效拟合复杂数据,准确识别样本和基因,在组学研究中发挥着重要作用。因此,本研究旨在通过ML方法分析RNA-seq数据,以鉴定影响猪体内IMF沉积的差异表达基因(DEGs)。在本研究中,共使用了74个来自肌肉组织样本的RNA-seq数据。使用limma包在两组之间共鉴定了155个deg。通过支持向量机递归特征消除(SVM-RFE)和随机森林(RF)模型分别识别出100个和11个显著基因。两个模型中共有6个交叉基因。交叉基因的KEGG通路富集分析显示,这些基因在与脂质沉积相关的通路中富集。这些途径包括α-亚麻酸代谢、亚油酸代谢、醚脂代谢、花生四烯酸代谢和甘油磷脂代谢。通过显著通路鉴定出影响肌内脂肪沉积的4个关键基因:PLA2G6、MPV17、NUDT2和ND4L。本研究结果对阐明猪肌内脂肪沉积的分子调控机制和有效提高猪体内IMF含量具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of key genes affecting intramuscular fat deposition in pigs using machine learning models.

Intramuscular fat (IMF) is an important indicator for evaluating meat quality. Transcriptome sequencing (RNA-seq) is widely used for the study of IMF deposition. Machine learning (ML) is a new big data fitting method that can effectively fit complex data, accurately identify samples and genes, and it plays an important role in omics research. Therefore, this study aimed to analyze RNA-seq data by ML method to identify differentially expressed genes (DEGs) affecting IMF deposition in pigs. In this study, a total of 74 RNA-seq data from muscle tissue samples were used. A total of 155 DEGs were identified using a limma package between the two groups. 100 and 11 significant genes were identified by support vector machine recursive feature elimination (SVM-RFE) and random forest (RF) models, respectively. A total of six intersecting genes were in both models. KEGG pathway enrichment analysis of the intersecting genes revealed that these genes were enriched in pathways associated with lipid deposition. These pathways include α-linolenic acid metabolism, linoleic acid metabolism, ether lipid metabolism, arachidonic acid metabolism, and glycerophospholipid metabolism. Four key genes affecting intramuscular fat deposition, PLA2G6, MPV17, NUDT2, and ND4L, were identified based on significant pathways. The results of this study are important for the elucidation of the molecular regulatory mechanism of intramuscular fat deposition and the effective improvement of IMF content in pigs.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Frontiers in Genetics
Frontiers in Genetics Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
5.50
自引率
8.10%
发文量
3491
审稿时长
14 weeks
期刊介绍: Frontiers in Genetics publishes rigorously peer-reviewed research on genes and genomes relating to all the domains of life, from humans to plants to livestock and other model organisms. Led by an outstanding Editorial Board of the world’s leading experts, this multidisciplinary, open-access journal is at the forefront of communicating cutting-edge research to researchers, academics, clinicians, policy makers and the public. The study of inheritance and the impact of the genome on various biological processes is well documented. However, the majority of discoveries are still to come. A new era is seeing major developments in the function and variability of the genome, the use of genetic and genomic tools and the analysis of the genetic basis of various biological phenomena.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信