{"title":"A deep learning tissue classifier based on differential Co-expression genes predicts the pregnancy outcomes of cattle.","authors":"Chenxi Huo, Chuanqiang Zhang, Jing Lu, Xiaofeng Su, Xiaoxia Qi, Yaqiang Guo, Yanchun Bao, Hongxia Jia, Guifang Cao, Risu Na, Wenguang Zhang, Xihe Li","doi":"10.1093/biolre/ioaf009","DOIUrl":null,"url":null,"abstract":"<p><p>Economic losses in cattle farms are frequently associated with failed pregnancies. Some studies found that the transcriptomic profiles of blood and endometrial tissues in cattle with varying pregnancy outcomes display discrepancies even before artificial insemination (AI) or embryo transfer (ET). In the study, 330 samples from seven distinct sources and two tissue types were integrated and divided into two groups based on the ability to establish and maintain pregnancy after AI or ET: P (pregnant) and NP (nonpregnant). By analyzing gene co-variation and employing machine learning algorithms, the objective was to identify genes that could predict pregnancy outcomes in cattle. Initially, within each tissue type, the top 100 differentially co-expressed genes (DCEG) were identified based on the analysis of changes in correlation coefficients and network topological structure. Subsequently, these genes were used in models trained by seven different machine learning algorithms. Overall, models trained on DCEGs exhibited superior predictive accuracy compared to those trained on an equivalent number of differential expression genes (DEGs). Among them, the deep learning models based on differential co-expression genes in blood and endometrial tissue achieved prediction accuracies of 91.7% and 82.6%, respectively. Finally, the importance of DCEGs was ranked using SHapley Additive exPlanations (SHAP) and enrichment analysis, identifying key signaling pathways that influence pregnancy. In summary, this study identified a set of genes potentially affecting pregnancy by analyzing the overall co-variation of gene connections between multiple sources. These key genes facilitated the development of interpretable machine learning models that accurately predict pregnancy outcomes in cattle.</p>","PeriodicalId":8965,"journal":{"name":"Biology of Reproduction","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology of Reproduction","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/biolre/ioaf009","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"REPRODUCTIVE BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Economic losses in cattle farms are frequently associated with failed pregnancies. Some studies found that the transcriptomic profiles of blood and endometrial tissues in cattle with varying pregnancy outcomes display discrepancies even before artificial insemination (AI) or embryo transfer (ET). In the study, 330 samples from seven distinct sources and two tissue types were integrated and divided into two groups based on the ability to establish and maintain pregnancy after AI or ET: P (pregnant) and NP (nonpregnant). By analyzing gene co-variation and employing machine learning algorithms, the objective was to identify genes that could predict pregnancy outcomes in cattle. Initially, within each tissue type, the top 100 differentially co-expressed genes (DCEG) were identified based on the analysis of changes in correlation coefficients and network topological structure. Subsequently, these genes were used in models trained by seven different machine learning algorithms. Overall, models trained on DCEGs exhibited superior predictive accuracy compared to those trained on an equivalent number of differential expression genes (DEGs). Among them, the deep learning models based on differential co-expression genes in blood and endometrial tissue achieved prediction accuracies of 91.7% and 82.6%, respectively. Finally, the importance of DCEGs was ranked using SHapley Additive exPlanations (SHAP) and enrichment analysis, identifying key signaling pathways that influence pregnancy. In summary, this study identified a set of genes potentially affecting pregnancy by analyzing the overall co-variation of gene connections between multiple sources. These key genes facilitated the development of interpretable machine learning models that accurately predict pregnancy outcomes in cattle.
期刊介绍:
Biology of Reproduction (BOR) is the official journal of the Society for the Study of Reproduction and publishes original research on a broad range of topics in the field of reproductive biology, as well as reviews on topics of current importance or controversy. BOR is consistently one of the most highly cited journals publishing original research in the field of reproductive biology.