Genome-wide Machine Learning Analysis of Anosmia and Ageusia with COVID-19.

Lucas Pietan, Elizabeth Phillippi, Marcelo Melo, Hatem El-Shanti, Brian J Smith, Benjamin Darbro, Terry Braun, Thomas Casavant
{"title":"Genome-wide Machine Learning Analysis of Anosmia and Ageusia with COVID-19.","authors":"Lucas Pietan, Elizabeth Phillippi, Marcelo Melo, Hatem El-Shanti, Brian J Smith, Benjamin Darbro, Terry Braun, Thomas Casavant","doi":"10.1101/2024.12.04.24318493","DOIUrl":null,"url":null,"abstract":"<p><p>The COVID-19 pandemic has caused substantial worldwide disruptions in health, economy, and society, manifesting symptoms such as loss of smell (anosmia) and loss of taste (ageusia), that can result in prolonged sensory impairment. Establishing the host genetic etiology of anosmia and ageusia in COVID-19 will aid in the overall understanding of the sensorineural aspect of the disease and contribute to possible treatments or cures. By using human genome sequencing data from the University of Iowa (UI) COVID-19 cohort (N=187) and the National Institute of Health All of Us (AoU) Research Program COVID-19 cohort (N=947), we investigated the genetics of anosmia and/or ageusia by employing feature selection techniques to construct a novel variant and gene prioritization pipeline, utilizing machine learning methods for the classification of patients. Models were assessed using a permutation-based variable importance (PVI) strategy for final prioritization of candidate variants and genes. The highest held-out test set area under the receiver operating characteristic (AUROC) curve for models and datasets from the UI cohort was 0.735 and 0.798 for the variant and gene analysis respectively and for the AoU cohort was 0.687 for the variant analysis. Our analysis prioritized several novel and known candidate host genetic factors involved in immune response, neuronal signaling, and calcium signaling supporting previously proposed hypotheses for anosmia/ageusia in COVID-19.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643161/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.12.04.24318493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The COVID-19 pandemic has caused substantial worldwide disruptions in health, economy, and society, manifesting symptoms such as loss of smell (anosmia) and loss of taste (ageusia), that can result in prolonged sensory impairment. Establishing the host genetic etiology of anosmia and ageusia in COVID-19 will aid in the overall understanding of the sensorineural aspect of the disease and contribute to possible treatments or cures. By using human genome sequencing data from the University of Iowa (UI) COVID-19 cohort (N=187) and the National Institute of Health All of Us (AoU) Research Program COVID-19 cohort (N=947), we investigated the genetics of anosmia and/or ageusia by employing feature selection techniques to construct a novel variant and gene prioritization pipeline, utilizing machine learning methods for the classification of patients. Models were assessed using a permutation-based variable importance (PVI) strategy for final prioritization of candidate variants and genes. The highest held-out test set area under the receiver operating characteristic (AUROC) curve for models and datasets from the UI cohort was 0.735 and 0.798 for the variant and gene analysis respectively and for the AoU cohort was 0.687 for the variant analysis. Our analysis prioritized several novel and known candidate host genetic factors involved in immune response, neuronal signaling, and calcium signaling supporting previously proposed hypotheses for anosmia/ageusia in COVID-19.

利用 COVID-19 对无嗅症和老年无嗅症进行全基因组机器学习分析。
COVID-19 大流行在全球范围内对健康、经济和社会造成了严重破坏,表现出嗅觉丧失(anosmia)和味觉丧失(ageusia)等症状,可导致长时间的感觉障碍。确定 COVID-19 中嗅觉缺失和味觉缺失的宿主遗传病因将有助于全面了解该疾病的感音神经方面,并为可能的治疗或治愈方法做出贡献。通过利用爱荷华大学(UI)COVID-19队列(N=187)和美国国立卫生研究院(National Institute of Health All of Us,AoU)研究计划COVID-19队列(N=947)的人类基因组测序数据,我们采用特征选择技术构建了一个新型变体和基因优先级管道,并利用机器学习方法对患者进行分类,从而研究了无嗅症和/或老年性无嗅症的遗传学。使用基于置换的变量重要性(PVI)策略对模型进行评估,以最终确定候选变体和基因的优先级。对于 UI 队列中的模型和数据集,变异分析和基因分析的最高保持测试集接收器操作特征曲线下面积(AUROC)分别为 0.735 和 0.798,而对于 AoU 队列,变异分析的最高保持测试集接收器操作特征曲线下面积为 0.687。我们的分析优先考虑了涉及免疫反应、神经元信号传导和钙信号传导的几个新的和已知的候选宿主遗传因子,支持之前提出的 COVID-19 中无精/老年痴呆症的假说。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信