Prediction of new-onset migraine using clinical-genotypic data from the HUNT Study: a machine learning analysis.

IF 7.3 1区 医学 Q1 CLINICAL NEUROLOGY
Fahim Faisal, Antonios Danelakis, Marte-Helene Bjørk, Bendik Winsvold, Manjit Matharu, Parashkev Nachev, Knut Hagen, Erling Tronvik, Anker Stubberud
{"title":"Prediction of new-onset migraine using clinical-genotypic data from the HUNT Study: a machine learning analysis.","authors":"Fahim Faisal, Antonios Danelakis, Marte-Helene Bjørk, Bendik Winsvold, Manjit Matharu, Parashkev Nachev, Knut Hagen, Erling Tronvik, Anker Stubberud","doi":"10.1186/s10194-025-02014-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Migraine is associated with a range of symptoms and comorbid disorders and has a strong genetic basis, but the currently identified risk loci only explain a small portion of the heritability, often termed the \"missing heritability\". We aimed to investigate if machine learning could exploit the combination of genetic data and general clinical features to identify individuals at risk for new-onset migraine.</p><p><strong>Method: </strong>This study was a population-based cohort study of adults from the second and third Trøndelag Health Study (HUNT2 and HUNT3). Migraine was captured in a validated questionnaire and based on modified criteria of the International Classification of Headache Disorders (ICHD) and participants underwent genome-wide genotyping. The primary outcome was new-onset migraine defined as a change in disease status from headache-free in HUNT2 to migraine in HUNT3. The migraine risk variants identified in the largest GWAS meta-analysis of migraine were used to identify genetic input features for the models. The general clinical features included demographics, selected comorbidities, medication and stimulant use and non-headache symptoms as predictive factors. Several standard machine learning architectures were constructed, trained, optimized and scored with area under the receiver operating characteristics curve (AUC). The best model during training and validation was used on unseen test sets. Different methods for model explainability were employed.</p><p><strong>Results: </strong>A total of 12,995 individuals were included in the predictive modelling (491 new-onset cases). A total of 108 genetic variants and 67 general clinical variables were included in the models. The top performing decision-tree classifier achieved a test set AUC of 0.56 when using only genotypic data, 0.68 when using only clinical data and 0.72 when using both genetic and clinical data. Combining the genotype only and clinical data only models resulted in a lower predictivity with an AUC of 0.67. The most important clinical features were age, marital status and work situation as well as several genetic variants.</p><p><strong>Conclusion: </strong>The combination of genotype and routine demographic and non-headache clinical data correctly predict the new onset of migraine in approximately 2 out of 3 cases, supporting that there are important genotypic-phenotypic interactions partaking in the new-onset of migraine.</p>","PeriodicalId":16013,"journal":{"name":"Journal of Headache and Pain","volume":"26 1","pages":"70"},"PeriodicalIF":7.3000,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Headache and Pain","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s10194-025-02014-2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Migraine is associated with a range of symptoms and comorbid disorders and has a strong genetic basis, but the currently identified risk loci only explain a small portion of the heritability, often termed the "missing heritability". We aimed to investigate if machine learning could exploit the combination of genetic data and general clinical features to identify individuals at risk for new-onset migraine.

Method: This study was a population-based cohort study of adults from the second and third Trøndelag Health Study (HUNT2 and HUNT3). Migraine was captured in a validated questionnaire and based on modified criteria of the International Classification of Headache Disorders (ICHD) and participants underwent genome-wide genotyping. The primary outcome was new-onset migraine defined as a change in disease status from headache-free in HUNT2 to migraine in HUNT3. The migraine risk variants identified in the largest GWAS meta-analysis of migraine were used to identify genetic input features for the models. The general clinical features included demographics, selected comorbidities, medication and stimulant use and non-headache symptoms as predictive factors. Several standard machine learning architectures were constructed, trained, optimized and scored with area under the receiver operating characteristics curve (AUC). The best model during training and validation was used on unseen test sets. Different methods for model explainability were employed.

Results: A total of 12,995 individuals were included in the predictive modelling (491 new-onset cases). A total of 108 genetic variants and 67 general clinical variables were included in the models. The top performing decision-tree classifier achieved a test set AUC of 0.56 when using only genotypic data, 0.68 when using only clinical data and 0.72 when using both genetic and clinical data. Combining the genotype only and clinical data only models resulted in a lower predictivity with an AUC of 0.67. The most important clinical features were age, marital status and work situation as well as several genetic variants.

Conclusion: The combination of genotype and routine demographic and non-headache clinical data correctly predict the new onset of migraine in approximately 2 out of 3 cases, supporting that there are important genotypic-phenotypic interactions partaking in the new-onset of migraine.

利用HUNT研究的临床基因型数据预测新发偏头痛:机器学习分析。
背景:偏头痛与一系列症状和合并症相关,具有很强的遗传基础,但目前确定的风险位点仅解释了遗传力的一小部分,通常称为“缺失遗传力”。我们的目的是研究机器学习是否可以利用遗传数据和一般临床特征的结合来识别新发偏头痛的风险个体。方法:本研究是一项基于人群的队列研究,研究对象来自第二和第三项Trøndelag健康研究(HUNT2和HUNT3)。偏头痛是根据国际头痛疾病分类(ICHD)的修改标准在一份有效的问卷中捕获的,参与者进行了全基因组基因分型。主要结局是新发偏头痛,定义为疾病状态从HUNT2的无头痛到HUNT3的偏头痛。在最大的偏头痛GWAS荟萃分析中确定的偏头痛风险变异被用于确定模型的遗传输入特征。一般临床特征包括人口统计学特征、选定的合并症、药物和兴奋剂的使用以及非头痛症状作为预测因素。构建、训练、优化了几个标准机器学习架构,并使用接收者操作特征曲线(AUC)下的面积进行评分。在未见过的测试集上使用训练和验证过程中的最佳模型。采用了不同的模型可解释性方法。结果:预测模型共纳入12995人(其中新发病例491例)。共有108个遗传变异和67个一般临床变量被纳入模型。表现最好的决策树分类器在仅使用基因型数据时的测试集AUC为0.56,仅使用临床数据时为0.68,同时使用遗传和临床数据时为0.72。仅基因型和仅临床数据模型相结合的预测结果较低,AUC为0.67。最重要的临床特征是年龄、婚姻状况和工作情况以及一些基因变异。结论:基因型与常规人口学及非头痛临床资料的结合,能正确预测约2 / 3的偏头痛新发病例,支持偏头痛新发存在重要的基因型-表型相互作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Headache and Pain
Journal of Headache and Pain 医学-临床神经学
CiteScore
11.80
自引率
13.50%
发文量
143
审稿时长
6-12 weeks
期刊介绍: The Journal of Headache and Pain, a peer-reviewed open-access journal published under the BMC brand, a part of Springer Nature, is dedicated to researchers engaged in all facets of headache and related pain syndromes. It encompasses epidemiology, public health, basic science, translational medicine, clinical trials, and real-world data. With a multidisciplinary approach, The Journal of Headache and Pain addresses headache medicine and related pain syndromes across all medical disciplines. It particularly encourages submissions in clinical, translational, and basic science fields, focusing on pain management, genetics, neurology, and internal medicine. The journal publishes research articles, reviews, letters to the Editor, as well as consensus articles and guidelines, aimed at promoting best practices in managing patients with headaches and related pain.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信