Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging.

PRedictive Intelligence in MEdicine. PRIME (Workshop) Pub Date : 2025-01-01 Epub Date: 2024-10-18 DOI:10.1007/978-3-031-74561-4_3

Magdalini Paschali, Yu Hang Jiang, Spencer Siegel, Camila Gonzalez, Kilian M Pohl, Akshay Chaudhari, Qingyu Zhao

{"title":"Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging.","authors":"Magdalini Paschali, Yu Hang Jiang, Spencer Siegel, Camila Gonzalez, Kilian M Pohl, Akshay Chaudhari, Qingyu Zhao","doi":"10.1007/978-3-031-74561-4_3","DOIUrl":null,"url":null,"abstract":"Recent advancements in medicine have confirmed that brain disorders often comprise multiple subtypes of mechanisms, developmental trajectories, or severity levels. Such heterogeneity is often associated with demographic aspects (e.g., sex) or disease-related contributors (e.g., genetics). Thus, the predictive power of machine learning models used for symptom prediction varies across subjects based on such factors. To model this heterogeneity, one can assign each training sample a factor-dependent weight, which modulates the subject's contribution to the overall objective loss function. To this end, we propose to model the subject weights as a linear combination of the eigenbases of a spectral population graph that captures the similarity of factors across subjects. In doing so, the learned weights smoothly vary across the graph, highlighting sub-cohorts with high and low predictability. Our proposed sample weighting scheme is evaluated on two tasks. First, we predict initiation of heavy alcohol drinking in young adulthood from imaging and neuropsychological measures from the National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA). Next, we detect Dementia vs. Mild Cognitive Impairment (MCI) using imaging and demographic measurements in subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Compared to existing sample weighting schemes, our sample weights improve interpretability and highlight sub-cohorts with distinct characteristics and varying model accuracy.","PeriodicalId":92572,"journal":{"name":"PRedictive Intelligence in MEdicine. PRIME (Workshop)","volume":"15155 ","pages":"24-34"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549025/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PRedictive Intelligence in MEdicine. PRIME (Workshop)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-74561-4_3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/18 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recent advancements in medicine have confirmed that brain disorders often comprise multiple subtypes of mechanisms, developmental trajectories, or severity levels. Such heterogeneity is often associated with demographic aspects (e.g., sex) or disease-related contributors (e.g., genetics). Thus, the predictive power of machine learning models used for symptom prediction varies across subjects based on such factors. To model this heterogeneity, one can assign each training sample a factor-dependent weight, which modulates the subject's contribution to the overall objective loss function. To this end, we propose to model the subject weights as a linear combination of the eigenbases of a spectral population graph that captures the similarity of factors across subjects. In doing so, the learned weights smoothly vary across the graph, highlighting sub-cohorts with high and low predictability. Our proposed sample weighting scheme is evaluated on two tasks. First, we predict initiation of heavy alcohol drinking in young adulthood from imaging and neuropsychological measures from the National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA). Next, we detect Dementia vs. Mild Cognitive Impairment (MCI) using imaging and demographic measurements in subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Compared to existing sample weighting schemes, our sample weights improve interpretability and highlight sub-cohorts with distinct characteristics and varying model accuracy.

查看原文本刊更多论文

光谱图样本加权，用于神经成像预测模型中可解释的子队列分析

医学的最新进展证实，脑部疾病通常由多种亚型机制、发展轨迹或严重程度组成。这种异质性通常与人口统计学方面（如性别）或疾病相关因素（如遗传学）有关。因此，基于这些因素，用于症状预测的机器学习模型对不同受试者的预测能力各不相同。为了模拟这种异质性，我们可以给每个训练样本分配一个与因素相关的权重，从而调节受试者对总体目标损失函数的贡献。为此，我们建议将受试者权重建模为光谱群体图特征基的线性组合，以捕捉不同受试者之间的因素相似性。这样，学习到的权重就会在整个图中平滑变化，从而突出具有高预测性和低预测性的子群组。我们提出的样本加权方案在两项任务中进行了评估。首先，我们通过国家青少年酒精和神经发育联盟（NCANDA）的成像和神经心理学测量来预测成年后开始大量饮酒的情况。接下来，我们利用阿尔茨海默病神经影像学倡议（ADNI）受试者的影像学和人口统计学测量结果检测痴呆症与轻度认知障碍（MCI）。与现有的样本加权方案相比，我们的样本加权提高了可解释性，并突出了具有不同特征和不同模型准确性的子队列。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PRedictive Intelligence in MEdicine. PRIME (Workshop)

自引率

0.00%

发文量