Federated Choquet Regression with LASSO for Outcome Prediction in Multisite Longitudinal Trial Data.

IF 8
ACM transactions on computing for healthcare Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI:10.1145/3761824
Semyon Lomasov, Hua Fang, Honggang Wang
{"title":"Federated Choquet Regression with LASSO for Outcome Prediction in Multisite Longitudinal Trial Data.","authors":"Semyon Lomasov, Hua Fang, Honggang Wang","doi":"10.1145/3761824","DOIUrl":null,"url":null,"abstract":"<p><p>Aggregating person-level data across multiple clinical study sites is often constrained by privacy regulations, necessitating the development of decentralized modeling approaches in biomedical research. To address this requirement, a federated nonlinear regression algorithm based on the Choquet integral has been introduced for outcome prediction. This approach avoids reliance on prior statistical assumptions about data distribution and captures feature interactions, reflecting the non-additive nature of biomedical data characteristics. This work represents the first theoretical application of Choquet integral regression to multisite longitudinal trial data within a federated learning framework. The Multiple Imputation Choquet Integral Regression with LASSO (MIChoquet-LASSO) algorithm is specifically designed to reduce overfitting and enable variable selection in federated learning settings. Its performance has been evaluated using synthetic datasets, publicly available biomedical datasets, and proprietary longitudinal randomized controlled trial data. Comparative evaluations were conducted against benchmark methods, including OLS regression and Choquet OLS regression, under various scenarios such as model misspecification and both linear and nonlinear data structures in non-federated and federated contexts. MSE was used as the primary performance metric. Results indicate that MIChoquet-LASSO outperforms compared models in handling nonlinear longitudinal data with missing values, particularly in scenarios prone to overfitting. In federated settings, Choquet OLS underperforms, whereas the federated variant of the model, FEDMIChoquet-LASSO, demonstrates consistently better performance. These findings suggest that FEDMIChoquet-LASSO offers a reliable solution for outcome prediction in multisite longitudinal trials, addressing challenges such as missing values, nonlinear relationships, and privacy constraints while maintaining strong performance within the federated learning framework.</p>","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"7 1","pages":""},"PeriodicalIF":8.0000,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13052512/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM transactions on computing for healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3761824","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/14 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aggregating person-level data across multiple clinical study sites is often constrained by privacy regulations, necessitating the development of decentralized modeling approaches in biomedical research. To address this requirement, a federated nonlinear regression algorithm based on the Choquet integral has been introduced for outcome prediction. This approach avoids reliance on prior statistical assumptions about data distribution and captures feature interactions, reflecting the non-additive nature of biomedical data characteristics. This work represents the first theoretical application of Choquet integral regression to multisite longitudinal trial data within a federated learning framework. The Multiple Imputation Choquet Integral Regression with LASSO (MIChoquet-LASSO) algorithm is specifically designed to reduce overfitting and enable variable selection in federated learning settings. Its performance has been evaluated using synthetic datasets, publicly available biomedical datasets, and proprietary longitudinal randomized controlled trial data. Comparative evaluations were conducted against benchmark methods, including OLS regression and Choquet OLS regression, under various scenarios such as model misspecification and both linear and nonlinear data structures in non-federated and federated contexts. MSE was used as the primary performance metric. Results indicate that MIChoquet-LASSO outperforms compared models in handling nonlinear longitudinal data with missing values, particularly in scenarios prone to overfitting. In federated settings, Choquet OLS underperforms, whereas the federated variant of the model, FEDMIChoquet-LASSO, demonstrates consistently better performance. These findings suggest that FEDMIChoquet-LASSO offers a reliable solution for outcome prediction in multisite longitudinal trials, addressing challenges such as missing values, nonlinear relationships, and privacy constraints while maintaining strong performance within the federated learning framework.

多地点纵向试验数据的LASSO联合Choquet回归预测结果。
在多个临床研究地点汇总个人层面的数据通常受到隐私法规的限制,因此有必要在生物医学研究中开发分散的建模方法。为了满足这一要求,引入了一种基于Choquet积分的联邦非线性回归算法进行结果预测。这种方法避免了对数据分布的先验统计假设的依赖,并捕获了特征相互作用,反映了生物医学数据特征的非相加性。这项工作代表了Choquet积分回归在联邦学习框架内对多站点纵向试验数据的第一个理论应用。基于LASSO的多重输入Choquet积分回归(MIChoquet-LASSO)算法是专门为减少过拟合和在联邦学习设置中实现变量选择而设计的。使用合成数据集、公开可用的生物医学数据集和专有的纵向随机对照试验数据对其性能进行了评估。在不同的场景下,如模型规格错误、非联邦和联邦环境下的线性和非线性数据结构,对包括OLS回归和Choquet OLS回归在内的基准方法进行了比较评估。MSE被用作主要性能指标。结果表明,MIChoquet-LASSO在处理具有缺失值的非线性纵向数据方面优于比较模型,特别是在容易过度拟合的情况下。在联邦设置中,Choquet OLS表现不佳,而该模型的联邦变体FEDMIChoquet-LASSO却始终表现出更好的性能。这些发现表明,FEDMIChoquet-LASSO为多地点纵向试验的结果预测提供了可靠的解决方案,解决了缺失值、非线性关系和隐私约束等挑战,同时在联邦学习框架内保持了良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书