TREE-REGULARIZED BAYESIAN LATENT CLASS ANALYSIS FOR IMPROVING WEAKLY SEPARATED DIETARY PATTERN SUBTYPING IN SMALL-SIZED SUBPOPULATIONS.

IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY
Annals of Applied Statistics Pub Date : 2025-12-01 Epub Date: 2025-12-05 DOI:10.1214/25-aoas2067
By Mengbing Li, Briana Stephenson, Zhenke Wu
{"title":"TREE-REGULARIZED BAYESIAN LATENT CLASS ANALYSIS FOR IMPROVING WEAKLY SEPARATED DIETARY PATTERN SUBTYPING IN SMALL-SIZED SUBPOPULATIONS.","authors":"By Mengbing Li, Briana Stephenson, Zhenke Wu","doi":"10.1214/25-aoas2067","DOIUrl":null,"url":null,"abstract":"<p><p>Dietary patterns synthesize multiple related diet components, which can be used by nutrition researchers to examine diet-disease relationships. Latent class models (LCMs) have been used to derive dietary patterns from dietary intake assessment, where each class profile represents the probabilities of exposure to a set of diet components. However, LCM-derived dietary patterns can exhibit strong similarities, or weak separation, resulting in numerical and inferential instabilities that challenge scientific interpretation. This issue is exacerbated in small-sized subpopulations. To address these issues, we provide a simple solution that empowers LCMs to improve dietary pattern estimation. We develop a tree-regularized Bayesian LCM that shares statistical strength between dietary patterns to make better estimates using limited data. This is achieved via a Dirichlet diffusion tree process that specifies a prior distribution for the unknown tree over classes. Dietary patterns that share proximity to one another in the tree are shrunk toward ancestral dietary patterns a priori, with the degree of shrinkage varying across prespecified food groups. Using dietary intake data from the Hispanic Community Health Study/Study of Latinos, we apply the proposed approach to a sample of 496 U.S. adults of South American ethnic background to identify and compare dietary patterns.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 4","pages":"3003-3022"},"PeriodicalIF":1.4000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12867110/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Applied Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1214/25-aoas2067","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/12/5 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Dietary patterns synthesize multiple related diet components, which can be used by nutrition researchers to examine diet-disease relationships. Latent class models (LCMs) have been used to derive dietary patterns from dietary intake assessment, where each class profile represents the probabilities of exposure to a set of diet components. However, LCM-derived dietary patterns can exhibit strong similarities, or weak separation, resulting in numerical and inferential instabilities that challenge scientific interpretation. This issue is exacerbated in small-sized subpopulations. To address these issues, we provide a simple solution that empowers LCMs to improve dietary pattern estimation. We develop a tree-regularized Bayesian LCM that shares statistical strength between dietary patterns to make better estimates using limited data. This is achieved via a Dirichlet diffusion tree process that specifies a prior distribution for the unknown tree over classes. Dietary patterns that share proximity to one another in the tree are shrunk toward ancestral dietary patterns a priori, with the degree of shrinkage varying across prespecified food groups. Using dietary intake data from the Hispanic Community Health Study/Study of Latinos, we apply the proposed approach to a sample of 496 U.S. adults of South American ethnic background to identify and compare dietary patterns.

树正则化贝叶斯潜类分析改善小尺度亚群中弱分离饮食模式亚型。
饮食模式综合了多种相关的饮食成分,可以被营养研究人员用来研究饮食与疾病的关系。潜在类别模型(lcm)已被用于从饮食摄入评估中得出饮食模式,其中每个类别概况代表暴露于一组饮食成分的概率。然而,lcm衍生的饮食模式可能表现出强烈的相似性或弱分离性,导致数字和推断的不稳定性,挑战科学解释。这个问题在小型亚种群中更加严重。为了解决这些问题,我们提供了一个简单的解决方案,使lcm能够改进饮食模式估计。我们开发了一种树正则化贝叶斯LCM,它在饮食模式之间共享统计强度,以便使用有限的数据进行更好的估计。这是通过狄利克雷扩散树过程实现的,该过程指定了未知树在类上的先验分布。在树中彼此相近的饮食模式会先验地向祖先的饮食模式缩小,缩小的程度因预先指定的食物组而异。使用来自西班牙裔社区健康研究/拉丁裔研究的饮食摄入数据,我们将建议的方法应用于496名南美种族背景的美国成年人样本,以确定和比较饮食模式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Annals of Applied Statistics
Annals of Applied Statistics 社会科学-统计学与概率论
CiteScore
3.10
自引率
5.60%
发文量
131
审稿时长
6-12 weeks
期刊介绍: Statistical research spans an enormous range from direct subject-matter collaborations to pure mathematical theory. The Annals of Applied Statistics, the newest journal from the IMS, is aimed at papers in the applied half of this range. Published quarterly in both print and electronic form, our goal is to provide a timely and unified forum for all areas of applied statistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书