用正则化连续时间隐马尔可夫模型识别多烟草使用的潜在状态转变模式。

IF 1.7 4区 数学 Q3 BIOLOGY
Biometrics Pub Date : 2025-10-08 DOI:10.1093/biomtc/ujaf138
Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou
{"title":"用正则化连续时间隐马尔可夫模型识别多烟草使用的潜在状态转变模式。","authors":"Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou","doi":"10.1093/biomtc/ujaf138","DOIUrl":null,"url":null,"abstract":"<p><p>Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use.\",\"authors\":\"Xinyu Yan, Ji-Hyun Lee, Xiang-Yang Lou\",\"doi\":\"10.1093/biomtc/ujaf138\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.</p>\",\"PeriodicalId\":8930,\"journal\":{\"name\":\"Biometrics\",\"volume\":\"81 4\",\"pages\":\"\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biometrics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1093/biomtc/ujaf138\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biometrics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomtc/ujaf138","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

隐马尔可夫模型(hmm)被广泛用于描述物质使用中的潜在状态转移模式。然而,传统HMM框架在处理由高维风险因素和变时间间隔引入的复杂性时,特别是在确定隐藏状态数量和选择状态转移参数变量方面表现不佳。为了解决烟草与健康人口评估(PATH)研究(一项具有全国代表性的烟草使用纵向队列研究)中的分析挑战,我们提出了一个带有正则化算法的连续时间HMM框架,以识别复杂多烟草使用转变背后的多维风险因素。我们对过渡协变量进行弹性网络正则化,以识别信息协变量,提高模型估计精度。关键协变量的包含可以准确地确定隐藏状态的数量。我们在整个建模框架中结合了调查权重和关于分层和聚类的信息。通过一系列的模拟,我们证明了我们的方法在确定状态数、识别信息协变量和估计模型参数方面的有效性。将提出的方法应用于适宜卫生技术方案数据分析,揭示了若干人口统计学、行为和社会心理因素,这些因素导致了青少年和年轻人在烟草使用状态之间的转变风险差异。该模型在确定潜在隐藏变量的高维风险因素方面的能力证实了它在加强公共卫生研究和为干预措施提供信息方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A regularized continuous-time hidden Markov model for identifying latent state transition patterns of poly-tobacco use.

Hidden Markov models (HMMs) are widely used to characterize latent state transition patterns in substance use. However, traditional HMM frameworks are incompetent when dealing with the complexities introduced by high-dimensional risk factors and varying time intervals, particularly in determining the number of hidden states and selecting variables for state transition parameters. To tackle the analytical challenges in the Population Assessment of Tobacco and Health (PATH) Study, a nationally representative longitudinal cohort study on tobacco use, we propose a continuous-time HMM framework with a regularization algorithm to identify multi-dimensional risk factors underlying complex poly-tobacco use transitions. We develop an elastic-net regularization on the transition covariates to identify informative covariates and improve model estimation accuracy. The inclusion of key covariates enables accurate determination of the number of hidden states. We incorporate survey weights and information on strata and clustering throughout the modeling framework. We demonstrate the validity of our approach in determining state numbers, identifying informative covariates, and estimating model parameters through a series of simulations. Application of the proposed approach to PATH data analysis revealed several demographic, behavioral, and psychosocial factors that contribute to the differential risks of transition between tobacco-use states among youth and young adults. The model's capacity in identifying high-dimensional risk factors for underlying hidden variables substantiates its potential for enhancing public health research and informing interventions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biometrics
Biometrics 生物-生物学
CiteScore
2.70
自引率
5.30%
发文量
178
审稿时长
4-8 weeks
期刊介绍: The International Biometric Society is an international society promoting the development and application of statistical and mathematical theory and methods in the biosciences, including agriculture, biomedical science and public health, ecology, environmental sciences, forestry, and allied disciplines. The Society welcomes as members statisticians, mathematicians, biological scientists, and others devoted to interdisciplinary efforts in advancing the collection and interpretation of information in the biosciences. The Society sponsors the biennial International Biometric Conference, held in sites throughout the world; through its National Groups and Regions, it also Society sponsors regional and local meetings.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信