当代青年危险饮酒的多变量机器学习预测

IF 5.3 1区 医学 Q1 PSYCHIATRY
Addiction Pub Date : 2025-07-16 DOI:10.1111/add.70145
Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell
{"title":"当代青年危险饮酒的多变量机器学习预测","authors":"Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell","doi":"10.1111/add.70145","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and aims: </strong>Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.</p><p><strong>Design and setting: </strong>Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.</p><p><strong>Participants: </strong>A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).</p><p><strong>Measurements: </strong>Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).</p><p><strong>Findings: </strong>The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).</p><p><strong>Conclusions: </strong>An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.</p>","PeriodicalId":109,"journal":{"name":"Addiction","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multivariable machine learning prediction of risky alcohol use in contemporary youth.\",\"authors\":\"Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell\",\"doi\":\"10.1111/add.70145\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background and aims: </strong>Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.</p><p><strong>Design and setting: </strong>Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.</p><p><strong>Participants: </strong>A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).</p><p><strong>Measurements: </strong>Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).</p><p><strong>Findings: </strong>The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).</p><p><strong>Conclusions: </strong>An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.</p>\",\"PeriodicalId\":109,\"journal\":{\"name\":\"Addiction\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Addiction\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/add.70145\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Addiction","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/add.70145","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

摘要

背景和目的:青年期危险饮酒是一个重大的公共卫生问题。了解这一时期危险饮酒的预测因素对于预防至关重要。本研究旨在测量集成机器学习的预测准确性,并确定成年早期危险饮酒的最重要预测因素。设计和背景:对澳大利亚儿童纵向研究的二次分析,这是一项澳大利亚国家纵向队列研究。参与者:共有4983名儿童,2004年4-5岁(第1波),随访8波(2018年至18/19岁)。测量方法:根据澳大利亚国家指南,在18岁时测量危险酒精使用情况,并将其定义为每周超过10次标准饮酒。包括来自多个领域的预测因子——社会人口统计学、青少年物质使用、青少年心理健康和行为、父母心理健康和物质使用、学校因素、同伴影响、父母做法和父母压力,从第1波到第7波进行测量。使用R中的SuperLearner包来测试一系列模型[正则化回归(LASSO, ridge和elastic net),随机森林和核支持向量机(SVM)],使用嵌套的10倍交叉验证来识别模型的整体预测能力(通过曲线下面积测量;AUC)和儿童期和青春期危险饮酒的最重要预测因子。预测器的重要性是通过标准化每层算法特定的分数,通过超级学习者系数对它们进行加权,并在0到1的范围内通过平均加权重要性对预测器进行汇总而得出的(分数越高表明重要性越高)。结果:集成模型在测试集上表现出良好的预测效果,AUC为0.792,比任何单一算法都略有提高(表现最好的单个算法的AUC = 0.783)。最重要的预测因素是前一波每周饮酒(平均加权重要性0.999),终生大麻使用(0.446),终生父母财务压力(0.420),女性(0.365),男性(0.344;终生注意缺陷多动障碍(0.248)、产前酒精暴露(0.248)、住房不安全(0.243)、宗教参与(0.238)和父母酒精使用问题(0.215)。结论:在当代澳大利亚青年队列中,集成学习方法似乎对危险酒精使用具有良好的预测能力。它强调了在童年和青春期发生的个人、家庭和社会因素的复杂相互作用,这些因素会影响成年早期的危险酒精使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multivariable machine learning prediction of risky alcohol use in contemporary youth.

Background and aims: Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.

Design and setting: Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.

Participants: A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).

Measurements: Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).

Findings: The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).

Conclusions: An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Addiction
Addiction 医学-精神病学
CiteScore
10.80
自引率
6.70%
发文量
319
审稿时长
3 months
期刊介绍: Addiction publishes peer-reviewed research reports on pharmacological and behavioural addictions, bringing together research conducted within many different disciplines. Its goal is to serve international and interdisciplinary scientific and clinical communication, to strengthen links between science and policy, and to stimulate and enhance the quality of debate. We seek submissions that are not only technically competent but are also original and contain information or ideas of fresh interest to our international readership. We seek to serve low- and middle-income (LAMI) countries as well as more economically developed countries. Addiction’s scope spans human experimental, epidemiological, social science, historical, clinical and policy research relating to addiction, primarily but not exclusively in the areas of psychoactive substance use and/or gambling. In addition to original research, the journal features editorials, commentaries, reviews, letters, and book reviews.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信