Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell
{"title":"当代青年危险饮酒的多变量机器学习预测","authors":"Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell","doi":"10.1111/add.70145","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and aims: </strong>Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.</p><p><strong>Design and setting: </strong>Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.</p><p><strong>Participants: </strong>A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).</p><p><strong>Measurements: </strong>Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).</p><p><strong>Findings: </strong>The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).</p><p><strong>Conclusions: </strong>An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.</p>","PeriodicalId":109,"journal":{"name":"Addiction","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multivariable machine learning prediction of risky alcohol use in contemporary youth.\",\"authors\":\"Lucinda Grummitt, Rachel Visontay, Philip Clare, Tim Slade, Louise Birrell\",\"doi\":\"10.1111/add.70145\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background and aims: </strong>Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.</p><p><strong>Design and setting: </strong>Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.</p><p><strong>Participants: </strong>A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).</p><p><strong>Measurements: </strong>Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).</p><p><strong>Findings: </strong>The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).</p><p><strong>Conclusions: </strong>An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.</p>\",\"PeriodicalId\":109,\"journal\":{\"name\":\"Addiction\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Addiction\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/add.70145\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Addiction","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/add.70145","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
Multivariable machine learning prediction of risky alcohol use in contemporary youth.
Background and aims: Risky alcohol use in young adulthood is a significant public health concern. Understanding the predictors of risky drinking during this period is essential for prevention. This study aimed to measure the predictive accuracy of ensemble machine learning and identify the most important predictors of risky alcohol use in early adulthood.
Design and setting: Secondary analysis of the Longitudinal Study of Australian Children, an Australian national longitudinal cohort study.
Participants: A total of 4983 children, aged 4-5 years in 2004 (Wave 1), followed up for eight waves (to age 18/19 in 2018).
Measurements: Risky alcohol use was measured at age 18 and defined as more than 10 standard drinks per week, as per Australian National guidelines. Predictors from multiple domains-sociodemographic, adolescent substance use, adolescent mental health and behaviours, parental mental health and substance use, school factors, peer influences, parenting practices and parental stress-were included, measured from Wave 1 to 7. The SuperLearner package in R was used to test a series of models [regularised regression (LASSO, ridge and elastic net), random forest and kernel support vector machine (SVM)] using nested 10-fold cross-validation to identify the overall predictive ability of the model (measured by area under the curve; AUC) and the most important predictors of risky alcohol use across childhood and adolescence. Predictor importance was derived by normalising algorithm-specific scores per fold, weighting them by SuperLearner coefficients and aggregating across folds to rank predictors by mean weighted importance on a scale of 0 to 1 (higher scores indicating greater importance).
Findings: The ensemble model showed good prediction on the test set, with an AUC of 0.792, a slight improvement over any single algorithm (AUC = 0.783 for the best performing individual algorithm). The most important predictors were weekly drinking at the previous wave (mean weighted importance 0.999), lifetime cannabis use (0.446), lifetime parent financial stress (0.420), identifying as female (0.365), identifying as male (0.344; compared with a reference category of gender diverse), lifetime attention deficit hyperactivity disorder (0.248), pre-natal alcohol exposure (0.248), housing insecurity (0.243), religious involvement (0.238) and parent alcohol use problems (0.215).
Conclusions: An ensemble learning approach appears to have good predictive ability of risky alcohol use among a contemporary cohort of young Australians. It underscores the complex interplay of individual, familial and social factors occurring across childhood and adolescence that influences risky alcohol use in early adulthood.
期刊介绍:
Addiction publishes peer-reviewed research reports on pharmacological and behavioural addictions, bringing together research conducted within many different disciplines.
Its goal is to serve international and interdisciplinary scientific and clinical communication, to strengthen links between science and policy, and to stimulate and enhance the quality of debate. We seek submissions that are not only technically competent but are also original and contain information or ideas of fresh interest to our international readership. We seek to serve low- and middle-income (LAMI) countries as well as more economically developed countries.
Addiction’s scope spans human experimental, epidemiological, social science, historical, clinical and policy research relating to addiction, primarily but not exclusively in the areas of psychoactive substance use and/or gambling. In addition to original research, the journal features editorials, commentaries, reviews, letters, and book reviews.