Davide Marengo, Francesco Quilghini, Michele Settanni
{"title":"Leveraging social media and large language models for scalable alcohol risk assessment: Examining validity with AUDIT-C and post recency effects","authors":"Davide Marengo, Francesco Quilghini, Michele Settanni","doi":"10.1016/j.addbeh.2025.108375","DOIUrl":null,"url":null,"abstract":"<div><div>Risky alcohol consumption is a major public health concern, yet significant barriers exist to effective screening. The present study examines the potential of Large Language Models (LLMs) to infer risky alcohol use from social media text. The unobtrusive nature of this approach could provide a more scalable way to assess alcohol risk in large populations. To this aim, we analyzed Facebook status updates from 208 adults from Italy (mean age = 26.8, 70.7 % female) who also completed the Alcohol Use Disorders Identification Test-Consumption (AUDIT-C), a brief validated self-report measure of risky drinking. Two state-of-the-art LLMs, Gemini 1.5 Pro and GPT-4o, were used to assess alcohol risk and to quantify alcohol references. Results demonstrated strong inter-model agreement between risk inferences (ρ = 0.572, p < 0.001). LLM-inferred risk scores showed moderate correlations with AUDIT-C scores (Gemini 1.5 Pro: ρ = 0.344, p < 0.001; GPT-4o: ρ = 0.375, p < 0.001; Average: ρ = 0.405, p < 0.001). These correlations were significantly stronger among participants with recent posts (Average risk score: ρ = 0.500, p < 0.001) than among those without (ρ = 0.294, p = 0.008). The strongest correlation was observed between average LLM-inferred risk scores and AUDIT-C in the recent posts group (disattenuated ρ = 0.606). These findings suggest that LLMs offer a promising tool for identifying risky alcohol use when analyzing recent social media activity. Their accuracy is comparable to some traditional alcohol assessment methods, highlighting their potential to enhance early detection efforts. Limitations and future research directions are discussed.</div></div>","PeriodicalId":7155,"journal":{"name":"Addictive behaviors","volume":"168 ","pages":"Article 108375"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Addictive behaviors","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306460325001364","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Risky alcohol consumption is a major public health concern, yet significant barriers exist to effective screening. The present study examines the potential of Large Language Models (LLMs) to infer risky alcohol use from social media text. The unobtrusive nature of this approach could provide a more scalable way to assess alcohol risk in large populations. To this aim, we analyzed Facebook status updates from 208 adults from Italy (mean age = 26.8, 70.7 % female) who also completed the Alcohol Use Disorders Identification Test-Consumption (AUDIT-C), a brief validated self-report measure of risky drinking. Two state-of-the-art LLMs, Gemini 1.5 Pro and GPT-4o, were used to assess alcohol risk and to quantify alcohol references. Results demonstrated strong inter-model agreement between risk inferences (ρ = 0.572, p < 0.001). LLM-inferred risk scores showed moderate correlations with AUDIT-C scores (Gemini 1.5 Pro: ρ = 0.344, p < 0.001; GPT-4o: ρ = 0.375, p < 0.001; Average: ρ = 0.405, p < 0.001). These correlations were significantly stronger among participants with recent posts (Average risk score: ρ = 0.500, p < 0.001) than among those without (ρ = 0.294, p = 0.008). The strongest correlation was observed between average LLM-inferred risk scores and AUDIT-C in the recent posts group (disattenuated ρ = 0.606). These findings suggest that LLMs offer a promising tool for identifying risky alcohol use when analyzing recent social media activity. Their accuracy is comparable to some traditional alcohol assessment methods, highlighting their potential to enhance early detection efforts. Limitations and future research directions are discussed.
期刊介绍:
Addictive Behaviors is an international peer-reviewed journal publishing high quality human research on addictive behaviors and disorders since 1975. The journal accepts submissions of full-length papers and short communications on substance-related addictions such as the abuse of alcohol, drugs and nicotine, and behavioral addictions involving gambling and technology. We primarily publish behavioral and psychosocial research but our articles span the fields of psychology, sociology, psychiatry, epidemiology, social policy, medicine, pharmacology and neuroscience. While theoretical orientations are diverse, the emphasis of the journal is primarily empirical. That is, sound experimental design combined with valid, reliable assessment and evaluation procedures are a requisite for acceptance. However, innovative and empirically oriented case studies that might encourage new lines of inquiry are accepted as well. Studies that clearly contribute to current knowledge of etiology, prevention, social policy or treatment are given priority. Scholarly commentaries on topical issues, systematic reviews, and mini reviews are encouraged. We especially welcome multimedia papers that incorporate video or audio components to better display methodology or findings.
Studies can also be submitted to Addictive Behaviors? companion title, the open access journal Addictive Behaviors Reports, which has a particular interest in ''non-traditional'', innovative and empirically-oriented research such as negative/null data papers, replication studies, case reports on novel treatments, and cross-cultural research.