Fabrice Harel-Canada , Anabel Salimian , Brandon Moghanian , Sarah Clingan , Allan Nguyen , Tucker Avra , Michelle Poimboeuf , Ruby Romero , Arthur Funnell , Panayiotis Petousis , Michael Shin , Nanyun Peng , Chelsea L. Shover , David Goodman-Meza
{"title":"利用大语言模型加强临床记录中的物质使用检测。","authors":"Fabrice Harel-Canada , Anabel Salimian , Brandon Moghanian , Sarah Clingan , Allan Nguyen , Tucker Avra , Michelle Poimboeuf , Ruby Romero , Arthur Funnell , Panayiotis Petousis , Michael Shin , Nanyun Peng , Chelsea L. Shover , David Goodman-Meza","doi":"10.1016/j.drugalcdep.2025.112888","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying substance use behaviors in electronic health records (EHRs) is challenging because critical details are often buried in unstructured notes that use varied terminology and negation, requiring careful contextual interpretation to distinguish relevant use from historical mentions or denials. Using MIMIC-III/IV discharge summaries, we created a large, annotated drug detection dataset to tackle this problem and support future systemic substance use surveillance. We then investigated the performance of multiple large language models (LLMs) for detecting eight substance use categories within this data. Evaluating models in zero-shot, few-shot, and fine-tuning configurations, we found that a fine-tuned model, Llama-DrugDetector-70B, outperformed others. It achieved near-perfect F1-scores (<span><math><mrow><mo>≥</mo><mn>0</mn><mo>.</mo><mn>95</mn></mrow></math></span>) for most individual substances and strong scores for more complex tasks like prescription opioid misuse (F1=0.815) and polysubstance use (F1=0.917). These findings demonstrated that LLMs significantly enhance detection, showing promise for clinical decision support and research, although further work on scalability is warranted.</div></div>","PeriodicalId":11322,"journal":{"name":"Drug and alcohol dependence","volume":"276 ","pages":"Article 112888"},"PeriodicalIF":3.6000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing substance use detection in clinical notes with large language models\",\"authors\":\"Fabrice Harel-Canada , Anabel Salimian , Brandon Moghanian , Sarah Clingan , Allan Nguyen , Tucker Avra , Michelle Poimboeuf , Ruby Romero , Arthur Funnell , Panayiotis Petousis , Michael Shin , Nanyun Peng , Chelsea L. Shover , David Goodman-Meza\",\"doi\":\"10.1016/j.drugalcdep.2025.112888\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Identifying substance use behaviors in electronic health records (EHRs) is challenging because critical details are often buried in unstructured notes that use varied terminology and negation, requiring careful contextual interpretation to distinguish relevant use from historical mentions or denials. Using MIMIC-III/IV discharge summaries, we created a large, annotated drug detection dataset to tackle this problem and support future systemic substance use surveillance. We then investigated the performance of multiple large language models (LLMs) for detecting eight substance use categories within this data. Evaluating models in zero-shot, few-shot, and fine-tuning configurations, we found that a fine-tuned model, Llama-DrugDetector-70B, outperformed others. It achieved near-perfect F1-scores (<span><math><mrow><mo>≥</mo><mn>0</mn><mo>.</mo><mn>95</mn></mrow></math></span>) for most individual substances and strong scores for more complex tasks like prescription opioid misuse (F1=0.815) and polysubstance use (F1=0.917). These findings demonstrated that LLMs significantly enhance detection, showing promise for clinical decision support and research, although further work on scalability is warranted.</div></div>\",\"PeriodicalId\":11322,\"journal\":{\"name\":\"Drug and alcohol dependence\",\"volume\":\"276 \",\"pages\":\"Article 112888\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Drug and alcohol dependence\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0376871625003412\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug and alcohol dependence","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0376871625003412","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
Enhancing substance use detection in clinical notes with large language models
Identifying substance use behaviors in electronic health records (EHRs) is challenging because critical details are often buried in unstructured notes that use varied terminology and negation, requiring careful contextual interpretation to distinguish relevant use from historical mentions or denials. Using MIMIC-III/IV discharge summaries, we created a large, annotated drug detection dataset to tackle this problem and support future systemic substance use surveillance. We then investigated the performance of multiple large language models (LLMs) for detecting eight substance use categories within this data. Evaluating models in zero-shot, few-shot, and fine-tuning configurations, we found that a fine-tuned model, Llama-DrugDetector-70B, outperformed others. It achieved near-perfect F1-scores () for most individual substances and strong scores for more complex tasks like prescription opioid misuse (F1=0.815) and polysubstance use (F1=0.917). These findings demonstrated that LLMs significantly enhance detection, showing promise for clinical decision support and research, although further work on scalability is warranted.
期刊介绍:
Drug and Alcohol Dependence is an international journal devoted to publishing original research, scholarly reviews, commentaries, and policy analyses in the area of drug, alcohol and tobacco use and dependence. Articles range from studies of the chemistry of substances of abuse, their actions at molecular and cellular sites, in vitro and in vivo investigations of their biochemical, pharmacological and behavioural actions, laboratory-based and clinical research in humans, substance abuse treatment and prevention research, and studies employing methods from epidemiology, sociology, and economics.