Tsung-Chien Lu , Chih-Chuan Lin , Te-I Weng , Fan-Ya Chou , Cheng- Chung Fang , TEDAS Research Group
{"title":"A web-based tool utilizing machine learning algorithms for predicting illicit drug use in emergency departments","authors":"Tsung-Chien Lu , Chih-Chuan Lin , Te-I Weng , Fan-Ya Chou , Cheng- Chung Fang , TEDAS Research Group","doi":"10.1016/j.ijmedinf.2025.106031","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Identifying illicit drug use through urine testing is time-consuming in the era of new psychoactive substances. This study aimed to develop a machine learning (ML) prediction model for early identification of illicit drug use in suspected emergency department (ED) patients.</div></div><div><h3>Methods</h3><div>Data from the Taiwan Emergency Department Drug Abuse Surveillance (TEDAS) database (2020–2023) was used. Six feature categories—demographics, triage data, referral source, symptoms, physical findings, and clinical characteristics—were included. The primary outcome was positive urine results for illicit drugs, confirmed by liquid chromatography-tandem mass spectrometry. Data were divided chronologically into training/validation and testing sets. Three supervised ML algorithms, including random forest, CatBoost, and light gradient boosting machine, were tested using K-fold cross-validation, and performance was evaluated by the area under the receiver operating characteristic curve (AUC) in the test set.</div></div><div><h3>Results</h3><div>The analysis included 13,615 urine test results from ED cases, identifying 3,185 positive cases (23.4%). A total of 9,529 cases (2020–2022) formed the training/validation cohort, and 4,086 (2023) were used for testing. Twenty features were used to construct the prediction model. The CatBoost classifier performed best, achieving an AUC of 0.846 (95% confidence interval [CI]: 0.831–0.859) in the testing cohort. A web-based tool and mobile apps were implemented to assist emergency physicians in predicting illicit drug use.</div></div><div><h3>Conclusions</h3><div>The machine learning model effectively predicts illicit drug use in ED patients and has been successfully implemented for free access. Further analysis is needed to assess post-implementation performance and its potential for use in other countries.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"203 ","pages":"Article 106031"},"PeriodicalIF":4.1000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625002485","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Identifying illicit drug use through urine testing is time-consuming in the era of new psychoactive substances. This study aimed to develop a machine learning (ML) prediction model for early identification of illicit drug use in suspected emergency department (ED) patients.
Methods
Data from the Taiwan Emergency Department Drug Abuse Surveillance (TEDAS) database (2020–2023) was used. Six feature categories—demographics, triage data, referral source, symptoms, physical findings, and clinical characteristics—were included. The primary outcome was positive urine results for illicit drugs, confirmed by liquid chromatography-tandem mass spectrometry. Data were divided chronologically into training/validation and testing sets. Three supervised ML algorithms, including random forest, CatBoost, and light gradient boosting machine, were tested using K-fold cross-validation, and performance was evaluated by the area under the receiver operating characteristic curve (AUC) in the test set.
Results
The analysis included 13,615 urine test results from ED cases, identifying 3,185 positive cases (23.4%). A total of 9,529 cases (2020–2022) formed the training/validation cohort, and 4,086 (2023) were used for testing. Twenty features were used to construct the prediction model. The CatBoost classifier performed best, achieving an AUC of 0.846 (95% confidence interval [CI]: 0.831–0.859) in the testing cohort. A web-based tool and mobile apps were implemented to assist emergency physicians in predicting illicit drug use.
Conclusions
The machine learning model effectively predicts illicit drug use in ED patients and has been successfully implemented for free access. Further analysis is needed to assess post-implementation performance and its potential for use in other countries.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.