{"title":"使用机器学习技术预测潜在的药物滥用者","authors":"Zhaoying Qiao, Tianrui Chai, Qinjing Zhang, Xinyi Zhou, Zhuoling Chu","doi":"10.1109/ICIIBMS46890.2019.8991550","DOIUrl":null,"url":null,"abstract":"Drug abuse is a noteworthy public health problem in the world. Recently, machine learning has become a favorable tool for classification problem. In this paper, we selected three machine learning algorithms (random forest (RF), Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) ) to predict potential abuse individuals of methamphetamine and amyl nitrite, two kinds of central stimulants, as well as users’ last consumption time based on personality traits and demographic information. Compared with k-NearestNeighbor, a widely-used classification algorithm, RF, XGBoost and LightGBM have superior performance, with LightGBM being the most efficient one in predicting potential user and estimating usage time. The results of feature importance indicate neuroticism is the most important predictor of drug abuse risk.","PeriodicalId":444797,"journal":{"name":"2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Predicting potential drug abusers using machine learning techniques\",\"authors\":\"Zhaoying Qiao, Tianrui Chai, Qinjing Zhang, Xinyi Zhou, Zhuoling Chu\",\"doi\":\"10.1109/ICIIBMS46890.2019.8991550\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Drug abuse is a noteworthy public health problem in the world. Recently, machine learning has become a favorable tool for classification problem. In this paper, we selected three machine learning algorithms (random forest (RF), Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) ) to predict potential abuse individuals of methamphetamine and amyl nitrite, two kinds of central stimulants, as well as users’ last consumption time based on personality traits and demographic information. Compared with k-NearestNeighbor, a widely-used classification algorithm, RF, XGBoost and LightGBM have superior performance, with LightGBM being the most efficient one in predicting potential user and estimating usage time. The results of feature importance indicate neuroticism is the most important predictor of drug abuse risk.\",\"PeriodicalId\":444797,\"journal\":{\"name\":\"2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIBMS46890.2019.8991550\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIBMS46890.2019.8991550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting potential drug abusers using machine learning techniques
Drug abuse is a noteworthy public health problem in the world. Recently, machine learning has become a favorable tool for classification problem. In this paper, we selected three machine learning algorithms (random forest (RF), Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) ) to predict potential abuse individuals of methamphetamine and amyl nitrite, two kinds of central stimulants, as well as users’ last consumption time based on personality traits and demographic information. Compared with k-NearestNeighbor, a widely-used classification algorithm, RF, XGBoost and LightGBM have superior performance, with LightGBM being the most efficient one in predicting potential user and estimating usage time. The results of feature importance indicate neuroticism is the most important predictor of drug abuse risk.