A machine-learning approach for predicting success in smoking cessation intervention

Khishigsuren Davagdorj, Jong Seol Lee, K. Park, K. Ryu
{"title":"A machine-learning approach for predicting success in smoking cessation intervention","authors":"Khishigsuren Davagdorj, Jong Seol Lee, K. Park, K. Ryu","doi":"10.1109/ICAwST.2019.8923252","DOIUrl":null,"url":null,"abstract":"Smoking is one of the significant avoidable risk factors for premature death. Most smokers make multiple quit attempts during their lifetime but smoking dependence is not easy and many people eventually failed in smoking quit. Therefore, predicting the likelihood of success in smoking cessation intervention is necessary for public health.In this paper, we analyzed the smoking cessation intervention dataset conducted from the Korea National Health and Nutrition Examination Survey (KNHANES) 2009 to 2017. Accordingly, the chi-square test used to filter relevant and significant features, thus the multivariate analysis was used with logistic regression. In essence, age, education, and frequent alcohol use are important predictors in smoking cessation success. Furthermore, the lowest level of subjective health status has increased the likelihood of unsuccessful smoking cessation.In terms of the class imbalance problem, we have employed an efficient Synthetic Minority Over-sampling Technique (SMOTE) in order to generate new synthetic records. In the current study, we compared the SMOTE regular and borderline-1 techniques with 3, 5 and 7 number of nearest neighbors, respectively. Subsequently, we evaluate the success prediction model of smoking intervention using Naïve Bayes (NB), logistic regression (LR), multilayer perceptron neural network (MLPNN), random forest (RF) and gradient boosting trees (GBT) classifiers, as well as classifier performance has evaluated by precision, recall and F-measure.Our result demonstrated that NB with SMOTE borderline-1 (K=5) outperformed the precision of 0.8701. Meanwhile, RF with SMOTE borderline-1 (K=5) performed of 0.8766 and F-score of 0.8476. On the contrary, However, LR presents the lowest F-score as SMOTE regular (K=3) of 0.6726 and borderline (K=3) of 0.6700 in experimental comparison result.In addition, a combination of statistical and machine learning techniques is supposed to be helpful tools in the decisions of smoking cessation intervention and public health domain.","PeriodicalId":156538,"journal":{"name":"2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAwST.2019.8923252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Smoking is one of the significant avoidable risk factors for premature death. Most smokers make multiple quit attempts during their lifetime but smoking dependence is not easy and many people eventually failed in smoking quit. Therefore, predicting the likelihood of success in smoking cessation intervention is necessary for public health.In this paper, we analyzed the smoking cessation intervention dataset conducted from the Korea National Health and Nutrition Examination Survey (KNHANES) 2009 to 2017. Accordingly, the chi-square test used to filter relevant and significant features, thus the multivariate analysis was used with logistic regression. In essence, age, education, and frequent alcohol use are important predictors in smoking cessation success. Furthermore, the lowest level of subjective health status has increased the likelihood of unsuccessful smoking cessation.In terms of the class imbalance problem, we have employed an efficient Synthetic Minority Over-sampling Technique (SMOTE) in order to generate new synthetic records. In the current study, we compared the SMOTE regular and borderline-1 techniques with 3, 5 and 7 number of nearest neighbors, respectively. Subsequently, we evaluate the success prediction model of smoking intervention using Naïve Bayes (NB), logistic regression (LR), multilayer perceptron neural network (MLPNN), random forest (RF) and gradient boosting trees (GBT) classifiers, as well as classifier performance has evaluated by precision, recall and F-measure.Our result demonstrated that NB with SMOTE borderline-1 (K=5) outperformed the precision of 0.8701. Meanwhile, RF with SMOTE borderline-1 (K=5) performed of 0.8766 and F-score of 0.8476. On the contrary, However, LR presents the lowest F-score as SMOTE regular (K=3) of 0.6726 and borderline (K=3) of 0.6700 in experimental comparison result.In addition, a combination of statistical and machine learning techniques is supposed to be helpful tools in the decisions of smoking cessation intervention and public health domain.
预测戒烟干预成功的机器学习方法
吸烟是导致过早死亡的重要可避免风险因素之一。大多数吸烟者在一生中多次尝试戒烟,但吸烟依赖并不容易,许多人最终戒烟失败。因此,预测戒烟干预成功的可能性对公共卫生是必要的。在本文中,我们分析了2009年至2017年韩国国家健康与营养调查(KNHANES)的戒烟干预数据集。因此,使用卡方检验来过滤相关和显著特征,因此使用逻辑回归进行多变量分析。从本质上讲,年龄、教育程度和频繁饮酒是戒烟成功的重要预测因素。此外,最低水平的主观健康状况增加了戒烟不成功的可能性。在类不平衡问题方面,我们采用了一种高效的合成少数派过采样技术(SMOTE)来生成新的合成记录。在本研究中,我们将SMOTE正则和borderline-1技术分别与3、5和7个最近邻进行了比较。随后,我们使用Naïve贝叶斯(NB)、逻辑回归(LR)、多层感知器神经网络(MLPNN)、随机森林(RF)和梯度增强树(GBT)分类器评估吸烟干预的成功预测模型,并通过精度、召回率和F-measure来评估分类器的性能。我们的结果表明,具有SMOTE边界-1 (K=5)的NB优于0.8701的精度。同时,SMOTE borderline-1 (K=5)的RF得分为0.8766,f得分为0.8476。但在实验对比结果中,LR作为SMOTE正则(K=3)为0.6726,边界(K=3)为0.6700,f值最低。此外,统计和机器学习技术的结合应该是戒烟干预和公共卫生领域决策的有用工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信