评估尼日利亚各地区假冒抗疟药流行情况的机器学习模型

Oluwole Adegoke Nuga, Anuoluwapo Adigun, Emmanuel Shobanke, A. Abdulhamid
{"title":"评估尼日利亚各地区假冒抗疟药流行情况的机器学习模型","authors":"Oluwole Adegoke Nuga, Anuoluwapo Adigun, Emmanuel Shobanke, A. Abdulhamid","doi":"10.23958/ijirms/vol09-i05/1877","DOIUrl":null,"url":null,"abstract":"Introduction: Antimalarial is listed among the most common type of live saving medicines that are counterfeited. In Nigeria counterfeited antimalarial continue to pose a great threat to the health of the citizens and there is the need to assess its incidence within the country’s six zones. This study assessed the prevalence of counterfeited antimalarial within the six geographical zones of Nigeria and the impact of zones on counterfeiting using a machine learning model for classification. Methodology: Secondary data on 2442 antimalarial collected from all the states in Nigeria were grouped based on geographical zones. The medicines were tested for originality using the gold approach for detection of counterfeit medicine; the Standard Scientific Laboratory (SSL) Data was separated to 70% training and 30% testing and 10- fold Cross Validation (CV) was performed. The training set was used to derive the models while the test set was used to evaluate the performance of the models. Three varieties of the training data were generated using the Synthetic Minority Oversampling Technique (SMOTE). This was done to ensure a more accurate prediction and a better model performance. Binary Logistic Regression (BLR) models were thereafter fitted to the training data and the three varieties of its resampling. The four models namely M1, M2, M3 and M4 were fitted with data containing 33%, 40%, 45% and 50% of the counterfeited antimalarial class respectively. The performance of the four fitted models were assessed with metrics like sensitivity, specificity and model accuracy. Results: The results showed that there is a higher incidence of counterfeited antimalarial in the north-east and south-east zones than in the other four zones of Nigeria. The work also revealed model accuracies of 67%, 65.8%, 65.8% and 56,8% for M1, M2, M3 and M4 respectively. M1 was biased as it did not correctly predict any counterfeited antimalarial. M2 and M3 performed better than M4 in terms of model accuracy and specificity while M4 performed better only in terms of model sensitivity. Conclusion: Overall, only 66% of antimalarial was correctly predicted by the best performing model. This suggest that zone alone is not adequate to classify or predict originality/counterfeiting status of Antimalarial in Nigeria.","PeriodicalId":94374,"journal":{"name":"International journal of innovative research in medical science","volume":" 88","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Machine Learning Model for Assessing the Prevalence of Counterfeit Antimalarial in Geographical Zones of Nigeria\",\"authors\":\"Oluwole Adegoke Nuga, Anuoluwapo Adigun, Emmanuel Shobanke, A. Abdulhamid\",\"doi\":\"10.23958/ijirms/vol09-i05/1877\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Antimalarial is listed among the most common type of live saving medicines that are counterfeited. In Nigeria counterfeited antimalarial continue to pose a great threat to the health of the citizens and there is the need to assess its incidence within the country’s six zones. This study assessed the prevalence of counterfeited antimalarial within the six geographical zones of Nigeria and the impact of zones on counterfeiting using a machine learning model for classification. Methodology: Secondary data on 2442 antimalarial collected from all the states in Nigeria were grouped based on geographical zones. The medicines were tested for originality using the gold approach for detection of counterfeit medicine; the Standard Scientific Laboratory (SSL) Data was separated to 70% training and 30% testing and 10- fold Cross Validation (CV) was performed. The training set was used to derive the models while the test set was used to evaluate the performance of the models. Three varieties of the training data were generated using the Synthetic Minority Oversampling Technique (SMOTE). This was done to ensure a more accurate prediction and a better model performance. Binary Logistic Regression (BLR) models were thereafter fitted to the training data and the three varieties of its resampling. The four models namely M1, M2, M3 and M4 were fitted with data containing 33%, 40%, 45% and 50% of the counterfeited antimalarial class respectively. The performance of the four fitted models were assessed with metrics like sensitivity, specificity and model accuracy. Results: The results showed that there is a higher incidence of counterfeited antimalarial in the north-east and south-east zones than in the other four zones of Nigeria. The work also revealed model accuracies of 67%, 65.8%, 65.8% and 56,8% for M1, M2, M3 and M4 respectively. M1 was biased as it did not correctly predict any counterfeited antimalarial. M2 and M3 performed better than M4 in terms of model accuracy and specificity while M4 performed better only in terms of model sensitivity. Conclusion: Overall, only 66% of antimalarial was correctly predicted by the best performing model. This suggest that zone alone is not adequate to classify or predict originality/counterfeiting status of Antimalarial in Nigeria.\",\"PeriodicalId\":94374,\"journal\":{\"name\":\"International journal of innovative research in medical science\",\"volume\":\" 88\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of innovative research in medical science\",\"FirstCategoryId\":\"0\",\"ListUrlMain\":\"https://doi.org/10.23958/ijirms/vol09-i05/1877\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of innovative research in medical science","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.23958/ijirms/vol09-i05/1877","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

导言:抗疟药是最常见的假冒救命药之一。在尼日利亚,假冒抗疟药继续对公民的健康构成巨大威胁,因此有必要对全国六个地区的假冒抗疟药发生率进行评估。本研究使用机器学习分类模型评估了尼日利亚六个地理区域内假冒抗疟药的流行情况以及各区域对假冒行为的影响。研究方法:根据地理区域对从尼日利亚各州收集的 2442 种抗疟药物的二手数据进行分组。使用检测假药的黄金方法对药品的原创性进行检测;将标准科学实验室(SSL)数据分成 70% 的训练集和 30% 的测试集,并进行 10 倍交叉验证(CV)。训练集用于推导模型,测试集用于评估模型的性能。使用合成少数群体过度取样技术(SMOTE)生成了三种训练数据。这样做是为了确保更准确的预测和更好的模型性能。随后,二元逻辑回归(BLR)模型被拟合到训练数据及其重新取样的三个变量中。四个模型,即 M1、M2、M3 和 M4 分别与含有 33%、40%、45% 和 50%伪造抗疟药类的数据进行了拟合。用灵敏度、特异性和模型准确性等指标评估了四个拟合模型的性能。结果显示结果表明,尼日利亚东北部和东南部地区假冒抗疟药的发生率高于其他四个地区。工作还显示,M1、M2、M3 和 M4 的模型准确率分别为 67%、65.8%、65.8% 和 56.8%。M1 有偏差,因为它没有正确预测任何假冒抗疟药。M2 和 M3 在模型准确性和特异性方面均优于 M4,而 M4 仅在模型灵敏度方面表现较好。结论总体而言,性能最好的模型只能正确预测 66% 的抗疟药。这表明,仅凭区域不足以对尼日利亚抗疟药的原产/伪造状况进行分类或预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Machine Learning Model for Assessing the Prevalence of Counterfeit Antimalarial in Geographical Zones of Nigeria
Introduction: Antimalarial is listed among the most common type of live saving medicines that are counterfeited. In Nigeria counterfeited antimalarial continue to pose a great threat to the health of the citizens and there is the need to assess its incidence within the country’s six zones. This study assessed the prevalence of counterfeited antimalarial within the six geographical zones of Nigeria and the impact of zones on counterfeiting using a machine learning model for classification. Methodology: Secondary data on 2442 antimalarial collected from all the states in Nigeria were grouped based on geographical zones. The medicines were tested for originality using the gold approach for detection of counterfeit medicine; the Standard Scientific Laboratory (SSL) Data was separated to 70% training and 30% testing and 10- fold Cross Validation (CV) was performed. The training set was used to derive the models while the test set was used to evaluate the performance of the models. Three varieties of the training data were generated using the Synthetic Minority Oversampling Technique (SMOTE). This was done to ensure a more accurate prediction and a better model performance. Binary Logistic Regression (BLR) models were thereafter fitted to the training data and the three varieties of its resampling. The four models namely M1, M2, M3 and M4 were fitted with data containing 33%, 40%, 45% and 50% of the counterfeited antimalarial class respectively. The performance of the four fitted models were assessed with metrics like sensitivity, specificity and model accuracy. Results: The results showed that there is a higher incidence of counterfeited antimalarial in the north-east and south-east zones than in the other four zones of Nigeria. The work also revealed model accuracies of 67%, 65.8%, 65.8% and 56,8% for M1, M2, M3 and M4 respectively. M1 was biased as it did not correctly predict any counterfeited antimalarial. M2 and M3 performed better than M4 in terms of model accuracy and specificity while M4 performed better only in terms of model sensitivity. Conclusion: Overall, only 66% of antimalarial was correctly predicted by the best performing model. This suggest that zone alone is not adequate to classify or predict originality/counterfeiting status of Antimalarial in Nigeria.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信