Joan Jonathan, Alcardo Alex Barakabitze, Cynthia D Fast, Christophe Cox
{"title":"用于肺结核检测预测的机器学习:训练非洲大袋鼠的案例研究。","authors":"Joan Jonathan, Alcardo Alex Barakabitze, Cynthia D Fast, Christophe Cox","doi":"10.2196/50771","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Technological advancement has led to the growth and rapid increase of tuberculosis (TB) medical data generated from different health care areas, including diagnosis. Prioritizing better adoption and acceptance of innovative diagnostic technology to reduce the spread of TB significantly benefits developing countries. Trained TB-detection rats are used in Tanzania and Ethiopia for operational research to complement other TB diagnostic tools. This technology has increased new TB case detection owing to its speed, cost-effectiveness, and sensitivity.</p><p><strong>Objective: </strong>During the TB detection process, rats produce vast amounts of data, providing an opportunity to identify interesting patterns that influence TB detection performance. This study aimed to develop models that predict if the rat will hit (indicate the presence of TB within) the sample or not using machine learning (ML) techniques. The goal was to improve the diagnostic accuracy and performance of TB detection involving rats.</p><p><strong>Methods: </strong>APOPO (Anti-Persoonsmijnen Ontmijnende Product Ontwikkeling) Center in Morogoro provided data for this study from 2012 to 2019, and 366,441 observations were used to build predictive models using ML techniques, including decision tree, random forest, naïve Bayes, support vector machine, and k-nearest neighbor, by incorporating a variety of variables, such as the diagnostic results from partner health clinics using methods endorsed by the World Health Organization (WHO).</p><p><strong>Results: </strong>The support vector machine technique yielded the highest accuracy of 83.39% for prediction compared to other ML techniques used. Furthermore, this study found that the inclusion of variables related to whether the sample contained TB or not increased the performance accuracy of the predictive model.</p><p><strong>Conclusions: </strong>The inclusion of variables related to the diagnostic results of TB samples may improve the detection performance of the trained rats. The study results may be of importance to TB-detection rat trainers and TB decision-makers as the results may prompt them to take action to maintain the usefulness of the technology and increase the TB detection performance of trained rats.</p>","PeriodicalId":74345,"journal":{"name":"Online journal of public health informatics","volume":"16 ","pages":"e50771"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11061786/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine Learning for Prediction of Tuberculosis Detection: Case Study of Trained African Giant Pouched Rats.\",\"authors\":\"Joan Jonathan, Alcardo Alex Barakabitze, Cynthia D Fast, Christophe Cox\",\"doi\":\"10.2196/50771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Technological advancement has led to the growth and rapid increase of tuberculosis (TB) medical data generated from different health care areas, including diagnosis. Prioritizing better adoption and acceptance of innovative diagnostic technology to reduce the spread of TB significantly benefits developing countries. Trained TB-detection rats are used in Tanzania and Ethiopia for operational research to complement other TB diagnostic tools. This technology has increased new TB case detection owing to its speed, cost-effectiveness, and sensitivity.</p><p><strong>Objective: </strong>During the TB detection process, rats produce vast amounts of data, providing an opportunity to identify interesting patterns that influence TB detection performance. This study aimed to develop models that predict if the rat will hit (indicate the presence of TB within) the sample or not using machine learning (ML) techniques. The goal was to improve the diagnostic accuracy and performance of TB detection involving rats.</p><p><strong>Methods: </strong>APOPO (Anti-Persoonsmijnen Ontmijnende Product Ontwikkeling) Center in Morogoro provided data for this study from 2012 to 2019, and 366,441 observations were used to build predictive models using ML techniques, including decision tree, random forest, naïve Bayes, support vector machine, and k-nearest neighbor, by incorporating a variety of variables, such as the diagnostic results from partner health clinics using methods endorsed by the World Health Organization (WHO).</p><p><strong>Results: </strong>The support vector machine technique yielded the highest accuracy of 83.39% for prediction compared to other ML techniques used. Furthermore, this study found that the inclusion of variables related to whether the sample contained TB or not increased the performance accuracy of the predictive model.</p><p><strong>Conclusions: </strong>The inclusion of variables related to the diagnostic results of TB samples may improve the detection performance of the trained rats. The study results may be of importance to TB-detection rat trainers and TB decision-makers as the results may prompt them to take action to maintain the usefulness of the technology and increase the TB detection performance of trained rats.</p>\",\"PeriodicalId\":74345,\"journal\":{\"name\":\"Online journal of public health informatics\",\"volume\":\"16 \",\"pages\":\"e50771\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11061786/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Online journal of public health informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/50771\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online journal of public health informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/50771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
背景:技术进步导致了结核病(TB)医疗数据的增长和迅速增加,这些数据来自不同的医疗保健领域,包括诊断。优先考虑更好地采用和接受创新诊断技术,以减少结核病的传播,这对发展中国家大有裨益。坦桑尼亚和埃塞俄比亚使用训练有素的结核病检测鼠进行业务研究,以补充其他结核病诊断工具。这项技术因其速度快、成本效益高和灵敏度高而提高了结核病新病例的检测率:在结核病检测过程中,老鼠会产生大量数据,这为找出影响结核病检测性能的有趣模式提供了机会。本研究旨在利用机器学习(ML)技术开发模型,以预测大鼠是否会击中样本(表明样本中存在结核病)。目的是提高大鼠结核病检测的诊断准确性和性能:莫罗戈罗的APOPO(Anti-Persoonsmijnen Ontmijnende Product Ontwikkeling)中心为这项研究提供了2012年至2019年的数据,366441个观测值被用于使用ML技术建立预测模型,包括决策树、随机森林、天真贝叶斯、支持向量机和k-近邻,并采用世界卫生组织(WHO)认可的方法纳入了各种变量,如合作伙伴卫生诊所的诊断结果:结果:与使用的其他 ML 技术相比,支持向量机技术的预测准确率最高,达到 83.39%。此外,本研究还发现,加入与样本是否含有结核病相关的变量可提高预测模型的准确性:结论:加入与肺结核样本诊断结果相关的变量可提高训练有素的大鼠的检测性能。研究结果可能对结核病检测鼠训练员和结核病决策者具有重要意义,因为研究结果可能促使他们采取行动,以保持该技术的实用性并提高训练鼠的结核病检测性能。
Machine Learning for Prediction of Tuberculosis Detection: Case Study of Trained African Giant Pouched Rats.
Background: Technological advancement has led to the growth and rapid increase of tuberculosis (TB) medical data generated from different health care areas, including diagnosis. Prioritizing better adoption and acceptance of innovative diagnostic technology to reduce the spread of TB significantly benefits developing countries. Trained TB-detection rats are used in Tanzania and Ethiopia for operational research to complement other TB diagnostic tools. This technology has increased new TB case detection owing to its speed, cost-effectiveness, and sensitivity.
Objective: During the TB detection process, rats produce vast amounts of data, providing an opportunity to identify interesting patterns that influence TB detection performance. This study aimed to develop models that predict if the rat will hit (indicate the presence of TB within) the sample or not using machine learning (ML) techniques. The goal was to improve the diagnostic accuracy and performance of TB detection involving rats.
Methods: APOPO (Anti-Persoonsmijnen Ontmijnende Product Ontwikkeling) Center in Morogoro provided data for this study from 2012 to 2019, and 366,441 observations were used to build predictive models using ML techniques, including decision tree, random forest, naïve Bayes, support vector machine, and k-nearest neighbor, by incorporating a variety of variables, such as the diagnostic results from partner health clinics using methods endorsed by the World Health Organization (WHO).
Results: The support vector machine technique yielded the highest accuracy of 83.39% for prediction compared to other ML techniques used. Furthermore, this study found that the inclusion of variables related to whether the sample contained TB or not increased the performance accuracy of the predictive model.
Conclusions: The inclusion of variables related to the diagnostic results of TB samples may improve the detection performance of the trained rats. The study results may be of importance to TB-detection rat trainers and TB decision-makers as the results may prompt them to take action to maintain the usefulness of the technology and increase the TB detection performance of trained rats.