Blood smear imagery dataset for malaria parasite detection: A case of Tanzania.

IF 1 Q3 MULTIDISCIPLINARY SCIENCES
Data in Brief Pub Date : 2024-11-23 eCollection Date: 2024-12-01 DOI:10.1016/j.dib.2024.111169
Beston Lufyagila, Bonny Mgawe, Anael Sam
{"title":"Blood smear imagery dataset for malaria parasite detection: A case of Tanzania.","authors":"Beston Lufyagila, Bonny Mgawe, Anael Sam","doi":"10.1016/j.dib.2024.111169","DOIUrl":null,"url":null,"abstract":"<p><p>Malaria is a major public health issue in many regions of Africa, including Tanzania. The Tanzania Malaria National Strategic Plan (2021-2025) emphasizes on high-quality testing services availability, high coverage of timely diagnosis of malaria, and availability of innovative diagnostic systems for effective detection, treatment and control of malaria. This would be achieved by employing state of the art technologies like Machine learning. However, Machine learning requires diverse dataset to work effectively and efficiently. Therefore, this paper presents blood smear imagery dataset that can be used by researchers to develop computer vision systems for malaria parasite detection. The imagery dataset were acquired by setting up a 40X-2500X Real 4 K compound microscope with a 4k SONY IMX334 sensor camera mounted to it in five health centres of Tanga region. Blood samples taken according to normal routine of diagnosing patients in health care, were stained using Giemsa reagent and examined under microscope. Following these procedures, the study collected and annotated Thick infected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1139</mn> <mo>)</mo></mrow> </math> ; Thick uninfected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1071</mn></mrow> </math> ); Thin uninfected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>270</mn></mrow> </math> ); and Thin infected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1064</mn></mrow> </math> ). Furthermore, the curated dataset have been uploaded in a public Harvard data verse repository. In summary, the dataset aims to support the creation of diagnostic tools that improve malaria detection, thereby advancing health outcomes and aiding malaria control initiatives in Tanzania and other regions impacted by the disease.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111169"},"PeriodicalIF":1.0000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648091/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.dib.2024.111169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Malaria is a major public health issue in many regions of Africa, including Tanzania. The Tanzania Malaria National Strategic Plan (2021-2025) emphasizes on high-quality testing services availability, high coverage of timely diagnosis of malaria, and availability of innovative diagnostic systems for effective detection, treatment and control of malaria. This would be achieved by employing state of the art technologies like Machine learning. However, Machine learning requires diverse dataset to work effectively and efficiently. Therefore, this paper presents blood smear imagery dataset that can be used by researchers to develop computer vision systems for malaria parasite detection. The imagery dataset were acquired by setting up a 40X-2500X Real 4 K compound microscope with a 4k SONY IMX334 sensor camera mounted to it in five health centres of Tanga region. Blood samples taken according to normal routine of diagnosing patients in health care, were stained using Giemsa reagent and examined under microscope. Following these procedures, the study collected and annotated Thick infected blood smear images ( n = 1139 ) ; Thick uninfected blood smear images ( n = 1071 ); Thin uninfected blood smear images ( n = 270 ); and Thin infected blood smear images ( n = 1064 ). Furthermore, the curated dataset have been uploaded in a public Harvard data verse repository. In summary, the dataset aims to support the creation of diagnostic tools that improve malaria detection, thereby advancing health outcomes and aiding malaria control initiatives in Tanzania and other regions impacted by the disease.

用于疟疾寄生虫检测的血液涂片图像数据集:坦桑尼亚一例。
在包括坦桑尼亚在内的非洲许多地区,疟疾是一个重大的公共卫生问题。坦桑尼亚疟疾国家战略计划(2021-2025年)强调提供高质量的检测服务,及时诊断疟疾的高覆盖率,以及提供有效发现、治疗和控制疟疾的创新诊断系统。这将通过采用机器学习等最先进的技术来实现。然而,机器学习需要不同的数据集才能有效地工作。因此,本文提出了血液涂片图像数据集,可用于研究人员开发用于疟疾寄生虫检测的计算机视觉系统。图像数据集是通过在Tanga地区的五个卫生中心安装一台40X-2500X Real 4 K复合显微镜和一台4k索尼IMX334传感器相机来获取的。对卫生保健诊断患者按常规采血,用吉姆萨试剂染色,显微镜下观察。按照这些程序,研究收集并注释了感染的厚血涂片图像(n = 1139);厚的未感染血涂片图像(n = 1071);未感染薄血涂片图像(n = 270);感染血涂片薄片(n = 1064)。此外,管理的数据集已经上传到公共哈佛数据诗歌存储库中。总而言之,该数据集旨在支持创建诊断工具,以改进疟疾检测,从而促进健康成果,并协助坦桑尼亚和其他受该疾病影响地区的疟疾控制举措。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信