{"title":"Blood smear imagery dataset for malaria parasite detection: A case of Tanzania.","authors":"Beston Lufyagila, Bonny Mgawe, Anael Sam","doi":"10.1016/j.dib.2024.111169","DOIUrl":null,"url":null,"abstract":"<p><p>Malaria is a major public health issue in many regions of Africa, including Tanzania. The Tanzania Malaria National Strategic Plan (2021-2025) emphasizes on high-quality testing services availability, high coverage of timely diagnosis of malaria, and availability of innovative diagnostic systems for effective detection, treatment and control of malaria. This would be achieved by employing state of the art technologies like Machine learning. However, Machine learning requires diverse dataset to work effectively and efficiently. Therefore, this paper presents blood smear imagery dataset that can be used by researchers to develop computer vision systems for malaria parasite detection. The imagery dataset were acquired by setting up a 40X-2500X Real 4 K compound microscope with a 4k SONY IMX334 sensor camera mounted to it in five health centres of Tanga region. Blood samples taken according to normal routine of diagnosing patients in health care, were stained using Giemsa reagent and examined under microscope. Following these procedures, the study collected and annotated Thick infected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1139</mn> <mo>)</mo></mrow> </math> ; Thick uninfected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1071</mn></mrow> </math> ); Thin uninfected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>270</mn></mrow> </math> ); and Thin infected blood smear images ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1064</mn></mrow> </math> ). Furthermore, the curated dataset have been uploaded in a public Harvard data verse repository. In summary, the dataset aims to support the creation of diagnostic tools that improve malaria detection, thereby advancing health outcomes and aiding malaria control initiatives in Tanzania and other regions impacted by the disease.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111169"},"PeriodicalIF":1.0000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648091/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.dib.2024.111169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Malaria is a major public health issue in many regions of Africa, including Tanzania. The Tanzania Malaria National Strategic Plan (2021-2025) emphasizes on high-quality testing services availability, high coverage of timely diagnosis of malaria, and availability of innovative diagnostic systems for effective detection, treatment and control of malaria. This would be achieved by employing state of the art technologies like Machine learning. However, Machine learning requires diverse dataset to work effectively and efficiently. Therefore, this paper presents blood smear imagery dataset that can be used by researchers to develop computer vision systems for malaria parasite detection. The imagery dataset were acquired by setting up a 40X-2500X Real 4 K compound microscope with a 4k SONY IMX334 sensor camera mounted to it in five health centres of Tanga region. Blood samples taken according to normal routine of diagnosing patients in health care, were stained using Giemsa reagent and examined under microscope. Following these procedures, the study collected and annotated Thick infected blood smear images ( ; Thick uninfected blood smear images ( ); Thin uninfected blood smear images ( ); and Thin infected blood smear images ( ). Furthermore, the curated dataset have been uploaded in a public Harvard data verse repository. In summary, the dataset aims to support the creation of diagnostic tools that improve malaria detection, thereby advancing health outcomes and aiding malaria control initiatives in Tanzania and other regions impacted by the disease.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.