R. Al-Dhaibani, Mohammed A. Bamatraf, Khalid.Q. Sha'Afal
{"title":"Data Benchmark Collection of Patients with Malaria for Machine Learning: a study in Hadhramout- Yemen","authors":"R. Al-Dhaibani, Mohammed A. Bamatraf, Khalid.Q. Sha'Afal","doi":"10.1109/ICOICE48418.2019.9035166","DOIUrl":null,"url":null,"abstract":"Studies on data collection of health and medicine are lacking; especially in endemic diseases. This study aimed to create and publish a data benchmark and make it available for researchers. Malaria is one of the endemic and infectious diseases consistently in many countries. must attention. Such diseases need high attention to diagnose, predict, and control by physicians and stakeholders. The initial step of data mining and data analysis is data collection to draw knowledge of this data and speed in the elicitation information. Using data mining techniques can enable one to build predictive models. About a thousand cases of finally diagnosed malaria patients have been collected under the direct supervision of specialized doctors and preprocessed for future use. We reported the percentage of patients with data recorded for 40 attributes and selected 27 signs and symptoms. Data quality was assessed based on statistical scores. Later, Basic analysis is presented in this paper too, employing common statistics metrics and data mining as well. This study uses the SPSS tool for statistical analysis and WEKA as a tool for data mining.","PeriodicalId":109414,"journal":{"name":"2019 First International Conference of Intelligent Computing and Engineering (ICOICE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 First International Conference of Intelligent Computing and Engineering (ICOICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOICE48418.2019.9035166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Studies on data collection of health and medicine are lacking; especially in endemic diseases. This study aimed to create and publish a data benchmark and make it available for researchers. Malaria is one of the endemic and infectious diseases consistently in many countries. must attention. Such diseases need high attention to diagnose, predict, and control by physicians and stakeholders. The initial step of data mining and data analysis is data collection to draw knowledge of this data and speed in the elicitation information. Using data mining techniques can enable one to build predictive models. About a thousand cases of finally diagnosed malaria patients have been collected under the direct supervision of specialized doctors and preprocessed for future use. We reported the percentage of patients with data recorded for 40 attributes and selected 27 signs and symptoms. Data quality was assessed based on statistical scores. Later, Basic analysis is presented in this paper too, employing common statistics metrics and data mining as well. This study uses the SPSS tool for statistical analysis and WEKA as a tool for data mining.