Santiago Sinisterra-Sierra, Salvador Godoy-Calderón, Miriam Pescador-Rojas
{"title":"COVID-19 Data Analysis with a Multi-Objective Evolutionary Algorithm for Causal Association Rule Mining","authors":"Santiago Sinisterra-Sierra, Salvador Godoy-Calderón, Miriam Pescador-Rojas","doi":"10.3390/mca28010012","DOIUrl":null,"url":null,"abstract":"Association rule mining plays a crucial role in the medical area in discovering interesting relationships among the attributes of a data set. Traditional association rule mining algorithms such as Apriori, FP growth, or Eclat require considerable computational resources and generate large volumes of rules. Moreover, these techniques depend on user-defined thresholds which can inadvertently cause the algorithm to omit some interesting rules. In order to solve such challenges, we propose an evolutionary multi-objective algorithm based on NSGA-II to guide the mining process in a data set composed of 15.5 million records with official data describing the COVID-19 pandemic in Mexico. We tested different scenarios optimizing classical and causal estimation measures in four waves, defined as the periods of time where the number of people with COVID-19 increased. The proposed contributions generate, recombine, and evaluate patterns, focusing on recovering promising high-quality rules with actionable cause–effect relationships among the attributes to identify which groups are more susceptible to disease or what combinations of conditions are necessary to receive certain types of medical care.","PeriodicalId":53224,"journal":{"name":"Mathematical & Computational Applications","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2023-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical & Computational Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/mca28010012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1
Abstract
Association rule mining plays a crucial role in the medical area in discovering interesting relationships among the attributes of a data set. Traditional association rule mining algorithms such as Apriori, FP growth, or Eclat require considerable computational resources and generate large volumes of rules. Moreover, these techniques depend on user-defined thresholds which can inadvertently cause the algorithm to omit some interesting rules. In order to solve such challenges, we propose an evolutionary multi-objective algorithm based on NSGA-II to guide the mining process in a data set composed of 15.5 million records with official data describing the COVID-19 pandemic in Mexico. We tested different scenarios optimizing classical and causal estimation measures in four waves, defined as the periods of time where the number of people with COVID-19 increased. The proposed contributions generate, recombine, and evaluate patterns, focusing on recovering promising high-quality rules with actionable cause–effect relationships among the attributes to identify which groups are more susceptible to disease or what combinations of conditions are necessary to receive certain types of medical care.
期刊介绍:
Mathematical and Computational Applications (MCA) is devoted to original research in the field of engineering, natural sciences or social sciences where mathematical and/or computational techniques are necessary for solving specific problems. The aim of the journal is to provide a medium by which a wide range of experience can be exchanged among researchers from diverse fields such as engineering (electrical, mechanical, civil, industrial, aeronautical, nuclear etc.), natural sciences (physics, mathematics, chemistry, biology etc.) or social sciences (administrative sciences, economics, political sciences etc.). The papers may be theoretical where mathematics is used in a nontrivial way or computational or combination of both. Each paper submitted will be reviewed and only papers of highest quality that contain original ideas and research will be published. Papers containing only experimental techniques and abstract mathematics without any sign of application are discouraged.