On the explanation of COVID-19 blood test variables using fuzzy models
Arturo Téllez-Velázquez, Pierre A. Delice, Rafael Salgado-Leyva, Raúl Cruz-Barbosa
Journal of Intelligent & Fuzzy Systems, published 2024-03-23. DOI: 10.3233/jifs-219372 (https://doi.org/10.3233/jifs-219372)
Abstract
This paper compares two evolutionary explainable fuzzy models that make inferences in a pipeline on a blood test data set for COVID-19 classification. First, the data are preprocessed in three stages: cleaning, imputation, and ranking-based feature selection. Next, we compare several clustering methods used in an Evolutionary Clustering-Structured Fuzzy Classifier (ECSFC) to solve this classification problem using the Differential Evolution (DE) algorithm. In addition, we find that the Fuzzy Decision Tree model achieves similar performance when it is tuned with the DE algorithm (EFDT). The results show that simpler models are easier to explain qualitatively; that is, increasing the number of clusters in the ECSFC model or the maximum tree depth in the EFDT model does not necessarily help to obtain simplified and accurate models. Moreover, although the EFDT model is by itself intuitively explainable, the ECSFC model, with the help of the proposed Weighted Stacked Features Plot, generates more intuitive models: they not only highlight the features and linguistic terms that define a patient with COVID-19, but also let users visualize the analyzed classes in a single graph, each in a specific color.
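As a rough illustration of how DE can tune the parameters of a fuzzy classifier, the sketch below runs a basic DE/rand/1/bin loop over the centers and width of two Gaussian membership functions. It is not the authors' ECSFC or EFDT implementation: the DE variant, the Gaussian membership functions, the synthetic one-feature data, and every name and hyperparameter (`differential_evolution`, `error_rate`, `f`, `cr`, and so on) are assumptions made for this example.

```python
import math
import random


def differential_evolution(objective, bounds, pop_size=20, f=0.8, cr=0.9,
                           generations=100, seed=0):
    """Minimise `objective` with a basic DE/rand/1/bin scheme (pop_size >= 4)."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial population inside the box constraints.
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(ind) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct individuals, all different from the target i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            forced = rng.randrange(dim)  # at least one gene comes from the mutant
            trial = []
            for d in range(dim):
                if d == forced or rng.random() < cr:
                    value = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    lo, hi = bounds[d]
                    value = min(max(value, lo), hi)  # clip to the bounds
                else:
                    value = pop[i][d]
                trial.append(value)
            trial_score = objective(trial)
            # Greedy selection: keep the trial if it is at least as good.
            if trial_score <= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def gaussian_membership(x, center, width):
    """Degree to which x belongs to a fuzzy set centred at `center`."""
    return math.exp(-0.5 * ((x - center) / width) ** 2)


# Synthetic one-feature data (value, label); NOT taken from the paper.
TOY_DATA = [(0.10, 0), (0.20, 0), (0.30, 0), (0.35, 0),
            (0.65, 1), (0.70, 1), (0.80, 1), (0.90, 1)]


def error_rate(params):
    """Misclassification rate of a two-set fuzzy classifier (hypothetical)."""
    c0, c1, width = params
    errors = 0
    for x, label in TOY_DATA:
        m0 = gaussian_membership(x, c0, width)
        m1 = gaussian_membership(x, c1, width)
        predicted = 0 if m0 >= m1 else 1
        if predicted != label:
            errors += 1
    return errors / len(TOY_DATA)


BOUNDS = [(0.0, 1.0), (0.0, 1.0), (0.05, 0.5)]  # c0, c1, shared width
best_params, best_error = differential_evolution(error_rate, BOUNDS)
print("tuned parameters:", best_params, "error rate:", best_error)
```

In a real setting the objective would be a validation error of the full fuzzy model rather than this toy error rate, and the parameter vector would encode whatever the model exposes for tuning.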