{"title":"The prediction of bankruptcy using fuzzy classifiers","authors":"R. Nogueira, S. Vieira, J. Sousa","doi":"10.1109/CIMA.2005.1662315","DOIUrl":null,"url":null,"abstract":"Real-world databases are highly susceptible to noisy, missing, and inconsistent data due to their typically huge size, which is a prevailing problem in data analysis. The easiest way to handle such data sets in classification is to discard data with missing and extreme values. Since this complete case approach may result in a loss of valuable information and reduced data set size, preprocessing techniques are used in this paper. These techniques include data cleaning, data transformations and data reduction. A new feature selection for data reduction is introduced, which uses the fast fuzzy clustering algorithm in classification problems. The experiments show the advantages of the proposed methods for data preprocessing in a real world problem: the prediction of bankruptcy. The data set used in this study has missing values and extreme values. The data set also presents a much smaller bankruptcy class than the not bankruptcy class","PeriodicalId":306045,"journal":{"name":"2005 ICSC Congress on Computational Intelligence Methods and Applications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 ICSC Congress on Computational Intelligence Methods and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIMA.2005.1662315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Real-world databases are highly susceptible to noisy, missing, and inconsistent data due to their typically huge size, which is a prevailing problem in data analysis. The easiest way to handle such data sets in classification is to discard data with missing and extreme values. Since this complete case approach may result in a loss of valuable information and reduced data set size, preprocessing techniques are used in this paper. These techniques include data cleaning, data transformations and data reduction. A new feature selection for data reduction is introduced, which uses the fast fuzzy clustering algorithm in classification problems. The experiments show the advantages of the proposed methods for data preprocessing in a real world problem: the prediction of bankruptcy. The data set used in this study has missing values and extreme values. The data set also presents a much smaller bankruptcy class than the not bankruptcy class