{"title":"Basic Data Reduction Techniques and Their Influence on GAME Modeling Method","authors":"Miroslav Cepek, M. Snorek","doi":"10.1109/UKSIM.2008.91","DOIUrl":null,"url":null,"abstract":"The amount of data produced by medicine diagnosis and other means constantly increases -- in both number of measurements and in number of dimensions. For many modeling or data mining methods this increase causes problems. First main problem is well known curse of dimensionality. The second is the amount of training data items which lengthens the training process. Both these problems reduces usability of modeling methods.The aim of this article is to study several data reduction techniques and test their influence on one particular inductive modeling method -- GAME -- developed in our department. Application of each method affecting the performance (accuracy) and learning time of the GAME modeling method has been studied.To obtain representative results several datasets has been tested -- for example well known Iris dataset or real-world application for medical data (e.g. EEG classification).","PeriodicalId":22356,"journal":{"name":"Tenth International Conference on Computer Modeling and Simulation (uksim 2008)","volume":"1 1","pages":"138-143"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tenth International Conference on Computer Modeling and Simulation (uksim 2008)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKSIM.2008.91","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The amount of data produced by medicine diagnosis and other means constantly increases -- in both number of measurements and in number of dimensions. For many modeling or data mining methods this increase causes problems. First main problem is well known curse of dimensionality. The second is the amount of training data items which lengthens the training process. Both these problems reduces usability of modeling methods.The aim of this article is to study several data reduction techniques and test their influence on one particular inductive modeling method -- GAME -- developed in our department. Application of each method affecting the performance (accuracy) and learning time of the GAME modeling method has been studied.To obtain representative results several datasets has been tested -- for example well known Iris dataset or real-world application for medical data (e.g. EEG classification).