A Comparison of Techniques for Handling Incomplete Input Data with a Focus on Attribute Relevance Influence
M. Millán-Giraldo, J. S. Sánchez, V. Traver
2010 Ninth International Conference on Machine Learning and Applications, 2010-12-12
DOI: 10.1109/ICMLA.2010.126 (https://doi.org/10.1109/ICMLA.2010.126)
This work presents a new approach based on support vector regression for dealing with incomplete input (unseen) data and compares it to existing techniques. The empirical analysis covers 18 real data sets and five different classifiers, with the aim of determining which technique is most suitable for each classifier. The study also examines how the relevance of the missing attribute affects the performance of each (handling algorithm, classifier) pair. Experimental results show that no technique is uniformly better than the others across all classifiers. However, combining the proposed strategy with the nearest neighbor classifier appears to be the best choice for handling missing attribute values in the input data.
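The abstract does not spell out how the regression-based strategy is formulated, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that a single attribute is missing from an unseen sample, that the regressor is trained on complete training data to predict that attribute from the remaining ones, and that the completed sample is then passed to a 1-nearest-neighbor classifier. The helper name `impute_with_svr`, the toy data, and the hyperparameters are all illustrative assumptions; `SVR` and `KNeighborsClassifier` come from scikit-learn.

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.neighbors import KNeighborsClassifier


def impute_with_svr(X_train, x_incomplete, missing_idx):
    """Estimate one missing attribute of a sample from its observed attributes.

    Hypothetical helper for illustration; hyperparameters are arbitrary.
    """
    # Attributes that are still observed in the incomplete sample.
    observed = [j for j in range(X_train.shape[1]) if j != missing_idx]
    # Regress the missing attribute on the observed ones, using complete training data.
    reg = SVR(kernel="rbf", C=10.0, epsilon=0.01)
    reg.fit(X_train[:, observed], X_train[:, missing_idx])
    # Fill the gap with the regressor's estimate.
    x_filled = np.array(x_incomplete, dtype=float)
    x_filled[missing_idx] = reg.predict(x_filled[observed].reshape(1, -1))[0]
    return x_filled


# Toy training set (an assumption): two well-separated classes,
# with the third attribute equal to the sum of the first two.
a = np.linspace(0.0, 1.0, 5)
u, v = [g.ravel() for g in np.meshgrid(a, a)]
class0 = np.column_stack([u, v, u + v])
class1 = np.column_stack([u + 3.0, v + 3.0, u + v + 6.0])
X_train = np.vstack([class0, class1])
y_train = np.array([0] * len(class0) + [1] * len(class1))

# Unseen sample whose third attribute is missing.
x_new = [0.5, 0.5, np.nan]
filled = impute_with_svr(X_train, x_new, missing_idx=2)

# Classify the completed sample with a 1-nearest-neighbor classifier.
knn = KNeighborsClassifier(n_neighbors=1).fit(X_train, y_train)
label = int(knn.predict(filled.reshape(1, -1))[0])
print(filled, label)
```

In this sketch the regressor is fit only when a sample arrives with a gap, which keeps the example short; a practical variant would presumably train one regressor per attribute in advance and reuse it for every incomplete sample.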