Use of Instance Typicality for Efficient Detection of Outliers with Neural Network Classifiers
Authors: S. Sane, A. Ghatol
Published in: 9th International Conference on Information Technology (ICIT'06), 2006-12-18
DOI: 10.1109/ICIT.2006.89 (https://doi.org/10.1109/ICIT.2006.89)
Outlier detection is a data pre-processing task. Across applications, outliers need to be detected to improve classifier accuracy. Existing techniques include statistical, distance-based, and deviation-based outlier detection, and many of them follow the filter approach. A wrapper approach based on the concept of instance typicality can also be used to detect outliers. This paper presents a new wrapper method that builds an initial model with a neural network and treats the values at the output-layer neurons as typicality scores. Instances with the lowest output values are treated as potential outliers. The method also helps build compact, accurate classifiers by selecting a few of the most typical instances, which significantly reduces storage space. Because the method is generic, it can be used for instance selection with any kind of classifier. The resulting compact models are useful for imputing missing values.
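
The scoring step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the network is already trained, that its output-layer activations are available as one row per instance, and that the typicality score of an instance is the activation of the neuron for its labeled class (the abstract does not say which output neuron is used). The function names, the cut-off `k`, the per-class count `m`, and the example activations are all hypothetical.

```python
from typing import Dict, List, Sequence


def typicality_scores(outputs: Sequence[Sequence[float]],
                      labels: Sequence[int]) -> List[float]:
    """Assumed scoring rule: the output value of the labeled class neuron."""
    return [row[label] for row, label in zip(outputs, labels)]


def potential_outliers(scores: Sequence[float], k: int) -> List[int]:
    """Indices of the k instances with the lowest typicality scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    return order[:k]


def most_typical_per_class(scores: Sequence[float],
                           labels: Sequence[int],
                           m: int) -> Dict[int, List[int]]:
    """Indices of the m highest-scoring instances within each class."""
    by_class: Dict[int, List[int]] = {}
    for i, label in enumerate(labels):
        by_class.setdefault(label, []).append(i)
    return {
        label: sorted(idx, key=lambda i: scores[i], reverse=True)[:m]
        for label, idx in by_class.items()
    }


# Hypothetical output-layer activations for six instances, two classes.
outputs = [
    [0.95, 0.05],
    [0.90, 0.10],
    [0.20, 0.80],  # labeled class 0, but the network favors class 1
    [0.10, 0.90],
    [0.30, 0.70],
    [0.85, 0.15],  # labeled class 1, but the network favors class 0
]
labels = [0, 0, 0, 1, 1, 1]

scores = typicality_scores(outputs, labels)
outlier_idx = potential_outliers(scores, k=2)
typical_idx = most_typical_per_class(scores, labels, m=1)
```

In this made-up data, the two instances whose labeled-class output is lowest (indices 5 and 2) are flagged as potential outliers, and one most typical instance per class is kept as a candidate for a compact training set. How many instances to flag or keep is a tuning choice that the abstract leaves open.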