Detecting Similarity of Transferring Datasets Based on Features of Classification Rules
H. Abe, S. Tsumoto
2009 IEEE International Conference on Data Mining Workshops, 2009-12-06
DOI: https://doi.org/10.1109/ICDMW.2009.99
To transfer mined knowledge across datasets obtained in different situations, it is important to detect not only whether the knowledge can be transferred but also the limitations of the transfer. Although most methods for detecting these limitations use performance indices of sets of classifiers, such as the accuracy of a classifier set, the indices of each individual classifier are also useful. Data characterization techniques have been developed to control learning algorithm selection by using statistical measurements of a dataset. Extending this framework, we consider reusing objective rule evaluation indices of classification rules, such as support, precision, and recall, to measure the similarity of different datasets. In this paper, we present a method that characterizes given datasets based on objective rule evaluation indices and classification learning algorithms. The experimental results show that the method can detect similarity between datasets even when the datasets have totally different attribute sets. This indicates that the limitations of transferring both classifiers and learning algorithms can be detected as the similarity among datasets by using a learning algorithm.
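The idea of comparing datasets through rule evaluation indices can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not specify how rules are learned or how index values are aggregated, so the helper names (`rule_indices`, `characterize`, `distance`), the averaging step, the Euclidean distance, and the toy data are all assumptions made for illustration. The sketch uses the common definitions support = P(A and C), precision = P(C | A), and recall = P(A | C) for a rule "A implies C".

```python
from math import sqrt


def rule_indices(rows, labels, antecedent, target):
    """Return (support, precision, recall) of the rule `antecedent -> target`."""
    n = len(rows)
    covered = [antecedent(r) for r in rows]
    n_a = sum(covered)                                   # rows matching the antecedent
    n_c = sum(1 for y in labels if y == target)          # rows of the target class
    n_ac = sum(1 for c, y in zip(covered, labels) if c and y == target)
    support = n_ac / n if n else 0.0
    precision = n_ac / n_a if n_a else 0.0
    recall = n_ac / n_c if n_c else 0.0
    return support, precision, recall


def characterize(rows, labels, rules):
    """Average each index over a rule set (assumed aggregation, not from the paper)."""
    per_rule = [rule_indices(rows, labels, a, t) for a, t in rules]
    k = len(per_rule)
    return tuple(sum(v[i] for v in per_rule) / k for i in range(3))


def distance(u, v):
    """Euclidean distance between two index vectors (assumed similarity measure)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Two hypothetical toy datasets with completely different attribute sets.
weather_rows = [{"outlook": "sunny"}, {"outlook": "sunny"},
                {"outlook": "rain"}, {"outlook": "rain"}]
weather_labels = ["no", "no", "yes", "no"]
weather_rules = [(lambda r: r["outlook"] == "sunny", "no")]

credit_rows = [{"income": 10}, {"income": 20}, {"income": 80}, {"income": 90}]
credit_labels = ["deny", "deny", "grant", "deny"]
credit_rules = [(lambda r: r["income"] < 50, "deny")]

weather_vec = characterize(weather_rows, weather_labels, weather_rules)
credit_vec = characterize(credit_rows, credit_labels, credit_rules)
gap = distance(weather_vec, credit_vec)
```

Because the index vectors live in the same space regardless of which attributes the rules mention, the two datasets can be compared even though they share no attribute names. In this toy case the rules behave identically on both datasets, so the distance between the vectors is zero.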