{"title":"基于关系距离的多关系聚类","authors":"Liting Wei, Yun Li","doi":"10.1109/WISA.2015.30","DOIUrl":null,"url":null,"abstract":"When clustering the tuples in the target table which is in a relational database, the prior task is to exactly and effectively calculate the relational distance between tuples. A lot of methods are used today, such as the relational distance measuring based on RIBL2. However, all these methods fail to consider the differences of similarity between the objects in both non-target table and target table, which stopped them from getting a high clustering accuracy. Using canonical correlation analysis in this paper and setting a weight for each table in the relational database, the weight indicated its role in the calculation of the distance among target tables. In addition, when calculating the distance between the two clusters to find the center of each cluster, turn the calculation of the distance between clusters into a distance between center points. Experiments show that this method ensures clustering efficiency and improves clustering accuracy.","PeriodicalId":198938,"journal":{"name":"2015 12th Web Information System and Application Conference (WISA)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-relational Clustering Based on Relational Distance\",\"authors\":\"Liting Wei, Yun Li\",\"doi\":\"10.1109/WISA.2015.30\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When clustering the tuples in the target table which is in a relational database, the prior task is to exactly and effectively calculate the relational distance between tuples. A lot of methods are used today, such as the relational distance measuring based on RIBL2. However, all these methods fail to consider the differences of similarity between the objects in both non-target table and target table, which stopped them from getting a high clustering accuracy. Using canonical correlation analysis in this paper and setting a weight for each table in the relational database, the weight indicated its role in the calculation of the distance among target tables. In addition, when calculating the distance between the two clusters to find the center of each cluster, turn the calculation of the distance between clusters into a distance between center points. Experiments show that this method ensures clustering efficiency and improves clustering accuracy.\",\"PeriodicalId\":198938,\"journal\":{\"name\":\"2015 12th Web Information System and Application Conference (WISA)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 12th Web Information System and Application Conference (WISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISA.2015.30\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 12th Web Information System and Application Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2015.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-relational Clustering Based on Relational Distance
When clustering the tuples in the target table which is in a relational database, the prior task is to exactly and effectively calculate the relational distance between tuples. A lot of methods are used today, such as the relational distance measuring based on RIBL2. However, all these methods fail to consider the differences of similarity between the objects in both non-target table and target table, which stopped them from getting a high clustering accuracy. Using canonical correlation analysis in this paper and setting a weight for each table in the relational database, the weight indicated its role in the calculation of the distance among target tables. In addition, when calculating the distance between the two clusters to find the center of each cluster, turn the calculation of the distance between clusters into a distance between center points. Experiments show that this method ensures clustering efficiency and improves clustering accuracy.