Bin Liu, Shu-Gui Cao, D. Cao, Qing-Chun Li, Hai-Tao Liu, Shao-Nan Shi
{"title":"面向分布式数据挖掘优化的基于本体的语义异构度量框架","authors":"Bin Liu, Shu-Gui Cao, D. Cao, Qing-Chun Li, Hai-Tao Liu, Shao-Nan Shi","doi":"10.1109/ICMLC.2012.6358897","DOIUrl":null,"url":null,"abstract":"In distributed data mining (DDM) systems, the semantic heterogeneity between data sources has not got universal attentions, which may produce the potential risks of damaging the quality of the final result. This paper presents a semantic distance measurement framework to extract the essential semantic heterogeneity between data sources. In this framework, an ontology-matching based multi-strategy voting method is utilized to comprehensively synthesize the semantic distances between two data source ontologies in element level and structure level. The output of the framework can be leveraged as the foundation to group the data sources for optimizing the DDM result. Finally, the framework is integrated into a DDM architecture we have proposed.","PeriodicalId":128006,"journal":{"name":"2012 International Conference on Machine Learning and Cybernetics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An ontology based semantic heterogeneity measurement framework for optimization in distributed data mining\",\"authors\":\"Bin Liu, Shu-Gui Cao, D. Cao, Qing-Chun Li, Hai-Tao Liu, Shao-Nan Shi\",\"doi\":\"10.1109/ICMLC.2012.6358897\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In distributed data mining (DDM) systems, the semantic heterogeneity between data sources has not got universal attentions, which may produce the potential risks of damaging the quality of the final result. This paper presents a semantic distance measurement framework to extract the essential semantic heterogeneity between data sources. In this framework, an ontology-matching based multi-strategy voting method is utilized to comprehensively synthesize the semantic distances between two data source ontologies in element level and structure level. The output of the framework can be leveraged as the foundation to group the data sources for optimizing the DDM result. Finally, the framework is integrated into a DDM architecture we have proposed.\",\"PeriodicalId\":128006,\"journal\":{\"name\":\"2012 International Conference on Machine Learning and Cybernetics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 International Conference on Machine Learning and Cybernetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC.2012.6358897\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2012.6358897","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An ontology based semantic heterogeneity measurement framework for optimization in distributed data mining
In distributed data mining (DDM) systems, the semantic heterogeneity between data sources has not got universal attentions, which may produce the potential risks of damaging the quality of the final result. This paper presents a semantic distance measurement framework to extract the essential semantic heterogeneity between data sources. In this framework, an ontology-matching based multi-strategy voting method is utilized to comprehensively synthesize the semantic distances between two data source ontologies in element level and structure level. The output of the framework can be leveraged as the foundation to group the data sources for optimizing the DDM result. Finally, the framework is integrated into a DDM architecture we have proposed.