{"title":"Semantic Layer Construction for Big Data Integration","authors":"N. Soe, Tin Tin Yee, Ei Chaw Htoon","doi":"10.1109/ICAIT51105.2020.9261799","DOIUrl":null,"url":null,"abstract":"Heterogeneity problem (data model, schema and semantics) is the one of the challenges of data integration from different big data stores. In order to overcome that problem in big data integration, semantic conceptual layer is constructed as an intermediate layer between different data stores by using ontology. To achieve this goal, there are two steps involved: generate local ontologies from different data systems and merge extracted local ontologies to build a global ontology. The main focus of the paper is merging ontologies which matches local ontologies by syntactic and semantic similarity measures. The two concepts are syntactically compared by Jaccard similarity measure and semantically compared by using WordNet. The matching approach takes into account the class name, internal structure (attributes) of the class, relationship (objectProperty relation, is-part-of relation) of the compared concepts. The performance of proposed algorithm will be compared other merging algorithms such as PROMPT.","PeriodicalId":173291,"journal":{"name":"2020 International Conference on Advanced Information Technologies (ICAIT)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Advanced Information Technologies (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT51105.2020.9261799","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Heterogeneity problem (data model, schema and semantics) is the one of the challenges of data integration from different big data stores. In order to overcome that problem in big data integration, semantic conceptual layer is constructed as an intermediate layer between different data stores by using ontology. To achieve this goal, there are two steps involved: generate local ontologies from different data systems and merge extracted local ontologies to build a global ontology. The main focus of the paper is merging ontologies which matches local ontologies by syntactic and semantic similarity measures. The two concepts are syntactically compared by Jaccard similarity measure and semantically compared by using WordNet. The matching approach takes into account the class name, internal structure (attributes) of the class, relationship (objectProperty relation, is-part-of relation) of the compared concepts. The performance of proposed algorithm will be compared other merging algorithms such as PROMPT.