Chang-Hoo Jeong, Sung-Pil Choi, Sungho Shin, Seungwoo Lee, Hanmin Jung, Soon-Young Kim, Pyung Kim
{"title":"Creating Semantic Data from Relational Database","authors":"Chang-Hoo Jeong, Sung-Pil Choi, Sungho Shin, Seungwoo Lee, Hanmin Jung, Soon-Young Kim, Pyung Kim","doi":"10.1109/SocialCom.2013.174","DOIUrl":null,"url":null,"abstract":"The semantic web technology contributes to finding accurate information on the basis of the meaning of words or improving access capabilities to information through inference between meanings. However, the reason that the semantic service is not spreading is that the semantic technology is not yet settled for practical use. Another reason is that it is not easy to build ontology which is used as a knowledge representation model for providing the semantic service. Previous studies have concentrated on the process of automatically creating ontology in a RDB (Relational Database), or automatically converting it to instances. This study focuses on allocating a unique identifier to the existing DB data through entity identification to enable more accurate services to be provided in the process of converting the RDBMS data to ontology instances. Besides, this study extracts triple data including important entities and their relationships from unstructured texts stored in BLOB (Binary Large Object) format by using text mining technology, and makes them ontology instances. The DB-to-OWL converting system (Onto URI) uses a mapping rule between the DB schema and the ontology schema, an identification rule for identifying entities, text mining for extracting semantic triple, and the authority data in order to effectively support automatic creation of ontology instances with DB data. As a result, the proposed system contributes to creating semantic data automatically from relational database through URI allocation and identification.","PeriodicalId":129308,"journal":{"name":"2013 International Conference on Social Computing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Social Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SocialCom.2013.174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The semantic web technology contributes to finding accurate information on the basis of the meaning of words or improving access capabilities to information through inference between meanings. However, the reason that the semantic service is not spreading is that the semantic technology is not yet settled for practical use. Another reason is that it is not easy to build ontology which is used as a knowledge representation model for providing the semantic service. Previous studies have concentrated on the process of automatically creating ontology in a RDB (Relational Database), or automatically converting it to instances. This study focuses on allocating a unique identifier to the existing DB data through entity identification to enable more accurate services to be provided in the process of converting the RDBMS data to ontology instances. Besides, this study extracts triple data including important entities and their relationships from unstructured texts stored in BLOB (Binary Large Object) format by using text mining technology, and makes them ontology instances. The DB-to-OWL converting system (Onto URI) uses a mapping rule between the DB schema and the ontology schema, an identification rule for identifying entities, text mining for extracting semantic triple, and the authority data in order to effectively support automatic creation of ontology instances with DB data. As a result, the proposed system contributes to creating semantic data automatically from relational database through URI allocation and identification.
语义网技术有助于根据词的含义找到准确的信息,或通过意义之间的推理提高对信息的访问能力。然而,语义服务没有得到推广的原因是语义技术还没有得到实际应用。另一个原因是本体不容易建立,本体作为提供语义服务的知识表示模型。以往的研究主要集中在RDB(关系型数据库)中本体的自动创建过程,或者将本体自动转换为实例。本研究的重点是通过实体识别为现有的DB数据分配一个唯一的标识符,以便在将RDBMS数据转换为本体实例的过程中提供更准确的服务。此外,利用文本挖掘技术从存储于BLOB (Binary Large Object,二进制大对象)格式的非结构化文本中提取包含重要实体及其关系的三重数据,并将其作为本体实例。数据库到owl的转换系统(Onto URI)使用数据库模式和本体模式之间的映射规则、标识实体的标识规则、提取语义三元组的文本挖掘和权威数据,有效地支持用DB数据自动创建本体实例。通过URI分配和标识,实现了从关系数据库中自动生成语义数据的功能。