Managing Textual Data Semantically In Relational Databases

W. Yafooz, Siti Zalaha Abdin, SK Ahammad Fahad
{"title":"Managing Textual Data Semantically In Relational Databases","authors":"W. Yafooz, Siti Zalaha Abdin, SK Ahammad Fahad","doi":"10.1109/ICSCEE.2018.8538426","DOIUrl":null,"url":null,"abstract":"the massive volume of data in databases, web pages, and document files usually causes information to be disorganized and unclear for the user. Therefore, information in such an environment can be classified into three forms: structured, semistructured, or unstructured. Structured information is the best form of information because it facilitates the acquisition and comprehension of knowledge. Relational Database Management System (RDBMS) has a robust structure that manages, organizes and retrieves data. There are many attempts have been made in order to deal with such data. These attempts can be categorized into three groups: within a database schema, by a developed data model within the database, or by query-based techniques in database. Nonetheless, RDBMS contain massive amount of unstructured data such as textual data.. This paper proposed Textual Virtual Schema Model (TVSM). TVSM is conducted to perform semantic textual data linking and clustering and is embedded in the relational database structure (schema). In addition, linking and converting the unstructured information to structured data. Quality improvement of textual data clusters. Achievement of high query processing efficiency in retrieving data clusters. TVSM initially developed to assist researchers, developers, and database administrators who are concerned on unstructured information management, information extraction, multi-document clustering, information retrieval, query processing efficiency, personal information management, question answering, information integration, news tracking, and news summarization","PeriodicalId":265737,"journal":{"name":"2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCEE.2018.8538426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

the massive volume of data in databases, web pages, and document files usually causes information to be disorganized and unclear for the user. Therefore, information in such an environment can be classified into three forms: structured, semistructured, or unstructured. Structured information is the best form of information because it facilitates the acquisition and comprehension of knowledge. Relational Database Management System (RDBMS) has a robust structure that manages, organizes and retrieves data. There are many attempts have been made in order to deal with such data. These attempts can be categorized into three groups: within a database schema, by a developed data model within the database, or by query-based techniques in database. Nonetheless, RDBMS contain massive amount of unstructured data such as textual data.. This paper proposed Textual Virtual Schema Model (TVSM). TVSM is conducted to perform semantic textual data linking and clustering and is embedded in the relational database structure (schema). In addition, linking and converting the unstructured information to structured data. Quality improvement of textual data clusters. Achievement of high query processing efficiency in retrieving data clusters. TVSM initially developed to assist researchers, developers, and database administrators who are concerned on unstructured information management, information extraction, multi-document clustering, information retrieval, query processing efficiency, personal information management, question answering, information integration, news tracking, and news summarization
关系型数据库中文本数据的语义管理
数据库、网页和文档文件中大量的数据通常会导致信息混乱,对用户来说不清晰。因此,这种环境中的信息可以分为三种形式:结构化、半结构化或非结构化。结构化信息是最好的信息形式,因为它有利于知识的获取和理解。关系数据库管理系统(RDBMS)具有管理、组织和检索数据的健壮结构。为了处理这类数据,已经做了许多尝试。这些尝试可以分为三组:在数据库模式中,在数据库中开发的数据模型中,或在数据库中使用基于查询的技术。然而,RDBMS包含大量的非结构化数据,如文本数据。本文提出了文本虚拟图式模型(TVSM)。TVSM用于进行语义文本数据链接和聚类,并嵌入到关系数据库结构(模式)中。此外,将非结构化信息链接并转换为结构化数据。文本数据簇的质量改进。在检索数据簇时实现了高查询处理效率。TVSM最初是为了帮助研究人员、开发人员和数据库管理员进行非结构化信息管理、信息提取、多文档聚类、信息检索、查询处理效率、个人信息管理、问答、信息集成、新闻跟踪和新闻总结而开发的
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信