{"title":"A Comprehensive Review of Unstructured Data Management Approaches in Data Warehouse","authors":"Vedika Gupta, A. Gosain","doi":"10.1109/ISCBI.2013.20","DOIUrl":null,"url":null,"abstract":"The amount of business data is large & keeps on evolving leading to heterogeneous information base. The challenge is to access, analyze & integrate various data sources for making intelligent decisions. Business data can be structured or unstructured. Structured data attains the row-column format easily while unstructured data (USD) is the one that poses problem in such kind of tabular storage. Owing to the fact that USD is more than three times of structured data, and that it is more resourceful business wise and helps in charting out strategies and making decisions, it becomes important to devise methods for handling USD in data warehouse. Since the importance of USD has been realized, various authors have discussed different ways to manage it and extract useful information from it. In this paper, we have first comprehensively reviewed & surveyed the representative research works of various authors that have demonstrated how unstructured data can be handled in the warehouse. Finally, we have manifested & sorted them on various parameters & provided the same in tabular form.","PeriodicalId":311471,"journal":{"name":"2013 International Symposium on Computational and Business Intelligence","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Symposium on Computational and Business Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCBI.2013.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The amount of business data is large & keeps on evolving leading to heterogeneous information base. The challenge is to access, analyze & integrate various data sources for making intelligent decisions. Business data can be structured or unstructured. Structured data attains the row-column format easily while unstructured data (USD) is the one that poses problem in such kind of tabular storage. Owing to the fact that USD is more than three times of structured data, and that it is more resourceful business wise and helps in charting out strategies and making decisions, it becomes important to devise methods for handling USD in data warehouse. Since the importance of USD has been realized, various authors have discussed different ways to manage it and extract useful information from it. In this paper, we have first comprehensively reviewed & surveyed the representative research works of various authors that have demonstrated how unstructured data can be handled in the warehouse. Finally, we have manifested & sorted them on various parameters & provided the same in tabular form.