{"title":"Towards the construction of quality-aware Web Warehouses with BPMN 2.0 Business Processes","authors":"Andrea Delgado, Adriana Marotta, Laura González","doi":"10.1109/RCIS.2014.6861041","DOIUrl":null,"url":null,"abstract":"A Web Warehouse (WW) is a Data Warehouse which consolidates data from the Web. The goal of these systems is to act as an intermediary between data publication and the user, pre-processing data and adding value to them. This pre-processing involves data integration, data aggregation, data re-structuring and data quality measurement and improvement. A Business Process (BP) model helps us to specify the users, activities, precedence relations between activities and restrictions, that have to be carried out in order to obtain the desired output. In this paper we present a two level BP specification approach for constructing a WW which has two distinctive characteristics: it manages data quality and it is configurable. The first level BP model is focused on helping the user to configure the web data sources and the desired data quality characteristics, the second level BP uses the defined configuration to generate the WW. Quality characteristics are also defined for the intermediate data sources used to populate the WW.","PeriodicalId":288073,"journal":{"name":"2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCIS.2014.6861041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
A Web Warehouse (WW) is a Data Warehouse which consolidates data from the Web. The goal of these systems is to act as an intermediary between data publication and the user, pre-processing data and adding value to them. This pre-processing involves data integration, data aggregation, data re-structuring and data quality measurement and improvement. A Business Process (BP) model helps us to specify the users, activities, precedence relations between activities and restrictions, that have to be carried out in order to obtain the desired output. In this paper we present a two level BP specification approach for constructing a WW which has two distinctive characteristics: it manages data quality and it is configurable. The first level BP model is focused on helping the user to configure the web data sources and the desired data quality characteristics, the second level BP uses the defined configuration to generate the WW. Quality characteristics are also defined for the intermediate data sources used to populate the WW.