H. Alili, Khalid Belhajjame, Rim Drira, Daniela Grigori, H. Ghézala
{"title":"Quality Based Data Integration for Enriching User Data Sources in Service Lakes","authors":"H. Alili, Khalid Belhajjame, Rim Drira, Daniela Grigori, H. Ghézala","doi":"10.1109/ICWS.2018.00028","DOIUrl":null,"url":null,"abstract":"Data lakes have recently emerged as an alternative solution to costly traditional data warehouse solutions. To exploit data lakes, however, there is a need for means that assist users in combining and integrating data stored within a data lake. In this paper, we position ourselves in the recurrent context where a user has a local dataset that is not sufficient for processing the queries that are of interest to him/her. We show how data lakes, or more specifically the service lakes, since we are focusing on data providing services, can be leveraged to answer user queries, taking into account the quality of the services and respecting the (time and monetary) budget set by the user.","PeriodicalId":231056,"journal":{"name":"2018 IEEE International Conference on Web Services (ICWS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Web Services (ICWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWS.2018.00028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Data lakes have recently emerged as an alternative solution to costly traditional data warehouse solutions. To exploit data lakes, however, there is a need for means that assist users in combining and integrating data stored within a data lake. In this paper, we position ourselves in the recurrent context where a user has a local dataset that is not sufficient for processing the queries that are of interest to him/her. We show how data lakes, or more specifically the service lakes, since we are focusing on data providing services, can be leveraged to answer user queries, taking into account the quality of the services and respecting the (time and monetary) budget set by the user.