Firas Zouari, C. Ghedira, N. Kabachi, Khouloud Boukadi
{"title":"迈向基于机器学习的自适应策展服务组合","authors":"Firas Zouari, C. Ghedira, N. Kabachi, Khouloud Boukadi","doi":"10.1109/ICWS53863.2021.00022","DOIUrl":null,"url":null,"abstract":"Data curation deals with managing the data by applying different tasks such as extraction, enrichment, cleaning to fit the purpose of use. Indeed, nowadays, there is an increasing need to implement such tasks in the big data era to maintain data management. Big data is involved in decision processes to perform analysis, visualization, prediction, etc. Thus, there is a dependency between the generated outcomes and the input data of such a process. Therefore, decision process features (e.g., decision context, user constraints, and requirements) need to be taken into account during the data management process, including the data curation phase. Although the proposed curation approaches in the literature are diverse, most of them are static and do not consider the decision process features. Moreover, most of the proposals are dedicated to curating a specific data source format (e.g., structured/unstructured data source). To overcome these limitations, we propose a new approach ACUSEC (Adaptive CUration SErvice Composition) that ensures adaptive curation services composition by considering different features: the source type, the user constraints and preferences, and the decision context. To do so, we rely on AI and machine learning mechanisms such as reinforcement learning. Following the approach's definition, we conducted experiments that show encouraging results in overall execution time and adaptation to the above features.","PeriodicalId":213320,"journal":{"name":"2021 IEEE International Conference on Web Services (ICWS)","volume":"77 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Towards an adaptive curation services composition based on machine learning\",\"authors\":\"Firas Zouari, C. Ghedira, N. Kabachi, Khouloud Boukadi\",\"doi\":\"10.1109/ICWS53863.2021.00022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data curation deals with managing the data by applying different tasks such as extraction, enrichment, cleaning to fit the purpose of use. Indeed, nowadays, there is an increasing need to implement such tasks in the big data era to maintain data management. Big data is involved in decision processes to perform analysis, visualization, prediction, etc. Thus, there is a dependency between the generated outcomes and the input data of such a process. Therefore, decision process features (e.g., decision context, user constraints, and requirements) need to be taken into account during the data management process, including the data curation phase. Although the proposed curation approaches in the literature are diverse, most of them are static and do not consider the decision process features. Moreover, most of the proposals are dedicated to curating a specific data source format (e.g., structured/unstructured data source). To overcome these limitations, we propose a new approach ACUSEC (Adaptive CUration SErvice Composition) that ensures adaptive curation services composition by considering different features: the source type, the user constraints and preferences, and the decision context. To do so, we rely on AI and machine learning mechanisms such as reinforcement learning. Following the approach's definition, we conducted experiments that show encouraging results in overall execution time and adaptation to the above features.\",\"PeriodicalId\":213320,\"journal\":{\"name\":\"2021 IEEE International Conference on Web Services (ICWS)\",\"volume\":\"77 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Web Services (ICWS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWS53863.2021.00022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Web Services (ICWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWS53863.2021.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
数据管理通过应用不同的任务来管理数据,如提取、充实、清理,以适应使用目的。事实上,如今在大数据时代,越来越需要实施这样的任务来维护数据管理。大数据参与决策过程,执行分析、可视化、预测等。因此,在生成的结果和这种流程的输入数据之间存在依赖关系。因此,需要在数据管理过程(包括数据管理阶段)中考虑决策过程特征(例如,决策上下文、用户约束和需求)。虽然文献中提出的策展方法多种多样,但大多数都是静态的,没有考虑决策过程的特征。此外,大多数建议都致力于管理特定的数据源格式(例如,结构化/非结构化数据源)。为了克服这些限制,我们提出了一种新的方法ACUSEC (Adaptive CUration SErvice Composition),该方法通过考虑不同的特征(源类型、用户约束和偏好以及决策上下文)来确保自适应的策展服务组合。为了做到这一点,我们依赖于人工智能和机器学习机制,比如强化学习。根据该方法的定义,我们进行了实验,在总体执行时间和对上述特性的适应方面显示了令人鼓舞的结果。
Towards an adaptive curation services composition based on machine learning
Data curation deals with managing the data by applying different tasks such as extraction, enrichment, cleaning to fit the purpose of use. Indeed, nowadays, there is an increasing need to implement such tasks in the big data era to maintain data management. Big data is involved in decision processes to perform analysis, visualization, prediction, etc. Thus, there is a dependency between the generated outcomes and the input data of such a process. Therefore, decision process features (e.g., decision context, user constraints, and requirements) need to be taken into account during the data management process, including the data curation phase. Although the proposed curation approaches in the literature are diverse, most of them are static and do not consider the decision process features. Moreover, most of the proposals are dedicated to curating a specific data source format (e.g., structured/unstructured data source). To overcome these limitations, we propose a new approach ACUSEC (Adaptive CUration SErvice Composition) that ensures adaptive curation services composition by considering different features: the source type, the user constraints and preferences, and the decision context. To do so, we rely on AI and machine learning mechanisms such as reinforcement learning. Following the approach's definition, we conducted experiments that show encouraging results in overall execution time and adaptation to the above features.