流数据实时处理的大数据:现状和未来挑战

S. Ashraf, Y. Afify, R. Ismail
{"title":"流数据实时处理的大数据:现状和未来挑战","authors":"S. Ashraf, Y. Afify, R. Ismail","doi":"10.1109/ICECCME55909.2022.9987770","DOIUrl":null,"url":null,"abstract":"As many commercial businesses aspire for a competitive edge, real-time analysis on streaming data employing a big data methodology has lately become widespread. The ability to efficiently arrange massive amounts of data to make a business decision empowers data warehousing. Dealing with this sort of data poses considerable obstacles; as a result, several ways of assessing data in the form of streams have been created. Many solutions for dealing with enormous volumes of data and making decisions based on off-line batch processing have been investigated. In this paper, we explore the most recent developments in big data approaches for real-time analysis on streaming data to answer four research questions related to streaming data source, stream pre-processing, types of streaming data processing, used machine learning model, in addition to the validation and evaluation criteria. The system architecture associated with the used big data tools, the architecture hardware specs as well as other current platforms appropriate for large data streaming analytics are also considered. Furthermore, we outline numerous difficulties encountered in big data stream processing. The purpose of this review is to fill a gap in the surveyed area by offering thorough evaluation of various big data frameworks, architectures, and categories for cutting-edge approaches, as well as critical analyses of their performance and discussions of their applications, trends, and future directions to serve as guides for readers in this burgeoning study area.","PeriodicalId":202568,"journal":{"name":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","volume":"2000 23","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Big Data for Real-Time Processing on Streaming Data: State-of-the-art and Future Challenges\",\"authors\":\"S. Ashraf, Y. Afify, R. Ismail\",\"doi\":\"10.1109/ICECCME55909.2022.9987770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As many commercial businesses aspire for a competitive edge, real-time analysis on streaming data employing a big data methodology has lately become widespread. The ability to efficiently arrange massive amounts of data to make a business decision empowers data warehousing. Dealing with this sort of data poses considerable obstacles; as a result, several ways of assessing data in the form of streams have been created. Many solutions for dealing with enormous volumes of data and making decisions based on off-line batch processing have been investigated. In this paper, we explore the most recent developments in big data approaches for real-time analysis on streaming data to answer four research questions related to streaming data source, stream pre-processing, types of streaming data processing, used machine learning model, in addition to the validation and evaluation criteria. The system architecture associated with the used big data tools, the architecture hardware specs as well as other current platforms appropriate for large data streaming analytics are also considered. Furthermore, we outline numerous difficulties encountered in big data stream processing. The purpose of this review is to fill a gap in the surveyed area by offering thorough evaluation of various big data frameworks, architectures, and categories for cutting-edge approaches, as well as critical analyses of their performance and discussions of their applications, trends, and future directions to serve as guides for readers in this burgeoning study area.\",\"PeriodicalId\":202568,\"journal\":{\"name\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"volume\":\"2000 23\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECCME55909.2022.9987770\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCME55909.2022.9987770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着许多商业企业渴望获得竞争优势,采用大数据方法对流数据进行实时分析最近变得普遍。有效地安排大量数据以做出业务决策的能力为数据仓库提供了支持。处理这类数据存在相当大的障碍;因此,已经创建了几种以流形式评估数据的方法。许多基于离线批处理的海量数据处理和决策解决方案已经被研究过。在本文中,我们探讨了流数据实时分析的大数据方法的最新发展,以回答与流数据源、流预处理、流数据处理类型、使用的机器学习模型以及验证和评估标准相关的四个研究问题。与所使用的大数据工具相关的系统架构、架构硬件规格以及其他适合大数据流分析的当前平台也被考虑在内。此外,我们概述了在大数据流处理中遇到的许多困难。本综述的目的是通过对各种大数据框架、架构和前沿方法的类别进行全面评估,以及对其性能的批判性分析和对其应用、趋势和未来方向的讨论,以填补调查领域的空白,为这个新兴研究领域的读者提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Big Data for Real-Time Processing on Streaming Data: State-of-the-art and Future Challenges
As many commercial businesses aspire for a competitive edge, real-time analysis on streaming data employing a big data methodology has lately become widespread. The ability to efficiently arrange massive amounts of data to make a business decision empowers data warehousing. Dealing with this sort of data poses considerable obstacles; as a result, several ways of assessing data in the form of streams have been created. Many solutions for dealing with enormous volumes of data and making decisions based on off-line batch processing have been investigated. In this paper, we explore the most recent developments in big data approaches for real-time analysis on streaming data to answer four research questions related to streaming data source, stream pre-processing, types of streaming data processing, used machine learning model, in addition to the validation and evaluation criteria. The system architecture associated with the used big data tools, the architecture hardware specs as well as other current platforms appropriate for large data streaming analytics are also considered. Furthermore, we outline numerous difficulties encountered in big data stream processing. The purpose of this review is to fill a gap in the surveyed area by offering thorough evaluation of various big data frameworks, architectures, and categories for cutting-edge approaches, as well as critical analyses of their performance and discussions of their applications, trends, and future directions to serve as guides for readers in this burgeoning study area.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信