DG2CEP: a near real-time on-line algorithm for detecting spatial clusters large data streams through complex event processing

IF 2.4 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Marcos Roriz Junior, Bruno Olivieri, Markus Endler
{"title":"DG2CEP: a near real-time on-line algorithm for detecting spatial clusters large data streams through complex event processing","authors":"Marcos Roriz Junior, Bruno Olivieri, Markus Endler","doi":"10.1186/s13174-019-0107-x","DOIUrl":null,"url":null,"abstract":"Spatial concentrations (or spatial clusters) of moving objects, such as vehicles and humans, is a mobility pattern that is relevant to many applications. Fast detection of this pattern and its evolution, e.g., if the cluster is shrinking or growing, is useful in numerous scenarios, such as detecting the formation of traffic jams or detecting a fast dispersion of people in a music concert. On-Line detection of this pattern is a challenging task because it requires algorithms that are capable of continuously and efficiently processing the high volume of position updates in a timely manner. Currently, the majority of approaches for spatial cluster detection operate in batch mode, where moving objects location updates are recorded during time periods of a certain length and then batch-processed by an external routine, thus delaying the result of the cluster detection until the end of the time period. Further, they extensively use spatial data structures and operators, which can be troublesome to maintain or parallelize in on-line scenarios. To address these issues, in this paper we propose DG2CEP, a parallel algorithm that combines the well-known density-based clustering algorithm DBSCAN with the data stream processing paradigm Complex Event Processing (CEP) to achieve continuous and timely detection of spatial clusters. Our experiments with real-world data streams indicate that DG2CEP is able to detect the formation and dispersion of clusters with small latency while having higher similarity to DBSCAN than batch-based approaches.","PeriodicalId":46467,"journal":{"name":"Journal of Internet Services and Applications","volume":"12 1","pages":"1-28"},"PeriodicalIF":2.4000,"publicationDate":"2019-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Internet Services and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13174-019-0107-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 7

Abstract

Spatial concentrations (or spatial clusters) of moving objects, such as vehicles and humans, is a mobility pattern that is relevant to many applications. Fast detection of this pattern and its evolution, e.g., if the cluster is shrinking or growing, is useful in numerous scenarios, such as detecting the formation of traffic jams or detecting a fast dispersion of people in a music concert. On-Line detection of this pattern is a challenging task because it requires algorithms that are capable of continuously and efficiently processing the high volume of position updates in a timely manner. Currently, the majority of approaches for spatial cluster detection operate in batch mode, where moving objects location updates are recorded during time periods of a certain length and then batch-processed by an external routine, thus delaying the result of the cluster detection until the end of the time period. Further, they extensively use spatial data structures and operators, which can be troublesome to maintain or parallelize in on-line scenarios. To address these issues, in this paper we propose DG2CEP, a parallel algorithm that combines the well-known density-based clustering algorithm DBSCAN with the data stream processing paradigm Complex Event Processing (CEP) to achieve continuous and timely detection of spatial clusters. Our experiments with real-world data streams indicate that DG2CEP is able to detect the formation and dispersion of clusters with small latency while having higher similarity to DBSCAN than batch-based approaches.
DG2CEP:一种通过复杂事件处理检测空间集群大数据流的近实时在线算法
移动对象(如车辆和人)的空间集中(或空间集群)是一种与许多应用程序相关的移动模式。快速检测这种模式及其演变,例如,如果集群正在缩小或增长,在许多场景中都很有用,例如检测交通堵塞的形成或检测音乐会中人群的快速分散。这种模式的在线检测是一项具有挑战性的任务,因为它需要能够持续有效地及时处理大量位置更新的算法。目前,大多数空间聚类检测方法采用批处理模式,即在一定长度的时间段内记录运动物体的位置更新,然后由外部例程进行批处理,从而将聚类检测结果延迟到时间段结束。此外,它们广泛使用空间数据结构和运算符,这在在线场景中维护或并行化可能很麻烦。为了解决这些问题,本文提出了DG2CEP并行算法,该算法将著名的基于密度的聚类算法DBSCAN与数据流处理范式复杂事件处理(Complex Event processing, CEP)相结合,实现对空间集群的连续、及时检测。我们对真实数据流的实验表明,DG2CEP能够以较小的延迟检测集群的形成和分散,同时与基于批处理的方法相比,与DBSCAN具有更高的相似性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Internet Services and Applications
Journal of Internet Services and Applications COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
3.70
自引率
0.00%
发文量
2
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信