Algorithms for Trajectory Points Clustering in Location-based Social Networks

Nan Han, Shaojie Qiao, Kun Yue, Jianbin Huang, Qiang He, Tingting Tang, Faliang Huang, Chunlin He, Chang-an Yuan
{"title":"Algorithms for Trajectory Points Clustering in Location-based Social Networks","authors":"Nan Han, Shaojie Qiao, Kun Yue, Jianbin Huang, Qiang He, Tingting Tang, Faliang Huang, Chunlin He, Chang-an Yuan","doi":"10.1145/3480972","DOIUrl":null,"url":null,"abstract":"Recent advances in localization techniques have fundamentally enhanced social networking services, allowing users to share their locations and location-related contents. This has further increased the popularity of location-based social networks (LBSNs) and produces a huge amount of trajectories composed of continuous and complex spatio-temporal points from people’s daily lives. How to accurately aggregate large-scale trajectories is an important and challenging task. Conventional clustering algorithms (e.g., k-means or k-mediods) cannot be directly employed to process trajectory data due to their serialization, triviality and redundancy. Aiming to overcome the drawbacks of traditional k-means algorithm and k-mediods, including their sensitivity to the selection of the initial k value, the cluster centers and easy convergence to a locally optimal solution, we first propose an optimized k-means algorithm (namely OKM) to obtain k optimal initial clustering centers based on the density of trajectory points. Second, because k-means is sensitive to noisy points, we propose an improved k-mediods algorithm called IKMD based on an acceptable radius r by considering users’ geographic location in LBSNs. The value of k can be calculated based on r, and the optimal k points are selected as the initial clustering centers with high densities to reduce the cost of distance calculation. Thirdly, we thoroughly analyze the advantages of IKMD by comparing it with the commonly used clustering approaches through illustrative examples. Last, we conduct extensive experiments to evaluate the performance of IKMD against seven clustering approaches including the proposed optimized k-means algorithm, k-mediods algorithm, traditional density-based k-mediods algorithm and the state-of-the-arts trajectory clustering methods. The results demonstrate that IKMD significantly outperforms existing algorithms in the cost of distance calculation and the convergence speed. The methods proposed is proved to contribute to a larger effort targeted at advancing the study of intelligent trajectory data analytics.","PeriodicalId":123526,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology (TIST)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Intelligent Systems and Technology (TIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3480972","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Recent advances in localization techniques have fundamentally enhanced social networking services, allowing users to share their locations and location-related contents. This has further increased the popularity of location-based social networks (LBSNs) and produces a huge amount of trajectories composed of continuous and complex spatio-temporal points from people’s daily lives. How to accurately aggregate large-scale trajectories is an important and challenging task. Conventional clustering algorithms (e.g., k-means or k-mediods) cannot be directly employed to process trajectory data due to their serialization, triviality and redundancy. Aiming to overcome the drawbacks of traditional k-means algorithm and k-mediods, including their sensitivity to the selection of the initial k value, the cluster centers and easy convergence to a locally optimal solution, we first propose an optimized k-means algorithm (namely OKM) to obtain k optimal initial clustering centers based on the density of trajectory points. Second, because k-means is sensitive to noisy points, we propose an improved k-mediods algorithm called IKMD based on an acceptable radius r by considering users’ geographic location in LBSNs. The value of k can be calculated based on r, and the optimal k points are selected as the initial clustering centers with high densities to reduce the cost of distance calculation. Thirdly, we thoroughly analyze the advantages of IKMD by comparing it with the commonly used clustering approaches through illustrative examples. Last, we conduct extensive experiments to evaluate the performance of IKMD against seven clustering approaches including the proposed optimized k-means algorithm, k-mediods algorithm, traditional density-based k-mediods algorithm and the state-of-the-arts trajectory clustering methods. The results demonstrate that IKMD significantly outperforms existing algorithms in the cost of distance calculation and the convergence speed. The methods proposed is proved to contribute to a larger effort targeted at advancing the study of intelligent trajectory data analytics.
基于位置的社交网络中轨迹点聚类算法
最近本地化技术的进步从根本上增强了社交网络服务,允许用户分享他们的位置和与位置相关的内容。这进一步增加了基于位置的社交网络(LBSNs)的普及,并产生了大量由人们日常生活中连续而复杂的时空点组成的轨迹。如何准确地聚合大尺度轨迹是一项重要而具有挑战性的任务。传统的聚类算法(如k-means或k-mediods)由于其序列化、琐碎性和冗余性而不能直接用于处理轨迹数据。针对传统k-means算法和k- medium算法对初始k值的选取敏感、聚类中心容易收敛到局部最优解等缺点,提出了一种基于轨迹点密度获得k个最优初始聚类中心的优化k-means算法(即OKM)。其次,由于k-means对噪声点敏感,我们提出了一种基于可接受半径r的改进k- medium算法IKMD,并考虑了LBSNs中用户的地理位置。可以根据r计算k的值,并选择最优的k个点作为高密度的初始聚类中心,以减少距离计算的代价。第三,通过实例比较了IKMD与常用聚类方法的优势。最后,我们进行了大量的实验来评估IKMD与7种聚类方法的性能,包括所提出的优化k-means算法、k-mediods算法、传统的基于密度的k-mediods算法和最先进的轨迹聚类方法。结果表明,IKMD在距离计算代价和收敛速度上明显优于现有算法。所提出的方法被证明有助于更大的努力,旨在推进智能轨迹数据分析的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信