The Promotion Effect of LOESS Smoothing Technique in Short-term Traffic Volume Clustering

Jia-juan Chen, Yu-bang Liu, Zhi-Yuan Wang, Chuan-tao Wang
{"title":"The Promotion Effect of LOESS Smoothing Technique in Short-term Traffic Volume Clustering","authors":"Jia-juan Chen, Yu-bang Liu, Zhi-Yuan Wang, Chuan-tao Wang","doi":"10.2991/MASTA-19.2019.65","DOIUrl":null,"url":null,"abstract":"In short-term traffic volume clustering, one important issue is the representation of traffic profiles. This article focuses on how the LOESS smoothing technique enhances the clustering effect and what the best value of parameter span of LOESS is. This article used K-Means clustering algorithm and compared the clustering effect using raw data and smoothed data. The experiment result shows when the traffic profiles are slight smoothed, the clustering effect enhances from 39.15% to 66.48%. And the best range of parameter span is 0.2~0.4 to keep the balance of clustering effect and profiles details. This article verifies the promotion effect of LOESS smoothing technique in shortterm traffic volume clustering and gives advice on the best value of LOESS parameter. Introduction With the rapid development of information techniques such as Internet of things, big data, cloud computing, Intelligent Transport System (ITS) is making great progress not only in laboratories but also in real traffic management systems in China. Short term traffic analysis is the essential component of ITS to identify specific traffic patterns and make forecasting of short-term traffic flow. Short term traffic clustering, which can identify similar traffic flow of different sections of road on different day, is enlightening to find similar traffic patterns and support traffic management. Some researchers used clustering to support data driven forecasting methods such as K Nearest Neighbor regression and used locally estimated scatterplot smoothing (LOESS) to smooth the noise in traffic flow profiles. However, the promotion effect of LOESS to identify similar traffic profiles was rarely been focused on. This research aims to fill up this gap to find whether LOESS can enhance the clustering effect of short term traffic profiles and what the best parameter of LOESS is to keep the balance of maintaining the profiles details and clustering effect. Short term traffic profile is one type of time series. Aghabozorgi (2015) [1] has reviewed the four main elements of time series clustering, including representation method, similarity degree, prototype definition, clustering algorithm. This article uses Euclidean distance as similarity measurement, average value as prototype definition and K-Means as clustering algorithm. Raw profiles and LOESS smoothing profiles as two kinds of representation methods will be conducted in clustering experiments and the clustering effects will be compared. Methodology K-Means Clustering Methods K-Means is one of the most popular clustering algorithms based on partition. K-Means produces k clusters from n unlabeled objects to make sure that there is at least one object in each cluster [1]. The clustering prototype of K-Means is the average of the objects. The principle of K-Means is to minimize the sum of the distances (generally expressed in Euclidean distance) between all objects in the cluster and their cluster center (i.e. prototype). So this article chooses the result of between-cluster sum of squares by total squared distance as an index of clustering index. For the data in Euclidean International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019) Copyright © 2019, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). Advances in Intelligent Systems Research, volume 168","PeriodicalId":103896,"journal":{"name":"Proceedings of the 2019 International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2991/MASTA-19.2019.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In short-term traffic volume clustering, one important issue is the representation of traffic profiles. This article focuses on how the LOESS smoothing technique enhances the clustering effect and what the best value of parameter span of LOESS is. This article used K-Means clustering algorithm and compared the clustering effect using raw data and smoothed data. The experiment result shows when the traffic profiles are slight smoothed, the clustering effect enhances from 39.15% to 66.48%. And the best range of parameter span is 0.2~0.4 to keep the balance of clustering effect and profiles details. This article verifies the promotion effect of LOESS smoothing technique in shortterm traffic volume clustering and gives advice on the best value of LOESS parameter. Introduction With the rapid development of information techniques such as Internet of things, big data, cloud computing, Intelligent Transport System (ITS) is making great progress not only in laboratories but also in real traffic management systems in China. Short term traffic analysis is the essential component of ITS to identify specific traffic patterns and make forecasting of short-term traffic flow. Short term traffic clustering, which can identify similar traffic flow of different sections of road on different day, is enlightening to find similar traffic patterns and support traffic management. Some researchers used clustering to support data driven forecasting methods such as K Nearest Neighbor regression and used locally estimated scatterplot smoothing (LOESS) to smooth the noise in traffic flow profiles. However, the promotion effect of LOESS to identify similar traffic profiles was rarely been focused on. This research aims to fill up this gap to find whether LOESS can enhance the clustering effect of short term traffic profiles and what the best parameter of LOESS is to keep the balance of maintaining the profiles details and clustering effect. Short term traffic profile is one type of time series. Aghabozorgi (2015) [1] has reviewed the four main elements of time series clustering, including representation method, similarity degree, prototype definition, clustering algorithm. This article uses Euclidean distance as similarity measurement, average value as prototype definition and K-Means as clustering algorithm. Raw profiles and LOESS smoothing profiles as two kinds of representation methods will be conducted in clustering experiments and the clustering effects will be compared. Methodology K-Means Clustering Methods K-Means is one of the most popular clustering algorithms based on partition. K-Means produces k clusters from n unlabeled objects to make sure that there is at least one object in each cluster [1]. The clustering prototype of K-Means is the average of the objects. The principle of K-Means is to minimize the sum of the distances (generally expressed in Euclidean distance) between all objects in the cluster and their cluster center (i.e. prototype). So this article chooses the result of between-cluster sum of squares by total squared distance as an index of clustering index. For the data in Euclidean International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019) Copyright © 2019, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). Advances in Intelligent Systems Research, volume 168
黄土平滑技术对短期交通量聚类的促进作用
在短期交通量聚类中,一个重要的问题是交通量剖面的表示。本文重点讨论了黄土平滑技术如何提高聚类效果以及黄土参数跨度的最佳取值。本文采用K-Means聚类算法,比较了原始数据和平滑数据的聚类效果。实验结果表明,当交通轮廓稍微平滑时,聚类效果从39.15%提高到66.48%。为了保持聚类效果和轮廓细节的平衡,参数跨度的最佳范围为0.2~0.4。验证了黄土平滑技术对短期交通量聚类的促进作用,并给出了黄土参数的最佳取值建议。随着物联网、大数据、云计算等信息技术的快速发展,智能交通系统(ITS)在中国无论是在实验室还是在实际的交通管理系统中都取得了长足的进步。短期交通分析是智能交通系统识别特定交通模式和预测短期交通流量的重要组成部分。短期交通聚类可以识别不同路段在不同时段的相似交通流,对寻找相似交通模式,支持交通管理具有一定的启示意义。一些研究人员使用聚类支持数据驱动的预测方法,如K最近邻回归,并使用局部估计散点图平滑(黄土)来平滑交通流剖面中的噪声。然而,黄土对识别相似交通剖面的促进作用却很少得到关注。本研究旨在填补这一空白,探讨黄土是否能够增强短期交通剖面的聚类效果,以及黄土在保持剖面细节与聚类效果之间的最佳参数是什么。短期流量剖面图是时间序列的一种。Aghabozorgi(2015)[1]回顾了时间序列聚类的四个主要要素,包括表示方法、相似度、原型定义、聚类算法。本文采用欧氏距离作为相似性度量,均值作为原型定义,K-Means作为聚类算法。将原始剖面和黄土平滑剖面作为两种表示方法进行聚类实验,比较聚类效果。K-Means聚类方法K-Means是目前最流行的基于分区的聚类算法之一。k - means从n个未标记的对象生成k个聚类,以确保每个聚类中至少有一个对象[1]。K-Means的聚类原型是对象的平均值。K-Means的原理是最小化聚类中所有对象与其聚类中心(即原型)之间的距离之和(一般用欧几里得距离表示)。因此本文选择聚类间平方和与总平方距离的结果作为聚类指标。欧几里得建模、分析、仿真技术与应用国际会议(MASTA 2019)中的数据版权所有©2019,作者。亚特兰蒂斯出版社出版。这是一篇基于CC BY-NC许可(http://creativecommons.org/licenses/by-nc/4.0/)的开放获取文章。智能系统研究进展,第168卷
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信