{"title":"Using machine learning for unsupervised maritime waypoint discovery from streaming AIS data","authors":"Andrej Dobrkovic, M. Iacob, J. Hillegersberg","doi":"10.1145/2809563.2809573","DOIUrl":null,"url":null,"abstract":"Estimating the future position of a deep sea vessel more than 24 hours in advance is a major challenge for Dutch logistics service providers (LSPs). Their unscheduled arrival in ports directly impacts scheduling and waiting times of barges, propagating throughout the entire supply chain network. To help LSPs' planners improve planning operations, we intend to capture the characteristics of maritime routes for a specific region (the North Sea connecting the Netherlands and United Kingdom) in the form of a directed graph, which can be used as a foundation for predicting destination and arrival time of each associated vessel. To create such graph we need an efficient way to extract waypoints for traffic data and this is the problem we will address in this paper. Since LSPs only use publicly available data for arrival estimation, our solution is entirely based on Automatic Identification System (AIS) data. Extracting positional information from AIS, we explore various machine learning approaches to identify clusters. We apply DBSCAN algorithm and show its advantages and disadvantages when used on AIS data. The same process is repeated using meta-heuristics, comparing clustering results generated by a genetic algorithm and by modified ant-colony optimization to those produced by DBSCAN. Finally, we present a hybrid approach and its ability to discover waypoints, highlighting the achieved improvements. To extend the problem, two constraints are added. The first is the requirement to handle large volumes of streaming AIS data on standard PC-based hardware. The second introduces the common situation of \"dark areas\" in a map due to problems with receiving and transmitting AIS data. The algorithm discovers route waypoints in efficient and effective ways under these constraints.","PeriodicalId":20526,"journal":{"name":"Proceedings of the 15th International Conference on Knowledge Technologies and Data-driven Business","volume":"71 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2015-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 15th International Conference on Knowledge Technologies and Data-driven Business","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2809563.2809573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33
Abstract
Estimating the future position of a deep sea vessel more than 24 hours in advance is a major challenge for Dutch logistics service providers (LSPs). Their unscheduled arrival in ports directly impacts scheduling and waiting times of barges, propagating throughout the entire supply chain network. To help LSPs' planners improve planning operations, we intend to capture the characteristics of maritime routes for a specific region (the North Sea connecting the Netherlands and United Kingdom) in the form of a directed graph, which can be used as a foundation for predicting destination and arrival time of each associated vessel. To create such graph we need an efficient way to extract waypoints for traffic data and this is the problem we will address in this paper. Since LSPs only use publicly available data for arrival estimation, our solution is entirely based on Automatic Identification System (AIS) data. Extracting positional information from AIS, we explore various machine learning approaches to identify clusters. We apply DBSCAN algorithm and show its advantages and disadvantages when used on AIS data. The same process is repeated using meta-heuristics, comparing clustering results generated by a genetic algorithm and by modified ant-colony optimization to those produced by DBSCAN. Finally, we present a hybrid approach and its ability to discover waypoints, highlighting the achieved improvements. To extend the problem, two constraints are added. The first is the requirement to handle large volumes of streaming AIS data on standard PC-based hardware. The second introduces the common situation of "dark areas" in a map due to problems with receiving and transmitting AIS data. The algorithm discovers route waypoints in efficient and effective ways under these constraints.