Tao Wu, Jiali Mao, Yifan Zhu, Kaixuan Zhu, Aoying Zhou
{"title":"Multi-view context awareness based transport stay hotspot recognizing","authors":"Tao Wu, Jiali Mao, Yifan Zhu, Kaixuan Zhu, Aoying Zhou","doi":"10.1007/s11280-024-01256-5","DOIUrl":null,"url":null,"abstract":"<p>During long-distance transporting for bulk commodities, the trucks need to stop off at multiple places for resting, refueling, repairing or unloading, called as transport stay hotspots (or <i>Tshot</i> for short). Massive waybills and their related trajectories accumulated by the freight platforms enable us to recognize <i>Tshot</i>s and keep them updated constantly. But due to most of <i>Tshot</i>s have varying sizes and are adjacent to each other, it is hard to pinpoint their locations precisely. In addition, to correctly annotate functional tags of <i>Tshots</i> that have fewer visiting trajectories is quite difficult. In this paper, we propose a <u>M</u>ulti-view <u>C</u>ontext awareness based transport <span>\\(\\underline{S}\\)</span><i>tay hotspot</i> <u>R</u>ecognition framework, called <i>MCSR</i>, consisting of <i>location identification</i>, <i>feature extraction</i> and <i>functional tag annotation</i>. To address the missed-detection issue in pinpointing adjacent <i>Tshots</i> having various sizes, we design a <i>multi-view clustering</i> based stay area merging strategy by incorporating the distance between <i>road turn-off locations</i>, the number of visiting trajectories with the similarity of <i>visiting time distribution</i>. Further, aiming at the issue of low annotating precision resulted by data scarcity, based on extracting <i>behavioral features</i> and <i>attribute features</i> from waybill trajectories, we leverage a <i>time interval awareness self-attention network</i> to extract <i>semantic contextual features</i> to assist in ensemble learning based annotation modeling. Experimental results on a large-scale logistics dataset demonstrate that our proposal can improve <i>F-measure</i> by an average of 14.76%, <i>AIoU</i> by an average of 12.89% for <i>location identification</i>, and <i>G-mean</i> by an average of 18.39% and <i>mAUC</i> by an average of 14.48% for <i>functional tag annotation</i> as compared to the baselines.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Wide Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11280-024-01256-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
During long-distance transporting for bulk commodities, the trucks need to stop off at multiple places for resting, refueling, repairing or unloading, called as transport stay hotspots (or Tshot for short). Massive waybills and their related trajectories accumulated by the freight platforms enable us to recognize Tshots and keep them updated constantly. But due to most of Tshots have varying sizes and are adjacent to each other, it is hard to pinpoint their locations precisely. In addition, to correctly annotate functional tags of Tshots that have fewer visiting trajectories is quite difficult. In this paper, we propose a Multi-view Context awareness based transport \(\underline{S}\)tay hotspotRecognition framework, called MCSR, consisting of location identification, feature extraction and functional tag annotation. To address the missed-detection issue in pinpointing adjacent Tshots having various sizes, we design a multi-view clustering based stay area merging strategy by incorporating the distance between road turn-off locations, the number of visiting trajectories with the similarity of visiting time distribution. Further, aiming at the issue of low annotating precision resulted by data scarcity, based on extracting behavioral features and attribute features from waybill trajectories, we leverage a time interval awareness self-attention network to extract semantic contextual features to assist in ensemble learning based annotation modeling. Experimental results on a large-scale logistics dataset demonstrate that our proposal can improve F-measure by an average of 14.76%, AIoU by an average of 12.89% for location identification, and G-mean by an average of 18.39% and mAUC by an average of 14.48% for functional tag annotation as compared to the baselines.