{"title":"TrailMarker:地理复杂序列的自动挖掘","authors":"Takato Honda","doi":"10.1145/2926693.2929903","DOIUrl":null,"url":null,"abstract":"Given a huge collection of vehicle sensor data consisting of d sensors for w trajectories of duration n, which are accompanied by geographical information, how can we find patterns, rules and outliers? How can we efficiently and effectively find typical patterns and points of variation? In this paper we present TRAILMARKER, a fully automatic mining algorithm for geographical complex sequences. Our method has the following properties: (a) effective: it finds important patterns and outliers in real datasets; (b) scalable: it is linear with respect to the data size; (c) parameter-free: it is fully automatic, and requires no prior training, and no parameter tuning. Extensive experiments on real data demonstrate that TRAILMARKER finds interesting and unexpected patterns and groups accurately. In fact, TRAILMARKER consistently outperforms the best state-of-the-art methods in terms of both accuracy and execution speed.","PeriodicalId":123723,"journal":{"name":"Proceedings of the 2016 on SIGMOD'16 PhD Symposium","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"TrailMarker: Automatic Mining of Geographical Complex Sequences\",\"authors\":\"Takato Honda\",\"doi\":\"10.1145/2926693.2929903\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given a huge collection of vehicle sensor data consisting of d sensors for w trajectories of duration n, which are accompanied by geographical information, how can we find patterns, rules and outliers? How can we efficiently and effectively find typical patterns and points of variation? In this paper we present TRAILMARKER, a fully automatic mining algorithm for geographical complex sequences. Our method has the following properties: (a) effective: it finds important patterns and outliers in real datasets; (b) scalable: it is linear with respect to the data size; (c) parameter-free: it is fully automatic, and requires no prior training, and no parameter tuning. Extensive experiments on real data demonstrate that TRAILMARKER finds interesting and unexpected patterns and groups accurately. In fact, TRAILMARKER consistently outperforms the best state-of-the-art methods in terms of both accuracy and execution speed.\",\"PeriodicalId\":123723,\"journal\":{\"name\":\"Proceedings of the 2016 on SIGMOD'16 PhD Symposium\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2016 on SIGMOD'16 PhD Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2926693.2929903\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2016 on SIGMOD'16 PhD Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2926693.2929903","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
TrailMarker: Automatic Mining of Geographical Complex Sequences
Given a huge collection of vehicle sensor data consisting of d sensors for w trajectories of duration n, which are accompanied by geographical information, how can we find patterns, rules and outliers? How can we efficiently and effectively find typical patterns and points of variation? In this paper we present TRAILMARKER, a fully automatic mining algorithm for geographical complex sequences. Our method has the following properties: (a) effective: it finds important patterns and outliers in real datasets; (b) scalable: it is linear with respect to the data size; (c) parameter-free: it is fully automatic, and requires no prior training, and no parameter tuning. Extensive experiments on real data demonstrate that TRAILMARKER finds interesting and unexpected patterns and groups accurately. In fact, TRAILMARKER consistently outperforms the best state-of-the-art methods in terms of both accuracy and execution speed.