{"title":"Deep Siamese Network for Multiple Object Tracking","authors":"Bonan Cuan, Khalid Idrissi, Christophe Garcia","doi":"10.1109/MMSP.2018.8547137","DOIUrl":null,"url":null,"abstract":"Multiple object tracking is an important but challenging computer vision task. Thanks to the significant progress in object detection field, tracking-by-detection becomes a trending paradigm for tracking multiple objects at the same time. Appearance models are also widely used for associating detection results. In this paper, we combine cosine similarity metric learning with very deep convolutional neural network, yielding a robust appearance pairwise matching model: a deep Siamese network capable of re-identifying the same object after a long time and dealing with partial and complete occlusion. Embedded in existing tracking algorithms, our model is a lightweight but powerful module for decision-making among track hypotheses. Experiments on MOT Challenge 2016 benchmark [1] demonstrate the effectiveness of our model, which achieves state-of-the-art performance without delving into extensive hyper-parameter tuning.","PeriodicalId":137522,"journal":{"name":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2018.8547137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Multiple object tracking is an important but challenging computer vision task. Thanks to the significant progress in object detection field, tracking-by-detection becomes a trending paradigm for tracking multiple objects at the same time. Appearance models are also widely used for associating detection results. In this paper, we combine cosine similarity metric learning with very deep convolutional neural network, yielding a robust appearance pairwise matching model: a deep Siamese network capable of re-identifying the same object after a long time and dealing with partial and complete occlusion. Embedded in existing tracking algorithms, our model is a lightweight but powerful module for decision-making among track hypotheses. Experiments on MOT Challenge 2016 benchmark [1] demonstrate the effectiveness of our model, which achieves state-of-the-art performance without delving into extensive hyper-parameter tuning.