Online Multi-object Visual Tracking using a GM-PHD Filter with Deep Appearance Learning
Nathanael L. Baisa
2019 22nd International Conference on Information Fusion (FUSION), July 2019
DOI: https://doi.org/10.23919/fusion43075.2019.9011441
We propose a new online multi-object visual tracker based on a Gaussian mixture Probability Hypothesis Density (GM-PHD) filter in combination with a similarity Convolutional Neural Network (CNN). The GM-PHD filter estimates the states and cardinality of an unknown and time-varying number of targets in the scene, handling target birth, death, clutter (false alarms) and missed detections in a unified framework, and its complexity is linear in the number of targets. However, it does not maintain target identities. To alleviate this shortcoming and label targets across frames, we combine spatio-temporal and visual similarities obtained from object bounding boxes and deep CNN appearance features, respectively. We apply the method to tracking multiple targets in video sequences acquired under varying environmental conditions and target densities, using a tracking-by-detection approach. Finally, we carry out extensive experiments on the Multiple Object Tracking 2016 (MOT16) and 2017 (MOT17) benchmark datasets and find that our tracker significantly outperforms several state-of-the-art trackers in terms of tracking accuracy and precision.
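As a rough illustration of how spatio-temporal and visual similarities might be combined to carry labels across frames, the sketch below fuses a bounding-box overlap score (IoU) with the cosine similarity of appearance feature vectors and then matches detections to existing tracks. It is not the paper's method: the abstract does not state the fusion rule or the association algorithm, so the weight `alpha`, the threshold `min_score`, the greedy matching, and the dictionary fields `id`, `box` and `feat` are all assumptions made for this example.

```python
from math import sqrt


def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def cosine(u, v):
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = sqrt(sum(x * x for x in u))
    norm_v = sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def combined_similarity(track, det, alpha=0.5):
    """Weighted sum of spatial and appearance similarity.

    alpha is a hypothetical mixing weight, not a value from the paper.
    """
    spatial = iou(track["box"], det["box"])
    visual = cosine(track["feat"], det["feat"])
    return alpha * spatial + (1.0 - alpha) * visual


def greedy_assign(tracks, dets, alpha=0.5, min_score=0.5):
    """Greedily match detections to tracks, best combined score first.

    Returns {detection index: track id}. Detections left out of the
    result would be candidates for new labels.
    """
    scored = sorted(
        (
            (combined_similarity(t, d, alpha), ti, di)
            for ti, t in enumerate(tracks)
            for di, d in enumerate(dets)
        ),
        reverse=True,
    )
    used_tracks, used_dets, matches = set(), set(), {}
    for score, ti, di in scored:
        if score < min_score:
            break  # all remaining pairs score lower still
        if ti in used_tracks or di in used_dets:
            continue
        matches[di] = tracks[ti]["id"]
        used_tracks.add(ti)
        used_dets.add(di)
    return matches
```

Greedy matching is used here only to keep the example dependency-free; an optimal assignment solver could be substituted without changing the similarity computation.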