{"title":"多目标跟踪的两步模型","authors":"Shuai Zhang, Xiaobo Lu, Songlin Du","doi":"10.1145/3487075.3487083","DOIUrl":null,"url":null,"abstract":"Multi-object tracking is widely used in video analysis. However, due to the limitation of detector performance, many multi-object tracking models have the problem of detecting two objects into one object in some occlusion scenes. In this paper, we propose a two-step model for handling this problem. In the first step model, the non-occlusion targets are detected and embeddings are extracted, while the occlusion areas are identified. The second step model processes the occlusion areas to obtain occlusion targets' accurate positions and embeddings. Finally, we integrate and optimize the output results of the two steps models. Experiments show that the number of false positives and missed positives in our model's object detection is significantly reduced. The multi-object tracking performance (MOTA metric) is improved by nearly 3% compared with other models.","PeriodicalId":354966,"journal":{"name":"Proceedings of the 5th International Conference on Computer Science and Application Engineering","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Two-step Model for Multi-object Tracking\",\"authors\":\"Shuai Zhang, Xiaobo Lu, Songlin Du\",\"doi\":\"10.1145/3487075.3487083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-object tracking is widely used in video analysis. However, due to the limitation of detector performance, many multi-object tracking models have the problem of detecting two objects into one object in some occlusion scenes. In this paper, we propose a two-step model for handling this problem. In the first step model, the non-occlusion targets are detected and embeddings are extracted, while the occlusion areas are identified. The second step model processes the occlusion areas to obtain occlusion targets' accurate positions and embeddings. Finally, we integrate and optimize the output results of the two steps models. Experiments show that the number of false positives and missed positives in our model's object detection is significantly reduced. The multi-object tracking performance (MOTA metric) is improved by nearly 3% compared with other models.\",\"PeriodicalId\":354966,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Computer Science and Application Engineering\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Computer Science and Application Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3487075.3487083\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Computer Science and Application Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3487075.3487083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-object tracking is widely used in video analysis. However, due to the limitation of detector performance, many multi-object tracking models have the problem of detecting two objects into one object in some occlusion scenes. In this paper, we propose a two-step model for handling this problem. In the first step model, the non-occlusion targets are detected and embeddings are extracted, while the occlusion areas are identified. The second step model processes the occlusion areas to obtain occlusion targets' accurate positions and embeddings. Finally, we integrate and optimize the output results of the two steps models. Experiments show that the number of false positives and missed positives in our model's object detection is significantly reduced. The multi-object tracking performance (MOTA metric) is improved by nearly 3% compared with other models.