用于对象再识别的任务特定数据集的无监督构建

Proceedings of the 2021 7th International Conference on Computer Technology Applications Pub Date : 2021-07-13 DOI:10.1145/3477911.3477922

Petr Pulc, M. Holeňa

{"title":"用于对象再识别的任务特定数据集的无监督构建","authors":"Petr Pulc, M. Holeňa","doi":"10.1145/3477911.3477922","DOIUrl":null,"url":null,"abstract":"In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking [16]. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm, see Fig. 1 for visualisation. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.","PeriodicalId":174824,"journal":{"name":"Proceedings of the 2021 7th International Conference on Computer Technology Applications","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unsupervised Construction of Task-Specific Datasets for Object Re-identification\",\"authors\":\"Petr Pulc, M. Holeňa\",\"doi\":\"10.1145/3477911.3477922\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking [16]. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm, see Fig. 1 for visualisation. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.\",\"PeriodicalId\":174824,\"journal\":{\"name\":\"Proceedings of the 2021 7th International Conference on Computer Technology Applications\",\"volume\":\"101 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 7th International Conference on Computer Technology Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3477911.3477922\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 7th International Conference on Computer Technology Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3477911.3477922","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在过去的十年中，我们已经看到深度神经网络在图像处理任务和许多其他研究领域的重大进步。然而，尽管各种神经结构已经成功地解决了许多任务，但它们不断地需要越来越多的处理时间和训练数据。此外，目前使用现有的预训练架构作为主干，并在其上附加新的处理分支的趋势不仅增加了这种需求，而且降低了整个模型的可解释性。我们的研究重点是图像处理任务的可解释构建块的组合，例如目标跟踪。我们提出将最先进的目标检测和分割神经网络Mask R-CNN与我们之前发表的稀疏特征跟踪方法相结合[16]。这样的组合允许我们通过使用所提出的稀疏特征tracklets连接检测到的掩模来跟踪对象。然而，这种方法不能从完全的物体遮挡中恢复，必须借助于物体的重新识别。为此，本文将我们的特征跟踪方法用于一个略有不同的任务:我们可以直接使用对象表征的无监督提取来微调对象重新识别算法，如图1所示。由于我们必须在对象跟踪中使用已经存在的对象掩码，我们的方法利用附加信息作为对象表示的alpha通道，这进一步提高了重新识别的精度。另一个好处是，我们的微调方法甚至可以在完全在线的场景中使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Unsupervised Construction of Task-Specific Datasets for Object Re-identification

In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking [16]. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm, see Fig. 1 for visualisation. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2021 7th International Conference on Computer Technology Applications

自引率

0.00%

发文量