基于网格化超像素的高效内存深度显著目标分割网络

2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP) Pub Date : 2017-12-27 DOI:10.1109/MMSP.2018.8547096

Çağlar Aytekin, Xingyang Ni, Francesco Cricri, Lixin Fan, Emre B. Aksu

{"title":"基于网格化超像素的高效内存深度显著目标分割网络","authors":"Çağlar Aytekin, Xingyang Ni, Francesco Cricri, Lixin Fan, Emre B. Aksu","doi":"10.1109/MMSP.2018.8547096","DOIUrl":null,"url":null,"abstract":"Computer vision algorithms with pixel-wise labeling tasks, such as semantic segmentation and salient object detection, have gone through a significant accuracy increase with the incorporation of deep learning. Deep segmentation methods slightly modify and fine-tune pre-trained networks that have hundreds of millions of parameters. In this work, we question the need of having such memory demanding networks for a reasonable performance in salient object segmentation. To this end, we propose a way to learn a memory-efficient network from scratch by training it only on salient object detection datasets. Our method encodes images to gridized superpixels that preserve both the object boundaries and the connectivity rules of regular pixels. This representation allows us to use convolutional neural networks that operate on regular grids. By using these encoded images, we train a memory-efficient network using only 0.048% of the number of parameters that a majority of other deep salient object detection networks have. Our method shows comparable accuracy with the state-of-the-art deep salient object detection methods and provides a much more memory-efficient alternative to them. Due to its easy deployment and small size, such a network is preferable for applications in memory limited IoT devices.","PeriodicalId":137522,"journal":{"name":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Memory-Efficient Deep Salient Object Segmentation Networks on Gridized Superpixels\",\"authors\":\"Çağlar Aytekin, Xingyang Ni, Francesco Cricri, Lixin Fan, Emre B. Aksu\",\"doi\":\"10.1109/MMSP.2018.8547096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer vision algorithms with pixel-wise labeling tasks, such as semantic segmentation and salient object detection, have gone through a significant accuracy increase with the incorporation of deep learning. Deep segmentation methods slightly modify and fine-tune pre-trained networks that have hundreds of millions of parameters. In this work, we question the need of having such memory demanding networks for a reasonable performance in salient object segmentation. To this end, we propose a way to learn a memory-efficient network from scratch by training it only on salient object detection datasets. Our method encodes images to gridized superpixels that preserve both the object boundaries and the connectivity rules of regular pixels. This representation allows us to use convolutional neural networks that operate on regular grids. By using these encoded images, we train a memory-efficient network using only 0.048% of the number of parameters that a majority of other deep salient object detection networks have. Our method shows comparable accuracy with the state-of-the-art deep salient object detection methods and provides a much more memory-efficient alternative to them. Due to its easy deployment and small size, such a network is preferable for applications in memory limited IoT devices.\",\"PeriodicalId\":137522,\"journal\":{\"name\":\"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)\",\"volume\":\"78 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MMSP.2018.8547096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2018.8547096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

具有逐像素标记任务的计算机视觉算法，如语义分割和显著目标检测，随着深度学习的结合，准确性显著提高。深度分割方法稍微修改和微调预训练的网络，这些网络有数亿个参数。在这项工作中，我们质疑在显著目标分割中需要这样的内存需求网络来获得合理的性能。为此，我们提出了一种方法，通过只在显著目标检测数据集上训练它来从头开始学习内存高效网络。我们的方法将图像编码为网格化的超像素，既保留了目标边界，又保留了常规像素的连通性规则。这种表示允许我们使用在规则网格上运行的卷积神经网络。通过使用这些编码图像，我们训练了一个内存效率高的网络，使用的参数数量仅为大多数其他深度显著目标检测网络的0.048%。我们的方法显示出与最先进的深度显著目标检测方法相当的准确性，并提供了一种更高效的内存替代方法。由于其易于部署和体积小，这种网络更适合内存有限的物联网设备中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Memory-Efficient Deep Salient Object Segmentation Networks on Gridized Superpixels

Computer vision algorithms with pixel-wise labeling tasks, such as semantic segmentation and salient object detection, have gone through a significant accuracy increase with the incorporation of deep learning. Deep segmentation methods slightly modify and fine-tune pre-trained networks that have hundreds of millions of parameters. In this work, we question the need of having such memory demanding networks for a reasonable performance in salient object segmentation. To this end, we propose a way to learn a memory-efficient network from scratch by training it only on salient object detection datasets. Our method encodes images to gridized superpixels that preserve both the object boundaries and the connectivity rules of regular pixels. This representation allows us to use convolutional neural networks that operate on regular grids. By using these encoded images, we train a memory-efficient network using only 0.048% of the number of parameters that a majority of other deep salient object detection networks have. Our method shows comparable accuracy with the state-of-the-art deep salient object detection methods and provides a much more memory-efficient alternative to them. Due to its easy deployment and small size, such a network is preferable for applications in memory limited IoT devices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP)

自引率

0.00%

发文量