HRPM网络:基于人眼视网膜感知机制生物学建模的高效特征学习网络

Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition Pub Date : 2022-11-17 DOI:10.1145/3581807.3581869

Renwei Ba, Yidan Zhang, Zhenghui Hu, Jun Sun, Xiao Li

{"title":"HRPM网络:基于人眼视网膜感知机制生物学建模的高效特征学习网络","authors":"Renwei Ba, Yidan Zhang, Zhenghui Hu, Jun Sun, Xiao Li","doi":"10.1145/3581807.3581869","DOIUrl":null,"url":null,"abstract":"The biological model of the mammal visual mechanisms is very beneficial to feature learning in motionless images. It is proved that the visual mechanisms can improve the performance of the hand-crafted methods and CNNS method. Recently CNNs learn discriminate and robust features by changing the backbone, processing multi-scale feature maps, and adding attention mechanisms, etc. While they are relatively short of changing the network structure with human retina mechanisms, which have been proven to have a strong feature extract capability of images by traditional feature descriptors. To address this problem, we present two CNN blocks, multi-scale receptive field convolutional block (MSRF) and Sensitivity block (SENSI), both of which are constructed by modeling the human retina ganglion cell's mechanisms. MSRF is designed to enhance the feature discriminability and robustness by imitating the exponentially increased way of the receptive fields of the P ganglion cells in the human retina. We constructed experiments to get the specific value of the size of the receptive fields, and it can capture both local and global features with various convolution kernels. SENSI is presented to make sure each receptive field has a suitable weight to choose which receptive field can better learn features. Both of them help to learn features and can be easily integrated into the existing CNN models. The framework is evaluated on two benchmark datasets. We further assemble MSRF and SENSI to the top of SSD, constructing the HRPM Net. The model outperforms the state-of-the-art approaches by a considerable margin on MS COCO, VOC 2012, and VOC 2007 datasets. The results also show that MSRF block and SENSI block are helpful in feature learning and can improve the performance by a margin.","PeriodicalId":292813,"journal":{"name":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HRPM Net: An Efficient Feature Learning Network from The Biological Modelling Of Human Retinal Perception Mechanism\",\"authors\":\"Renwei Ba, Yidan Zhang, Zhenghui Hu, Jun Sun, Xiao Li\",\"doi\":\"10.1145/3581807.3581869\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The biological model of the mammal visual mechanisms is very beneficial to feature learning in motionless images. It is proved that the visual mechanisms can improve the performance of the hand-crafted methods and CNNS method. Recently CNNs learn discriminate and robust features by changing the backbone, processing multi-scale feature maps, and adding attention mechanisms, etc. While they are relatively short of changing the network structure with human retina mechanisms, which have been proven to have a strong feature extract capability of images by traditional feature descriptors. To address this problem, we present two CNN blocks, multi-scale receptive field convolutional block (MSRF) and Sensitivity block (SENSI), both of which are constructed by modeling the human retina ganglion cell's mechanisms. MSRF is designed to enhance the feature discriminability and robustness by imitating the exponentially increased way of the receptive fields of the P ganglion cells in the human retina. We constructed experiments to get the specific value of the size of the receptive fields, and it can capture both local and global features with various convolution kernels. SENSI is presented to make sure each receptive field has a suitable weight to choose which receptive field can better learn features. Both of them help to learn features and can be easily integrated into the existing CNN models. The framework is evaluated on two benchmark datasets. We further assemble MSRF and SENSI to the top of SSD, constructing the HRPM Net. The model outperforms the state-of-the-art approaches by a considerable margin on MS COCO, VOC 2012, and VOC 2007 datasets. The results also show that MSRF block and SENSI block are helpful in feature learning and can improve the performance by a margin.\",\"PeriodicalId\":292813,\"journal\":{\"name\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3581807.3581869\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3581807.3581869","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

哺乳动物视觉机制的生物学模型对静止图像的特征学习非常有益。实验证明，视觉机制可以提高手工方法和cnn方法的性能。近年来，cnn通过改变主干、处理多尺度特征映射、增加注意机制等方法来学习区分和鲁棒特征。而传统的特征描述符对图像具有较强的特征提取能力，在改变人类视网膜机制的网络结构方面相对不足。为了解决这一问题，我们提出了两个CNN块，即多尺度接受野卷积块(MSRF)和灵敏度块(SENSI)，它们都是通过模拟人类视网膜神经节细胞的机制构建的。MSRF通过模仿人类视网膜P神经节细胞感受野的指数增长方式来增强特征的可辨别性和鲁棒性。通过构造实验得到接收野大小的具体值，并利用不同的卷积核捕获局部和全局特征。为了保证每个感受野都有一个合适的权值来选择哪个感受野能更好地学习特征，我们提出了SENSI。它们都有助于学习特征，并且可以很容易地集成到现有的CNN模型中。该框架在两个基准数据集上进行了评估。我们进一步将MSRF和SENSI组装到SSD的顶部，构建了HRPM网络。该模型在MS COCO, VOC 2012和VOC 2007数据集上的表现优于最先进的方法。结果还表明，MSRF块和SENSI块有助于特征学习，并能在一定程度上提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

HRPM Net: An Efficient Feature Learning Network from The Biological Modelling Of Human Retinal Perception Mechanism

The biological model of the mammal visual mechanisms is very beneficial to feature learning in motionless images. It is proved that the visual mechanisms can improve the performance of the hand-crafted methods and CNNS method. Recently CNNs learn discriminate and robust features by changing the backbone, processing multi-scale feature maps, and adding attention mechanisms, etc. While they are relatively short of changing the network structure with human retina mechanisms, which have been proven to have a strong feature extract capability of images by traditional feature descriptors. To address this problem, we present two CNN blocks, multi-scale receptive field convolutional block (MSRF) and Sensitivity block (SENSI), both of which are constructed by modeling the human retina ganglion cell's mechanisms. MSRF is designed to enhance the feature discriminability and robustness by imitating the exponentially increased way of the receptive fields of the P ganglion cells in the human retina. We constructed experiments to get the specific value of the size of the receptive fields, and it can capture both local and global features with various convolution kernels. SENSI is presented to make sure each receptive field has a suitable weight to choose which receptive field can better learn features. Both of them help to learn features and can be easily integrated into the existing CNN models. The framework is evaluated on two benchmark datasets. We further assemble MSRF and SENSI to the top of SSD, constructing the HRPM Net. The model outperforms the state-of-the-art approaches by a considerable margin on MS COCO, VOC 2012, and VOC 2007 datasets. The results also show that MSRF block and SENSI block are helpful in feature learning and can improve the performance by a margin.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition

自引率

0.00%

发文量