Shrinking Encoding with Two-Level Codebook Learning for Fine-Grained Fish Recognition

Gaoang Wang, Jenq-Neng Hwang, K. Williams, Farron Wallace, Craig S. Rose
{"title":"Shrinking Encoding with Two-Level Codebook Learning for Fine-Grained Fish Recognition","authors":"Gaoang Wang, Jenq-Neng Hwang, K. Williams, Farron Wallace, Craig S. Rose","doi":"10.1109/CVAUI.2016.018","DOIUrl":null,"url":null,"abstract":"Bag-of-features (BoF) shows a great power in representing images for image classification. Many codebook learning methods have been developed to find discriminative parts of images for fine-grained recognition. Built upon BoF framework, we propose a novel approach for finegrained fish recognition with two-level codebook learning by shrinking coding coefficients. In the framework, only the maximum-valued coefficient will be maintained in the local spatial region if followed by max pooling strategy. However, the maximum-valued coefficient may result from a local descriptor which is not discriminative among fine-grained classes, resulting in difficulty in classification. In this paper, a two-level codebook is learned to represent the importance between the local descriptor and each codeword in its corresponding k-nearest neighbors. A shrinkage function is also introduced to shrink unrelated coefficients after encoding. Our experimental results show that the proposed method achieves significant performance improvement for fine-grained fish recognition tasks.","PeriodicalId":169345,"journal":{"name":"2016 ICPR 2nd Workshop on Computer Vision for Analysis of Underwater Imagery (CVAUI)","volume":"52 20","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 ICPR 2nd Workshop on Computer Vision for Analysis of Underwater Imagery (CVAUI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVAUI.2016.018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

Bag-of-features (BoF) shows a great power in representing images for image classification. Many codebook learning methods have been developed to find discriminative parts of images for fine-grained recognition. Built upon BoF framework, we propose a novel approach for finegrained fish recognition with two-level codebook learning by shrinking coding coefficients. In the framework, only the maximum-valued coefficient will be maintained in the local spatial region if followed by max pooling strategy. However, the maximum-valued coefficient may result from a local descriptor which is not discriminative among fine-grained classes, resulting in difficulty in classification. In this paper, a two-level codebook is learned to represent the importance between the local descriptor and each codeword in its corresponding k-nearest neighbors. A shrinkage function is also introduced to shrink unrelated coefficients after encoding. Our experimental results show that the proposed method achieves significant performance improvement for fine-grained fish recognition tasks.
基于两级码本学习的细粒度鱼类识别压缩编码
特征袋(BoF)在表示图像用于图像分类方面显示出强大的能力。许多码本学习方法已经被开发出来,用于寻找图像的判别部分,以进行细粒度识别。在BoF框架的基础上,我们提出了一种基于压缩编码系数的两级码本学习的细粒度鱼类识别新方法。在该框架中,如果采用最大池化策略,则只会在局部空间区域保持最大的系数。然而,系数的最大值可能是由局部描述符产生的,而局部描述符在细粒度类之间没有区别,从而导致分类困难。本文学习了一个两级码本来表示局部描述符与对应的k近邻中的每个码字之间的重要性。在编码后引入收缩函数对不相关系数进行收缩。实验结果表明,该方法在细粒度鱼类识别任务中取得了显著的性能提升。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信