Improved Attentive Pairwise Interaction (API-Net) for Fine-Grained Image Classification

2021 Emerging Technology in Computing, Communication and Electronics (ETCCE) Pub Date : 2021-12-21 DOI:10.1109/ETCCE54784.2021.9689910

Ong Zu Yet, Taha H. Rassem, Md. Arafatur Rahman, M. M. Rahman

{"title":"Improved Attentive Pairwise Interaction (API-Net) for Fine-Grained Image Classification","authors":"Ong Zu Yet, Taha H. Rassem, Md. Arafatur Rahman, M. M. Rahman","doi":"10.1109/ETCCE54784.2021.9689910","DOIUrl":null,"url":null,"abstract":"Fine-grained classification is a challenging problem as one has to deal with a similar class of objects but with various types of variations. For more elaboration, they are almost similar and have subtle differences, and are confusing. In this study, aircraft will be the fine-grained object to be focused on. Aircraft which has almost similar shapes and patterns can be hardly recognized even for humans, especially those who haven not gone through any training. In recent years, a lot of proposed methods addressed to solve the difficulties in fine-grained problems by learning contrastive clues from an image. This study aims to increase the accuracy of the Attentive Pairwise Interaction Network (API-Net) by introducing data augmentation into the network structure. Some of the previous studies proved that data augmentation does help improve a network. So, this study is going to modify the API-Net with different data augmentation settings. In this study, various settings have been introduced to the API-Net. Several experiments had been done with a simple modification where a portion of the train dataset’s images will randomly convert into greyscale images. These settings are, only brightness & contrast 0.2, only grayscale 0.3, only grayscale 0.5, brightness & contrast 0.2 with grayscale 0.3, and brightness & contrast 0.2 with grayscale 0.5. As a result, the proposed modification achieved with 92.74% with brightness & contrast 0.2, 92.80% on brightness & contrast 0.2 with grayscale 0.5, and 92.86% on brightness & contrast 0.2 with grayscale 0.3. While grayscale 0.3 alone achieve 93.25% and grayscale 0.5 alone achieve 93.46% compared with the original results which reached 92.77%.","PeriodicalId":208038,"journal":{"name":"2021 Emerging Technology in Computing, Communication and Electronics (ETCCE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Emerging Technology in Computing, Communication and Electronics (ETCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETCCE54784.2021.9689910","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Fine-grained classification is a challenging problem as one has to deal with a similar class of objects but with various types of variations. For more elaboration, they are almost similar and have subtle differences, and are confusing. In this study, aircraft will be the fine-grained object to be focused on. Aircraft which has almost similar shapes and patterns can be hardly recognized even for humans, especially those who haven not gone through any training. In recent years, a lot of proposed methods addressed to solve the difficulties in fine-grained problems by learning contrastive clues from an image. This study aims to increase the accuracy of the Attentive Pairwise Interaction Network (API-Net) by introducing data augmentation into the network structure. Some of the previous studies proved that data augmentation does help improve a network. So, this study is going to modify the API-Net with different data augmentation settings. In this study, various settings have been introduced to the API-Net. Several experiments had been done with a simple modification where a portion of the train dataset’s images will randomly convert into greyscale images. These settings are, only brightness & contrast 0.2, only grayscale 0.3, only grayscale 0.5, brightness & contrast 0.2 with grayscale 0.3, and brightness & contrast 0.2 with grayscale 0.5. As a result, the proposed modification achieved with 92.74% with brightness & contrast 0.2, 92.80% on brightness & contrast 0.2 with grayscale 0.5, and 92.86% on brightness & contrast 0.2 with grayscale 0.3. While grayscale 0.3 alone achieve 93.25% and grayscale 0.5 alone achieve 93.46% compared with the original results which reached 92.77%.

查看原文本刊更多论文

用于细粒度图像分类的改进细心对交互(API-Net)

细粒度分类是一个具有挑战性的问题，因为必须处理类似的对象类别，但有各种类型的变化。更详细地说，它们几乎相似，但有细微的差异，令人困惑。在这项研究中，飞机将是细粒度的对象。具有几乎相似的形状和图案的飞机即使对人类来说也很难识别，特别是那些没有经过任何训练的人。近年来，许多方法都是通过从图像中学习对比线索来解决细粒度问题的困难。本研究旨在通过在网络结构中引入数据扩充来提高细心配对交互网络(API-Net)的准确性。之前的一些研究证明，数据增强确实有助于改善网络。因此，本研究将使用不同的数据增强设置来修改API-Net。在本研究中，API-Net引入了各种设置。几个实验已经完成了一个简单的修改，其中一部分火车数据集的图像将随机转换为灰度图像。这些设置是，只有亮度和对比度0.2，只有灰度0.3，只有灰度0.5，亮度和对比度0.2与灰度0.3，亮度和对比度0.2与灰度0.5。结果表明，在亮度和对比度为0.2时，修正率为92.74%，在灰度为0.5时，修正率为92.80%，在灰度为0.3时，修正率为92.86%。与原始结果92.77%相比，单独灰度0.3和单独灰度0.5分别达到93.25%和93.46%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 Emerging Technology in Computing, Communication and Electronics (ETCCE)

自引率

0.00%

发文量