{"title":"FFM-ViT: an efficient fish species classification method based on deep features and transformers.","authors":"Yuwei Gao, Xiaoyong Li, Jian Xiang, Yunjie Xie, Chen Yang, Wan Xiang","doi":"10.1111/jfb.70213","DOIUrl":null,"url":null,"abstract":"<p><p>Morphological identification of fish species plays a crucial role in the monitoring and management of fishery resources and biodiversity conservation. However, existing classification methods fail to meet practical needs when confronted with small fish data sets and high similarity. In this paper, we propose a novel deep learning method called feature fusion module vision transformer (FFM-ViT). The essence of FFM-ViT lies in abandoning the direct patch operation used in traditional vision transformer (ViT) and introducing Mobile Inverted Bottleneck Convolution (MBConv) and Fused Mobile Inverted Bottleneck Convolution (Fuse-MBConv) blocks to obtain more accurate high-dimensional information. To enhance feature extraction capability and channel feature fusion, we also introduce the channel spatial merge attention (CSMA) module. Furthermore, we have curated a dataset consisting of 78 categories named Oceanfish78. Our model achieves an impressive accuracy rate of 90.2% on this dataset, surpassing the 80.4% accuracy achieved by the ViT model without pre-trained weights significantly. Additionally, we conducted tests on several datasets, such as fish4knowledge and Fish31, while comparing our proposed method with other deep learning models, including shufflenet, convnext and swin transformer, through comprehensive empirical analysis. The results demonstrate that our proposed method outperforms existing approaches comprehensively, not only providing an effective solution for fish classification, but also offering valuable insights for approximate target recognition in other environments.</p>","PeriodicalId":15794,"journal":{"name":"Journal of fish biology","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of fish biology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/jfb.70213","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FISHERIES","Score":null,"Total":0}
引用次数: 0
Abstract
Morphological identification of fish species plays a crucial role in the monitoring and management of fishery resources and biodiversity conservation. However, existing classification methods fail to meet practical needs when confronted with small fish data sets and high similarity. In this paper, we propose a novel deep learning method called feature fusion module vision transformer (FFM-ViT). The essence of FFM-ViT lies in abandoning the direct patch operation used in traditional vision transformer (ViT) and introducing Mobile Inverted Bottleneck Convolution (MBConv) and Fused Mobile Inverted Bottleneck Convolution (Fuse-MBConv) blocks to obtain more accurate high-dimensional information. To enhance feature extraction capability and channel feature fusion, we also introduce the channel spatial merge attention (CSMA) module. Furthermore, we have curated a dataset consisting of 78 categories named Oceanfish78. Our model achieves an impressive accuracy rate of 90.2% on this dataset, surpassing the 80.4% accuracy achieved by the ViT model without pre-trained weights significantly. Additionally, we conducted tests on several datasets, such as fish4knowledge and Fish31, while comparing our proposed method with other deep learning models, including shufflenet, convnext and swin transformer, through comprehensive empirical analysis. The results demonstrate that our proposed method outperforms existing approaches comprehensively, not only providing an effective solution for fish classification, but also offering valuable insights for approximate target recognition in other environments.
期刊介绍:
The Journal of Fish Biology is a leading international journal for scientists engaged in all aspects of fishes and fisheries research, both fresh water and marine. The journal publishes high-quality papers relevant to the central theme of fish biology and aims to bring together under one cover an overall picture of the research in progress and to provide international communication among researchers in many disciplines with a common interest in the biology of fish.