Arnab Banerjee, D. Bhattacharjee, N. Das, Samarendra Behra, N. T. Srinivasan
{"title":"CARP-YOLO:在混乱环境中识别和计数鱼类的检测框架","authors":"Arnab Banerjee, D. Bhattacharjee, N. Das, Samarendra Behra, N. T. Srinivasan","doi":"10.1109/INCET57972.2023.10170475","DOIUrl":null,"url":null,"abstract":"In the research area of object detection, the recognition of different fish species in a cluttered environment is a challenging task. In the live fish market, different fish species with variations in size, angle, and scale make it hard for the common people to recognize the species properly. Again, counting and sorting fish species is an important task in the fisheries industry. A dataset named JUDVLP-WBUAFS: Fishdb-Detection.v1 with 400 images is prepared by collecting images from the different live fish markets in West Bengal under unconstrained environments. Various augmentations like flip, rotation, blur, gaussian noise, hue saturation, and RGBshift have been applied to make the dataset more diversified and less prone to overfitting. A total of six fish species, Labeo catla, Labeo rohita, Cirrhinus mrigala, Labeo bata, Hypophthalmichthys molitrix, and Ctenopharyngodon idella, are considered in this study for the purposes of recognition and counting. Two popular object detection deep learning networks, YOLOv3 and YOLOv5, with different variants, have been applied to the original and augmented datasets individually. Using YOLOv5l and the YOLOv3-SPP network, the best mAP@0.5 of 0.764 is achieved on the original dataset. In the augmented dataset, the best mAP@0.5 of 0.84 is achieved using the YOLOv3-SPP network. The mAP@0.5 value on both datasets shows a promising result for the recognition of fish species in some extremely cluttered environments. This study is expected to help common people and the fishing industry in a variety of contexts.","PeriodicalId":403008,"journal":{"name":"2023 4th International Conference for Emerging Technology (INCET)","volume":"575 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CARP-YOLO: A Detection Framework for Recognising and Counting Fish Species in a Cluttered Environment\",\"authors\":\"Arnab Banerjee, D. Bhattacharjee, N. Das, Samarendra Behra, N. T. Srinivasan\",\"doi\":\"10.1109/INCET57972.2023.10170475\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the research area of object detection, the recognition of different fish species in a cluttered environment is a challenging task. In the live fish market, different fish species with variations in size, angle, and scale make it hard for the common people to recognize the species properly. Again, counting and sorting fish species is an important task in the fisheries industry. A dataset named JUDVLP-WBUAFS: Fishdb-Detection.v1 with 400 images is prepared by collecting images from the different live fish markets in West Bengal under unconstrained environments. Various augmentations like flip, rotation, blur, gaussian noise, hue saturation, and RGBshift have been applied to make the dataset more diversified and less prone to overfitting. A total of six fish species, Labeo catla, Labeo rohita, Cirrhinus mrigala, Labeo bata, Hypophthalmichthys molitrix, and Ctenopharyngodon idella, are considered in this study for the purposes of recognition and counting. Two popular object detection deep learning networks, YOLOv3 and YOLOv5, with different variants, have been applied to the original and augmented datasets individually. Using YOLOv5l and the YOLOv3-SPP network, the best mAP@0.5 of 0.764 is achieved on the original dataset. In the augmented dataset, the best mAP@0.5 of 0.84 is achieved using the YOLOv3-SPP network. The mAP@0.5 value on both datasets shows a promising result for the recognition of fish species in some extremely cluttered environments. This study is expected to help common people and the fishing industry in a variety of contexts.\",\"PeriodicalId\":403008,\"journal\":{\"name\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"volume\":\"575 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INCET57972.2023.10170475\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCET57972.2023.10170475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CARP-YOLO: A Detection Framework for Recognising and Counting Fish Species in a Cluttered Environment
In the research area of object detection, the recognition of different fish species in a cluttered environment is a challenging task. In the live fish market, different fish species with variations in size, angle, and scale make it hard for the common people to recognize the species properly. Again, counting and sorting fish species is an important task in the fisheries industry. A dataset named JUDVLP-WBUAFS: Fishdb-Detection.v1 with 400 images is prepared by collecting images from the different live fish markets in West Bengal under unconstrained environments. Various augmentations like flip, rotation, blur, gaussian noise, hue saturation, and RGBshift have been applied to make the dataset more diversified and less prone to overfitting. A total of six fish species, Labeo catla, Labeo rohita, Cirrhinus mrigala, Labeo bata, Hypophthalmichthys molitrix, and Ctenopharyngodon idella, are considered in this study for the purposes of recognition and counting. Two popular object detection deep learning networks, YOLOv3 and YOLOv5, with different variants, have been applied to the original and augmented datasets individually. Using YOLOv5l and the YOLOv3-SPP network, the best mAP@0.5 of 0.764 is achieved on the original dataset. In the augmented dataset, the best mAP@0.5 of 0.84 is achieved using the YOLOv3-SPP network. The mAP@0.5 value on both datasets shows a promising result for the recognition of fish species in some extremely cluttered environments. This study is expected to help common people and the fishing industry in a variety of contexts.