Automated fish detection and classification on sonar images using detection transformer and YOLOv7
Ella Mahoro, M. Akhloufi
International Conference on Quality Control by Artificial Vision, 2023-07-28
DOI: 10.1117/12.2688330 (https://doi.org/10.1117/12.2688330)
Citations: 0
Abstract
To maintain a healthy ecosystem and fish stocks, it is necessary to monitor the abundance and frequency of fish species. In this article, we propose a fish detection and classification system. First, images were extracted from the public Ocqueoc River DIDSON high-resolution imaging sonar dataset and annotated. Two end-to-end object detection models, Detection Transformer with a ResNet-50 backbone (DETR-ResNet-50) and YOLOv7, were then used to detect and classify fish species. With a mean average precision of 0.79, YOLOv7 outperformed DETR-ResNet-50. The results demonstrate that the proposed system can be used to detect and classify fish species in high-resolution imaging sonar data.
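For readers unfamiliar with the reported metric, the following is a minimal sketch of how mean average precision (mAP) is commonly computed for bounding-box detectors. It is not the authors' evaluation code. The abstract does not state the IoU threshold or the interpolation scheme, so the 0.5 threshold, the all-point interpolation, the greedy matching rule, and all function names here are assumptions made for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(
    dets: List[Tuple[float, Box]], gts: List[Box], thr: float = 0.5
) -> float:
    """AP for one class: area under the precision-recall curve.

    dets holds (confidence, box) pairs; gts holds ground-truth boxes.
    thr = 0.5 is an assumed IoU threshold, not one taken from the paper.
    """
    if not gts:
        return 0.0
    matched = [False] * len(gts)
    tp_flags = []
    # Visit detections from most to least confident.
    for _, box in sorted(dets, key=lambda d: d[0], reverse=True):
        # Assumed rule: match to the best still-unmatched ground-truth box.
        best, best_iou = -1, thr
        for i, gt in enumerate(gts):
            overlap = iou(box, gt)
            if not matched[i] and overlap >= best_iou:
                best, best_iou = i, overlap
        if best >= 0:
            matched[best] = True
        tp_flags.append(best >= 0)

    # Running precision and recall after each detection.
    precisions, recalls, tp = [], [], 0
    for k, is_tp in enumerate(tp_flags, start=1):
        tp += int(is_tp)
        precisions.append(tp / k)
        recalls.append(tp / len(gts))

    # Make precision non-increasing from right to left (all-point interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum precision over each increase in recall.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def mean_average_precision(per_class_ap: List[float]) -> float:
    """mAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap)
```

In practice, `average_precision` would be called once per fish species over all test images, and `mean_average_precision` would average the results. Established evaluation tools differ in details such as matching rules and IoU thresholds, so their numbers are not guaranteed to agree with this sketch.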