{"title":"Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Proposals","authors":"Kamirul Kamirul, Odysseas Pappas, Alin Achim","doi":"arxiv-2409.07973","DOIUrl":null,"url":null,"abstract":"We present Sparse R-CNN OBB, a novel framework for the detection of oriented\nobjects in SAR images leveraging sparse learnable proposals. The Sparse R-CNN\nOBB has streamlined architecture and ease of training as it utilizes a sparse\nset of 300 proposals instead of training a proposals generator on hundreds of\nthousands of anchors. To the best of our knowledge, Sparse R-CNN OBB is the\nfirst to adopt the concept of sparse learnable proposals for the detection of\noriented objects, as well as for the detection of ships in Synthetic Aperture\nRadar (SAR) images. The detection head of the baseline model, Sparse R-CNN, is\nre-designed to enable the model to capture object orientation. We also\nfine-tune the model on RSDD-SAR dataset and provide a performance comparison to\nstate-of-the-art models. Experimental results shows that Sparse R-CNN OBB\nachieves outstanding performance, surpassing other models on both inshore and\noffshore scenarios. The code is available at:\nwww.github.com/ka-mirul/Sparse-R-CNN-OBB.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07973","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present Sparse R-CNN OBB, a novel framework for the detection of oriented
objects in SAR images leveraging sparse learnable proposals. The Sparse R-CNN
OBB has streamlined architecture and ease of training as it utilizes a sparse
set of 300 proposals instead of training a proposals generator on hundreds of
thousands of anchors. To the best of our knowledge, Sparse R-CNN OBB is the
first to adopt the concept of sparse learnable proposals for the detection of
oriented objects, as well as for the detection of ships in Synthetic Aperture
Radar (SAR) images. The detection head of the baseline model, Sparse R-CNN, is
re-designed to enable the model to capture object orientation. We also
fine-tune the model on RSDD-SAR dataset and provide a performance comparison to
state-of-the-art models. Experimental results shows that Sparse R-CNN OBB
achieves outstanding performance, surpassing other models on both inshore and
offshore scenarios. The code is available at:
www.github.com/ka-mirul/Sparse-R-CNN-OBB.