Graphically Residual Attentive Network for tackling aerial image occlusion

IF 4 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Praveen Kumar Pradhan , Kunal Purkayastha , Aaditya Lochan Sharma , Udayan Baruah , Biswaraj Sen , Palash Ghosal
{"title":"Graphically Residual Attentive Network for tackling aerial image occlusion","authors":"Praveen Kumar Pradhan ,&nbsp;Kunal Purkayastha ,&nbsp;Aaditya Lochan Sharma ,&nbsp;Udayan Baruah ,&nbsp;Biswaraj Sen ,&nbsp;Palash Ghosal","doi":"10.1016/j.compeleceng.2025.110429","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has rapidly advanced, enabling new applications such as object detection, text recognition, and occlusion handling. However, challenges remain in the detection of objects in complex environments such as aerial images where things like motion blur, low light, and significant occlusion occur. This paper addresses a similar challenge by introducing a novel supervised framework, the Graphically Residual Attentive Network (GRESIDAN). In the same model, GRESIDAN integrates three synergistic pipelines for object detection, occlusion detection, and occlusion removal. GRESIDAN uses a residually attentive block combining ResNet-18 and a multi-headed attention mechanism to improve feature extraction and detection accuracy in low-quality, occluded aerial images. A graphically attentive occlusion detection pipeline is implemented to handle occlusion, segment better, and mask out the occluder in the aerial image. The GRESIDAN model is validated on the COCO-2017 dataset and a custom private aerial object detection dataset, outperforming the state-of-the-art methods in handling occlusion and detecting objects. Our contributions provide a robust solution to the problem of detecting and handling occluded objects in aerial imagery, pushing the boundaries of automated visual recognition in challenging real-world scenarios. The code for public use in training and testing is available on <span><span>GitHub</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"125 ","pages":"Article 110429"},"PeriodicalIF":4.0000,"publicationDate":"2025-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003726","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning has rapidly advanced, enabling new applications such as object detection, text recognition, and occlusion handling. However, challenges remain in the detection of objects in complex environments such as aerial images where things like motion blur, low light, and significant occlusion occur. This paper addresses a similar challenge by introducing a novel supervised framework, the Graphically Residual Attentive Network (GRESIDAN). In the same model, GRESIDAN integrates three synergistic pipelines for object detection, occlusion detection, and occlusion removal. GRESIDAN uses a residually attentive block combining ResNet-18 and a multi-headed attention mechanism to improve feature extraction and detection accuracy in low-quality, occluded aerial images. A graphically attentive occlusion detection pipeline is implemented to handle occlusion, segment better, and mask out the occluder in the aerial image. The GRESIDAN model is validated on the COCO-2017 dataset and a custom private aerial object detection dataset, outperforming the state-of-the-art methods in handling occlusion and detecting objects. Our contributions provide a robust solution to the problem of detecting and handling occluded objects in aerial imagery, pushing the boundaries of automated visual recognition in challenging real-world scenarios. The code for public use in training and testing is available on GitHub.
图残差关注网络处理航空图像遮挡
深度学习迅速发展,实现了新的应用,如目标检测、文本识别和遮挡处理。然而,在复杂环境中检测物体的挑战仍然存在,比如航拍图像,其中会出现运动模糊、低光和严重遮挡等情况。本文通过引入一种新的监督框架——图形残差关注网络(GRESIDAN)来解决类似的挑战。在同一模型中,GRESIDAN集成了三个协同管道,用于目标检测、遮挡检测和遮挡去除。GRESIDAN使用结合ResNet-18和多头注意机制的剩余注意块来提高低质量、遮挡的航空图像的特征提取和检测精度。实现了一个图形关注的遮挡检测管道来处理遮挡,更好地分割,并掩盖掉航空图像中的遮挡。GRESIDAN模型在COCO-2017数据集和自定义私人空中目标检测数据集上进行了验证,在处理遮挡和检测目标方面优于最先进的方法。我们的贡献为航空图像中遮挡物体的检测和处理问题提供了一个强大的解决方案,在具有挑战性的现实场景中推动了自动视觉识别的界限。在培训和测试中公开使用的代码可在GitHub上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信