MCD-YOLO: An Improved YOLOv11 Framework for Manhole Cover Detection in UAV Imagery

IF 4.4
Fengchang Li;Shaomei Li;Qing Xu;Zhenyan Yu;Ning You;Liunan Ren
{"title":"MCD-YOLO: An Improved YOLOv11 Framework for Manhole Cover Detection in UAV Imagery","authors":"Fengchang Li;Shaomei Li;Qing Xu;Zhenyan Yu;Ning You;Liunan Ren","doi":"10.1109/LGRS.2026.3663831","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicle (UAV) remote sensing provides an effective solution for monitoring urban infrastructure. Accurate detection of manhole covers in high-resolution UAV imagery is essential for the safe operation and maintenance of underground utility networks. However, detecting manhole covers in complex urban environments remains challenging due to their small size, visual similarity to surrounding structures, and frequent occlusion. To address these challenges, we propose a novel detection model termed manhole cover detection YOLO (MCD-YOLO). First, to exploit the regular geometric structure of manhole covers, we design an EdgeExtract module to enhance the C3k2 block in the backbone network. This module fuses image gradient information and high-frequency features to strengthen the geometric edge representation of manhole covers, thereby improving their discriminability against complex backgrounds. Second, we propose an oriented context interaction (OCI) module that employs multiorientation depthwise separable convolutions to capture both local features and global contextual dependencies, effectively suppressing interference from structurally similar background elements. Finally, we design a distribution-guided localization (DGL) module that dynamically calibrates classification confidence based on the statistical distribution of bounding box regression offsets, significantly reducing high-confidence false positives caused by localization errors under occlusion. Extensive experiments on our self-constructed manhole cover (MHC) dataset and the public VisDrone2019 dataset demonstrate the superior performance of MCD-YOLO.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"23 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11394791/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Unmanned aerial vehicle (UAV) remote sensing provides an effective solution for monitoring urban infrastructure. Accurate detection of manhole covers in high-resolution UAV imagery is essential for the safe operation and maintenance of underground utility networks. However, detecting manhole covers in complex urban environments remains challenging due to their small size, visual similarity to surrounding structures, and frequent occlusion. To address these challenges, we propose a novel detection model termed manhole cover detection YOLO (MCD-YOLO). First, to exploit the regular geometric structure of manhole covers, we design an EdgeExtract module to enhance the C3k2 block in the backbone network. This module fuses image gradient information and high-frequency features to strengthen the geometric edge representation of manhole covers, thereby improving their discriminability against complex backgrounds. Second, we propose an oriented context interaction (OCI) module that employs multiorientation depthwise separable convolutions to capture both local features and global contextual dependencies, effectively suppressing interference from structurally similar background elements. Finally, we design a distribution-guided localization (DGL) module that dynamically calibrates classification confidence based on the statistical distribution of bounding box regression offsets, significantly reducing high-confidence false positives caused by localization errors under occlusion. Extensive experiments on our self-constructed manhole cover (MHC) dataset and the public VisDrone2019 dataset demonstrate the superior performance of MCD-YOLO.
MCD-YOLO:一种改进的无人机图像井盖检测YOLOv11框架
无人机遥感技术为城市基础设施监控提供了有效的解决方案。在高分辨率无人机图像中准确检测井盖对于地下公用事业网络的安全运行和维护至关重要。然而,在复杂的城市环境中检测井盖仍然具有挑战性,因为它们体积小,视觉上与周围结构相似,并且经常遮挡。为了解决这些挑战,我们提出了一种新的检测模型,称为井盖检测YOLO (MCD-YOLO)。首先,利用井盖的规则几何结构,设计了EdgeExtract模块对骨干网络中的C3k2块进行增强。该模块融合图像梯度信息和高频特征,增强井盖的几何边缘表示,从而提高井盖对复杂背景的识别能力。其次,我们提出了一个面向上下文交互(OCI)模块,该模块采用多向深度可分卷积来捕获局部特征和全局上下文依赖,有效地抑制结构相似背景元素的干扰。最后,我们设计了一个基于边界盒回归偏移量统计分布动态校准分类置信度的分布导向定位(DGL)模块,显著降低了遮挡下定位错误导致的高置信度误报。在我们的自建井盖(MHC)数据集和公共VisDrone2019数据集上进行的大量实验表明,MCD-YOLO具有优越的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书