基于红外图像的传输线典型线钳分割

Ding-Kuo Huang, Jie Yang, Yunfeng Yan, Xiangwei Sun, Xiaoming Huang
{"title":"基于红外图像的传输线典型线钳分割","authors":"Ding-Kuo Huang, Jie Yang, Yunfeng Yan, Xiangwei Sun, Xiaoming Huang","doi":"10.1117/12.2667507","DOIUrl":null,"url":null,"abstract":"In view of the current image segmentation field, there are few studies on the segmentation of typical wire clamp components of transmission lines. Traditional image processing methods have low segmentation accuracy and require artificial design of feature extraction methods, which are usually only suitable for equipment of a certain structure with insufficient generalization. In this paper, an infrared image segmentation method based on Mask R-CNN (Mask region-based convolutional neural network) for typical guide-ground lines is proposed. Its structure takes Mask R-CNN model combined with FPN (Feature pyramid structure) as the basic framework, and uses RPN (Regional proposal network) to generate candidate regions. Features are extracted from each candidate region through RoI Align layer, and then connected to FC (Fully connected layer) to achieve target classification and bbox (bounding box) regression. A mask branch is also added to predict the segmentation mask. The design can integrate multi-scale and multi-level semantic information to improve the recognition rate when extracting image features. In addition, the network structure is optimized by single channel for infrared images to reduce the size of the model and make it more lightweight. Ablation experiments were performed on two GTX 2080Ti graphics cards to verify the effectiveness of the proposed structure, and the mAP (mean average accuracy) of 0.421 was achieved with an IoU (Intersection over Union) threshold of 0.5.","PeriodicalId":137914,"journal":{"name":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Typical wire clamps segmentation of transmission lines based on infrared image\",\"authors\":\"Ding-Kuo Huang, Jie Yang, Yunfeng Yan, Xiangwei Sun, Xiaoming Huang\",\"doi\":\"10.1117/12.2667507\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In view of the current image segmentation field, there are few studies on the segmentation of typical wire clamp components of transmission lines. Traditional image processing methods have low segmentation accuracy and require artificial design of feature extraction methods, which are usually only suitable for equipment of a certain structure with insufficient generalization. In this paper, an infrared image segmentation method based on Mask R-CNN (Mask region-based convolutional neural network) for typical guide-ground lines is proposed. Its structure takes Mask R-CNN model combined with FPN (Feature pyramid structure) as the basic framework, and uses RPN (Regional proposal network) to generate candidate regions. Features are extracted from each candidate region through RoI Align layer, and then connected to FC (Fully connected layer) to achieve target classification and bbox (bounding box) regression. A mask branch is also added to predict the segmentation mask. The design can integrate multi-scale and multi-level semantic information to improve the recognition rate when extracting image features. In addition, the network structure is optimized by single channel for infrared images to reduce the size of the model and make it more lightweight. Ablation experiments were performed on two GTX 2080Ti graphics cards to verify the effectiveness of the proposed structure, and the mAP (mean average accuracy) of 0.421 was achieved with an IoU (Intersection over Union) threshold of 0.5.\",\"PeriodicalId\":137914,\"journal\":{\"name\":\"International Conference on Artificial Intelligence, Virtual Reality, and Visualization\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Artificial Intelligence, Virtual Reality, and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2667507\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2667507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

针对目前的图像分割领域,对传输线典型线钳部件的分割研究较少。传统的图像处理方法分割精度低,需要人工设计特征提取方法,通常只适用于某种结构的设备,泛化程度不够。本文提出了一种基于Mask - cnn (Mask region-based convolutional neural network)的典型导地线红外图像分割方法。其结构以Mask R-CNN模型结合FPN (Feature pyramid structure)作为基本框架,使用RPN (Regional proposal network)生成候选区域。通过RoI Align层提取每个候选区域的特征,然后与FC (Fully connected layer)连接,实现目标分类和bbox (bounding box)回归。还添加了一个掩码分支来预测分割掩码。该设计能够融合多尺度、多层次的语义信息,提高图像特征提取的识别率。此外,针对红外图像采用单通道优化网络结构,减小模型尺寸,使其更轻量化。在两张GTX 2080Ti显卡上进行了消融实验,验证了所提出结构的有效性,在IoU (Intersection over Union)阈值为0.5的情况下,mAP(平均精度)达到0.421。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Typical wire clamps segmentation of transmission lines based on infrared image
In view of the current image segmentation field, there are few studies on the segmentation of typical wire clamp components of transmission lines. Traditional image processing methods have low segmentation accuracy and require artificial design of feature extraction methods, which are usually only suitable for equipment of a certain structure with insufficient generalization. In this paper, an infrared image segmentation method based on Mask R-CNN (Mask region-based convolutional neural network) for typical guide-ground lines is proposed. Its structure takes Mask R-CNN model combined with FPN (Feature pyramid structure) as the basic framework, and uses RPN (Regional proposal network) to generate candidate regions. Features are extracted from each candidate region through RoI Align layer, and then connected to FC (Fully connected layer) to achieve target classification and bbox (bounding box) regression. A mask branch is also added to predict the segmentation mask. The design can integrate multi-scale and multi-level semantic information to improve the recognition rate when extracting image features. In addition, the network structure is optimized by single channel for infrared images to reduce the size of the model and make it more lightweight. Ablation experiments were performed on two GTX 2080Ti graphics cards to verify the effectiveness of the proposed structure, and the mAP (mean average accuracy) of 0.421 was achieved with an IoU (Intersection over Union) threshold of 0.5.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信