A Decoder-Free Reconstruction Method for Semi-Supervised Rail Surface Defect Detection

Chen Liu;Zhenyu Shi;Shibo He;Shunpu Tang;Qianqian Yang
{"title":"A Decoder-Free Reconstruction Method for Semi-Supervised Rail Surface Defect Detection","authors":"Chen Liu;Zhenyu Shi;Shibo He;Shunpu Tang;Qianqian Yang","doi":"10.1109/TICPS.2024.3456758","DOIUrl":null,"url":null,"abstract":"Detecting defects on railway tracks is critical for the operation of high-speed trains. Despite a plethora of machine vision-based methods designed to tackle this problem, the majority adopt a supervised setting and demand considerable labeled training data, inclusive of defective samples, which is expensive and impractical. In this paper, we propose an <underline>I</u>nvertible <underline>R</u>econstruction neural <underline>N</u>etwork (IRNet) for semi-supervised rail surface defect detection, where only normal images are accessible during training. Firstly, we devise an information-preserving feature encoder comprising several invertible blocks. This structure safeguards subtle visual patterns distinguishing normal and defective images from being obscured by background information, guaranteed by its mathematical reversibility property. Second, to overcome the overgeneralization issue of conventional autoencoders caused by imperfectly crafted decoders, we propose a novel decoder-free reconstruction workflow based on the invertible feature encoder. Specifically, we force one portion of extracted features to approach a predefined constant tensor during the training stage by minimizing their mean squared error. Next, we feed the remained features and the predefined constant tensor backward into the encoder to reconstruct the original images. During the testing phase, we formulate an anomaly score that consolidates the reconstruction error and mean squared error to spot defects. Extensive experiments are conducted on 4 real-world datasets. Our method consistently outperforms state-of-the-art techniques, demonstrating an average increase of 8.5% on the F1 score.","PeriodicalId":100640,"journal":{"name":"IEEE Transactions on Industrial Cyber-Physical Systems","volume":"3 ","pages":"285-295"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Cyber-Physical Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10694737/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting defects on railway tracks is critical for the operation of high-speed trains. Despite a plethora of machine vision-based methods designed to tackle this problem, the majority adopt a supervised setting and demand considerable labeled training data, inclusive of defective samples, which is expensive and impractical. In this paper, we propose an Invertible Reconstruction neural Network (IRNet) for semi-supervised rail surface defect detection, where only normal images are accessible during training. Firstly, we devise an information-preserving feature encoder comprising several invertible blocks. This structure safeguards subtle visual patterns distinguishing normal and defective images from being obscured by background information, guaranteed by its mathematical reversibility property. Second, to overcome the overgeneralization issue of conventional autoencoders caused by imperfectly crafted decoders, we propose a novel decoder-free reconstruction workflow based on the invertible feature encoder. Specifically, we force one portion of extracted features to approach a predefined constant tensor during the training stage by minimizing their mean squared error. Next, we feed the remained features and the predefined constant tensor backward into the encoder to reconstruct the original images. During the testing phase, we formulate an anomaly score that consolidates the reconstruction error and mean squared error to spot defects. Extensive experiments are conducted on 4 real-world datasets. Our method consistently outperforms state-of-the-art techniques, demonstrating an average increase of 8.5% on the F1 score.
半监督轨道表面缺陷检测的无解码器重构方法
轨道缺陷检测对高速列车的运行至关重要。尽管有大量基于机器视觉的方法旨在解决这个问题,但大多数方法采用监督设置,需要大量标记的训练数据,包括有缺陷的样本,这既昂贵又不切实际。在本文中,我们提出了一种可逆重构神经网络(IRNet)用于半监督轨道表面缺陷检测,其中在训练过程中只有正常图像可访问。首先,我们设计了一个包含多个可逆块的信息保持特征编码器。这种结构通过其数学可逆性保证了区分正常和有缺陷图像的细微视觉模式不被背景信息所掩盖。其次,为了克服传统自编码器因解码器制作不完美而导致的过度泛化问题,我们提出了一种基于可逆特征编码器的新型无解码器重构工作流程。具体来说,我们通过最小化均方误差,在训练阶段强制提取的一部分特征接近预定义的常数张量。接下来,我们将剩余的特征和预定义的常数张量反向馈送到编码器中以重建原始图像。在测试阶段,我们制定了一个异常分数,它结合了重建误差和均方误差来发现缺陷。在4个真实数据集上进行了广泛的实验。我们的方法始终优于最先进的技术,证明F1得分平均提高8.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信