基于自监督变压器的高速铁路轨道几何少弹离群点分类

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yu Liu, Jinzhao Liu, Wenxuan Zhang, Sen Yang, Kai Tao, Fei Yang
{"title":"基于自监督变压器的高速铁路轨道几何少弹离群点分类","authors":"Yu Liu,&nbsp;Jinzhao Liu,&nbsp;Wenxuan Zhang,&nbsp;Sen Yang,&nbsp;Kai Tao,&nbsp;Fei Yang","doi":"10.1016/j.asoc.2025.113281","DOIUrl":null,"url":null,"abstract":"<div><div>External disturbances, data transmission, sensor signal offsets and weather conditions are common sources of outliers in track geometry inspection data. The infrequent occurrence of these outliers leads to a scarcity of labeled samples, making high-accuracy few-shot classification challenging with traditional supervised learning methods. To address this, we propose a method for the few-shot classification of outliers in high-speed railway track geometry inspection data based on a self-supervised transformer. First, the self-supervised transformer pre-trains on a substantial amount of unlabeled data, enabling the model to learn and extract fundamental features and patterns from the inspection data. Next, the limited labeled outliers are used to fine-tune the model, enhancing its adaptability to the task of classifying outliers. This approach allows for the automatic identification and classification of outliers even with limited labeled data and without prior knowledge. Experimental results demonstrate that the proposed method accurately identifies and classifies outliers such as local burr, turnout gauge widening, constant section of unilateral gauge data, and abnormal distribution in track geometry inspection data, achieving a classification accuracy of up to 97.8 % and an F1-score as high as 97.9 %. This performance surpasses five supervised baselines by 4–26 % in accuracy and by 5–35 % in F1-score. Moreover, the method maintains an accuracy rate exceeding 92 % across different inspection trains and lines, demonstrating excellent generalization performance.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113281"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-shot outliers classification in high-speed railway track geometry based on self-supervised transformer\",\"authors\":\"Yu Liu,&nbsp;Jinzhao Liu,&nbsp;Wenxuan Zhang,&nbsp;Sen Yang,&nbsp;Kai Tao,&nbsp;Fei Yang\",\"doi\":\"10.1016/j.asoc.2025.113281\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>External disturbances, data transmission, sensor signal offsets and weather conditions are common sources of outliers in track geometry inspection data. The infrequent occurrence of these outliers leads to a scarcity of labeled samples, making high-accuracy few-shot classification challenging with traditional supervised learning methods. To address this, we propose a method for the few-shot classification of outliers in high-speed railway track geometry inspection data based on a self-supervised transformer. First, the self-supervised transformer pre-trains on a substantial amount of unlabeled data, enabling the model to learn and extract fundamental features and patterns from the inspection data. Next, the limited labeled outliers are used to fine-tune the model, enhancing its adaptability to the task of classifying outliers. This approach allows for the automatic identification and classification of outliers even with limited labeled data and without prior knowledge. Experimental results demonstrate that the proposed method accurately identifies and classifies outliers such as local burr, turnout gauge widening, constant section of unilateral gauge data, and abnormal distribution in track geometry inspection data, achieving a classification accuracy of up to 97.8 % and an F1-score as high as 97.9 %. This performance surpasses five supervised baselines by 4–26 % in accuracy and by 5–35 % in F1-score. Moreover, the method maintains an accuracy rate exceeding 92 % across different inspection trains and lines, demonstrating excellent generalization performance.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113281\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625005927\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625005927","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

外部干扰、数据传输、传感器信号偏移和天气条件是轨道几何检测数据异常值的常见来源。这些异常值的罕见出现导致标记样本的稀缺性,使得传统的监督学习方法对高精度的少量分类提出了挑战。针对这一问题,提出了一种基于自监督变压器的高速铁路轨道几何检测数据中异常点的少镜头分类方法。首先,自监督变压器在大量未标记数据上进行预训练,使模型能够从检测数据中学习和提取基本特征和模式。其次,利用有限的标记异常值对模型进行微调,增强其对异常值分类任务的适应性。这种方法允许对异常值进行自动识别和分类,即使有有限的标记数据和没有先验知识。实验结果表明,该方法能够准确识别和分类轨道几何检测数据中的局部毛刺、道岔轨加宽、单侧轨数据等异常分布等异常值,分类准确率高达97.8% %,f1得分高达97.9% %。这一表现在准确率上超过了5条监督基线4-26 %,在f1得分上超过了5-35 %。此外,该方法在不同的检测列车和线路上保持了超过92 %的准确率,显示出良好的泛化性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Few-shot outliers classification in high-speed railway track geometry based on self-supervised transformer
External disturbances, data transmission, sensor signal offsets and weather conditions are common sources of outliers in track geometry inspection data. The infrequent occurrence of these outliers leads to a scarcity of labeled samples, making high-accuracy few-shot classification challenging with traditional supervised learning methods. To address this, we propose a method for the few-shot classification of outliers in high-speed railway track geometry inspection data based on a self-supervised transformer. First, the self-supervised transformer pre-trains on a substantial amount of unlabeled data, enabling the model to learn and extract fundamental features and patterns from the inspection data. Next, the limited labeled outliers are used to fine-tune the model, enhancing its adaptability to the task of classifying outliers. This approach allows for the automatic identification and classification of outliers even with limited labeled data and without prior knowledge. Experimental results demonstrate that the proposed method accurately identifies and classifies outliers such as local burr, turnout gauge widening, constant section of unilateral gauge data, and abnormal distribution in track geometry inspection data, achieving a classification accuracy of up to 97.8 % and an F1-score as high as 97.9 %. This performance surpasses five supervised baselines by 4–26 % in accuracy and by 5–35 % in F1-score. Moreover, the method maintains an accuracy rate exceeding 92 % across different inspection trains and lines, demonstrating excellent generalization performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信