{"title":"PDSAM: Prompt-Driven SAM for Track Defect Detection","authors":"Yu Fang;Pan Tao;Tianrui Li;Fan Min","doi":"10.1109/TIM.2025.3583378","DOIUrl":null,"url":null,"abstract":"The track defect detection is critical for ensuring the safety and reliability of railway systems. Existing machine vision-based approaches are hindered by three key issues: high-time complexity stemming from end-to-end network training, limited availability of training data (with only a few hundred labeled images), and suboptimal prediction precision. To address these challenges, this article introduces the prompt-driven segment anything model (PDSAM), a novel image semantic segmentation framework that introduces a paradigm shift in problem formulation. The core contribution lies in reformulating the segmentation task as a prompt generation problem, which offers two correlated advantages. First, a simplified prompt generation network reduces both training time and data requirements compared with standalone segmentation networks. Second, an upscaling and visual prompting technique restores spatial resolution and mitigates the risk of local optima in feature optimization, enabling more precise and fine-grained segmentation outputs. Experimental evaluations on benchmark datasets demonstrate that PDSAM outperforms state-of-the-art methods in both prediction accuracy and computational efficiency for railway track defect detection. The proposed framework’s source code and pretrained models (PTMs) are publicly available to facilitate reproducibility and further research, accessible at: <uri>https://github.com/FreddyDylan/PDSAM/</uri>","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-17"},"PeriodicalIF":5.9000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11052718/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The track defect detection is critical for ensuring the safety and reliability of railway systems. Existing machine vision-based approaches are hindered by three key issues: high-time complexity stemming from end-to-end network training, limited availability of training data (with only a few hundred labeled images), and suboptimal prediction precision. To address these challenges, this article introduces the prompt-driven segment anything model (PDSAM), a novel image semantic segmentation framework that introduces a paradigm shift in problem formulation. The core contribution lies in reformulating the segmentation task as a prompt generation problem, which offers two correlated advantages. First, a simplified prompt generation network reduces both training time and data requirements compared with standalone segmentation networks. Second, an upscaling and visual prompting technique restores spatial resolution and mitigates the risk of local optima in feature optimization, enabling more precise and fine-grained segmentation outputs. Experimental evaluations on benchmark datasets demonstrate that PDSAM outperforms state-of-the-art methods in both prediction accuracy and computational efficiency for railway track defect detection. The proposed framework’s source code and pretrained models (PTMs) are publicly available to facilitate reproducibility and further research, accessible at: https://github.com/FreddyDylan/PDSAM/
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.