Xinyu Fan, Xuxu Yang, Feifei Hou, Cuipu Xi, Yijun Wang
{"title":"基于所提出的PLCA-pix2pixGAN方法的跟踪外目标图像增强","authors":"Xinyu Fan, Xuxu Yang, Feifei Hou, Cuipu Xi, Yijun Wang","doi":"10.1016/j.dibe.2025.100754","DOIUrl":null,"url":null,"abstract":"<div><div>The presence of foreign objects on railway tracks poses serious safety risks and may lead to accidents or service disruptions. However, existing detection systems based on deep learning are often constrained by small datasets, limited sample diversity, and low realism in synthesized training images. To address these issues, this paper proposes PLCA-pix2pixGAN (Perceptual Loss and Channel Attention Enhanced pix2pix GAN) to generate high-quality synthetic images for data augmentation. The method overlays object templates onto real-world track images to build a composite dataset and applies interpretable augmentation to simulate lighting and weather changes. To enhance fidelity, a channel attention mechanism enables region-aware reconstruction, and a multi-objective loss combines perceptual loss with adaptive weighting to balance pixel-level accuracy and semantic consistency. Experiments show the proposed method achieves an average SSIM of 0.9106 across object categories, demonstrating its effectiveness in generating realistic, structurally consistent images for safety-critical foreign object detection in railway systems.</div></div>","PeriodicalId":34137,"journal":{"name":"Developments in the Built Environment","volume":"24 ","pages":"Article 100754"},"PeriodicalIF":8.2000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Track foreign object image augmentation based on the proposed PLCA-pix2pixGAN method\",\"authors\":\"Xinyu Fan, Xuxu Yang, Feifei Hou, Cuipu Xi, Yijun Wang\",\"doi\":\"10.1016/j.dibe.2025.100754\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The presence of foreign objects on railway tracks poses serious safety risks and may lead to accidents or service disruptions. However, existing detection systems based on deep learning are often constrained by small datasets, limited sample diversity, and low realism in synthesized training images. To address these issues, this paper proposes PLCA-pix2pixGAN (Perceptual Loss and Channel Attention Enhanced pix2pix GAN) to generate high-quality synthetic images for data augmentation. The method overlays object templates onto real-world track images to build a composite dataset and applies interpretable augmentation to simulate lighting and weather changes. To enhance fidelity, a channel attention mechanism enables region-aware reconstruction, and a multi-objective loss combines perceptual loss with adaptive weighting to balance pixel-level accuracy and semantic consistency. Experiments show the proposed method achieves an average SSIM of 0.9106 across object categories, demonstrating its effectiveness in generating realistic, structurally consistent images for safety-critical foreign object detection in railway systems.</div></div>\",\"PeriodicalId\":34137,\"journal\":{\"name\":\"Developments in the Built Environment\",\"volume\":\"24 \",\"pages\":\"Article 100754\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Developments in the Built Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666165925001541\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Developments in the Built Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666165925001541","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Track foreign object image augmentation based on the proposed PLCA-pix2pixGAN method
The presence of foreign objects on railway tracks poses serious safety risks and may lead to accidents or service disruptions. However, existing detection systems based on deep learning are often constrained by small datasets, limited sample diversity, and low realism in synthesized training images. To address these issues, this paper proposes PLCA-pix2pixGAN (Perceptual Loss and Channel Attention Enhanced pix2pix GAN) to generate high-quality synthetic images for data augmentation. The method overlays object templates onto real-world track images to build a composite dataset and applies interpretable augmentation to simulate lighting and weather changes. To enhance fidelity, a channel attention mechanism enables region-aware reconstruction, and a multi-objective loss combines perceptual loss with adaptive weighting to balance pixel-level accuracy and semantic consistency. Experiments show the proposed method achieves an average SSIM of 0.9106 across object categories, demonstrating its effectiveness in generating realistic, structurally consistent images for safety-critical foreign object detection in railway systems.
期刊介绍:
Developments in the Built Environment (DIBE) is a recently established peer-reviewed gold open access journal, ensuring that all accepted articles are permanently and freely accessible. Focused on civil engineering and the built environment, DIBE publishes original papers and short communications. Encompassing topics such as construction materials and building sustainability, the journal adopts a holistic approach with the aim of benefiting the community.