{"title":"基于尺度变换变压器的电网场景语义分割","authors":"Wenjie Pan, Linhan Huang, Yutao Chen, Yuqing Fu, Jianqing Zhu, Yibing Zhan","doi":"10.1007/s10489-025-06883-7","DOIUrl":null,"url":null,"abstract":"<div><p>Semantic segmentation of power grids is challenging due to size variations and intricate deformations caused by different shooting distances and angles. Traditional hierarchical architectures and pyramidal methods can learn multi-scale features to address size variations but struggle with deformations due to fixed aspect ratios of features. To address this issue, we propose a scale-transforming transformer (STT) approach. Our approach’s novelty lies in a scale-transforming module (STM), which implements cost-effective aspect ratio adjustments, patch splitting, and patch combining. This process generates local patches comprising various versions of the original patch, characterized by distinct aspect ratios and scales. In particular, this approach ensures that the output and input feature maintain uniform dimensions. We also control computational loads through a channel grouping strategy, which deploys different STMs in distinct feature groups. Consequently, our STM seamlessly integrates into existing transformer models to build STT models. Experiments show that our STT models achieve state-of-the-art performance. </p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 15","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic segmentation in power grid scenarios using scale-transforming transformer\",\"authors\":\"Wenjie Pan, Linhan Huang, Yutao Chen, Yuqing Fu, Jianqing Zhu, Yibing Zhan\",\"doi\":\"10.1007/s10489-025-06883-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Semantic segmentation of power grids is challenging due to size variations and intricate deformations caused by different shooting distances and angles. Traditional hierarchical architectures and pyramidal methods can learn multi-scale features to address size variations but struggle with deformations due to fixed aspect ratios of features. To address this issue, we propose a scale-transforming transformer (STT) approach. Our approach’s novelty lies in a scale-transforming module (STM), which implements cost-effective aspect ratio adjustments, patch splitting, and patch combining. This process generates local patches comprising various versions of the original patch, characterized by distinct aspect ratios and scales. In particular, this approach ensures that the output and input feature maintain uniform dimensions. We also control computational loads through a channel grouping strategy, which deploys different STMs in distinct feature groups. Consequently, our STM seamlessly integrates into existing transformer models to build STT models. Experiments show that our STT models achieve state-of-the-art performance. </p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 15\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-025-06883-7\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06883-7","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Semantic segmentation in power grid scenarios using scale-transforming transformer
Semantic segmentation of power grids is challenging due to size variations and intricate deformations caused by different shooting distances and angles. Traditional hierarchical architectures and pyramidal methods can learn multi-scale features to address size variations but struggle with deformations due to fixed aspect ratios of features. To address this issue, we propose a scale-transforming transformer (STT) approach. Our approach’s novelty lies in a scale-transforming module (STM), which implements cost-effective aspect ratio adjustments, patch splitting, and patch combining. This process generates local patches comprising various versions of the original patch, characterized by distinct aspect ratios and scales. In particular, this approach ensures that the output and input feature maintain uniform dimensions. We also control computational loads through a channel grouping strategy, which deploys different STMs in distinct feature groups. Consequently, our STM seamlessly integrates into existing transformer models to build STT models. Experiments show that our STT models achieve state-of-the-art performance.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.