{"title":"面向全景立体直播系统的深度图像拼接","authors":"Yun Tie, Zihao Zhao, Dalong Zhang, Yuning Gao","doi":"10.1016/j.engappai.2025.111832","DOIUrl":null,"url":null,"abstract":"<div><div>Image stitching is a crucial task in computer vision, enabling the creation of panoramic images for augmented reality/virtual reality (AR/VR) experiences. Image registration plays an important role in achieving spatial continuity and consistency in image stitching. Therefore, achieving accurate image registration is critical to high-quality panoramic image generation. Although recent learning-based methods have improved the optimization of stitched images, they still rely on simple associations of images or feature maps to search for feature correspondences, neglecting the capture of these correspondences during feature extraction. Additionally, directly warping the original image with the homography matrix extracted from the feature space ignores the network’s influence on image transformation relationships. To address these limitations and enhance the quality of panoramic images, we propose an artificial intelligence (AI) image registration network based on a cross-attention mechanism. Our approach incorporates a Local Transformer to help the network perceive image correspondences. Furthermore, we impose constraints on network training through warp-equivariance to mitigate the impact on image transformation relationships. These strategies increase the accuracy and generalization of the AI-based registration method and form a better image stitching algorithm. Finally, we apply the AI image stitching algorithm to construct a panoramic stereoscopic live broadcast system. Experimental results show that our method achieves competitive results and satisfies the requirements of panoramic systems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"160 ","pages":"Article 111832"},"PeriodicalIF":8.0000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep image stitching for panoramic stereoscopic live broadcast system\",\"authors\":\"Yun Tie, Zihao Zhao, Dalong Zhang, Yuning Gao\",\"doi\":\"10.1016/j.engappai.2025.111832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Image stitching is a crucial task in computer vision, enabling the creation of panoramic images for augmented reality/virtual reality (AR/VR) experiences. Image registration plays an important role in achieving spatial continuity and consistency in image stitching. Therefore, achieving accurate image registration is critical to high-quality panoramic image generation. Although recent learning-based methods have improved the optimization of stitched images, they still rely on simple associations of images or feature maps to search for feature correspondences, neglecting the capture of these correspondences during feature extraction. Additionally, directly warping the original image with the homography matrix extracted from the feature space ignores the network’s influence on image transformation relationships. To address these limitations and enhance the quality of panoramic images, we propose an artificial intelligence (AI) image registration network based on a cross-attention mechanism. Our approach incorporates a Local Transformer to help the network perceive image correspondences. Furthermore, we impose constraints on network training through warp-equivariance to mitigate the impact on image transformation relationships. These strategies increase the accuracy and generalization of the AI-based registration method and form a better image stitching algorithm. Finally, we apply the AI image stitching algorithm to construct a panoramic stereoscopic live broadcast system. Experimental results show that our method achieves competitive results and satisfies the requirements of panoramic systems.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"160 \",\"pages\":\"Article 111832\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625018342\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625018342","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Deep image stitching for panoramic stereoscopic live broadcast system
Image stitching is a crucial task in computer vision, enabling the creation of panoramic images for augmented reality/virtual reality (AR/VR) experiences. Image registration plays an important role in achieving spatial continuity and consistency in image stitching. Therefore, achieving accurate image registration is critical to high-quality panoramic image generation. Although recent learning-based methods have improved the optimization of stitched images, they still rely on simple associations of images or feature maps to search for feature correspondences, neglecting the capture of these correspondences during feature extraction. Additionally, directly warping the original image with the homography matrix extracted from the feature space ignores the network’s influence on image transformation relationships. To address these limitations and enhance the quality of panoramic images, we propose an artificial intelligence (AI) image registration network based on a cross-attention mechanism. Our approach incorporates a Local Transformer to help the network perceive image correspondences. Furthermore, we impose constraints on network training through warp-equivariance to mitigate the impact on image transformation relationships. These strategies increase the accuracy and generalization of the AI-based registration method and form a better image stitching algorithm. Finally, we apply the AI image stitching algorithm to construct a panoramic stereoscopic live broadcast system. Experimental results show that our method achieves competitive results and satisfies the requirements of panoramic systems.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.