{"title":"An unsupervised video stabilization algorithm based on gyroscope image fusion","authors":"Zhengwei Ren , Mingrui Zou , Lin Bi , Ming Fang","doi":"10.1016/j.cag.2024.104154","DOIUrl":null,"url":null,"abstract":"<div><div>Video stabilization aims to enhance the visual quality by reducing jitter and ghosting artifacts caused by camera shaking, yet effectively stabilizing low-quality videos and from complex scenarios remains a significant challenge. While gyroscope-based approaches can address this issue, they struggle with depth variations and translational shaking. In this paper, we propose a coarse-to-fine, unsupervised deep learning video stabilization solution that integrates image and gyroscope data to address these challenges. Our approach excels in stabilizing videos under diverse conditions, managing depth changes, and handling both translational and rotational motion. We utilize gyroscope data to estimate the 3D camera rotation and apply LSTM to predict stable poses. Grid-based motion parameters address depth-related motion, generating a multi-grid warping field that mitigates the significant image jitter caused by camera rotation. Subsequently, we achieve the elimination of residual motion at the pixel level. PDCNet is used to generated confidence maps filter optical flow to minimize disturbances from prominent local areas, while a U-Net architecture smooths the optical flow, performing pixel-level warping to generating finely stabilized frames. Comparative analysis shows that our approach surpasses state-of-the-art methods, particularly in handling complex scenes and achieving stability in challenging conditions.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"126 ","pages":"Article 104154"},"PeriodicalIF":2.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Graphics-Uk","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0097849324002899","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Video stabilization aims to enhance the visual quality by reducing jitter and ghosting artifacts caused by camera shaking, yet effectively stabilizing low-quality videos and from complex scenarios remains a significant challenge. While gyroscope-based approaches can address this issue, they struggle with depth variations and translational shaking. In this paper, we propose a coarse-to-fine, unsupervised deep learning video stabilization solution that integrates image and gyroscope data to address these challenges. Our approach excels in stabilizing videos under diverse conditions, managing depth changes, and handling both translational and rotational motion. We utilize gyroscope data to estimate the 3D camera rotation and apply LSTM to predict stable poses. Grid-based motion parameters address depth-related motion, generating a multi-grid warping field that mitigates the significant image jitter caused by camera rotation. Subsequently, we achieve the elimination of residual motion at the pixel level. PDCNet is used to generated confidence maps filter optical flow to minimize disturbances from prominent local areas, while a U-Net architecture smooths the optical flow, performing pixel-level warping to generating finely stabilized frames. Comparative analysis shows that our approach surpasses state-of-the-art methods, particularly in handling complex scenes and achieving stability in challenging conditions.
期刊介绍:
Computers & Graphics is dedicated to disseminate information on research and applications of computer graphics (CG) techniques. The journal encourages articles on:
1. Research and applications of interactive computer graphics. We are particularly interested in novel interaction techniques and applications of CG to problem domains.
2. State-of-the-art papers on late-breaking, cutting-edge research on CG.
3. Information on innovative uses of graphics principles and technologies.
4. Tutorial papers on both teaching CG principles and innovative uses of CG in education.