Yeongjoon Kim , Sunkyu Kwon , Donggoo Kang , Hyunmin Lee , Joonki Paik
{"title":"Enhancing video frame interpolation with region of motion loss and self-attention mechanisms: A dual approach to address large, nonlinear motions","authors":"Yeongjoon Kim , Sunkyu Kwon , Donggoo Kang , Hyunmin Lee , Joonki Paik","doi":"10.1016/j.neucom.2024.128728","DOIUrl":null,"url":null,"abstract":"<div><div>Video frame interpolation is particularly challenging when dealing with large and non-linear object motions, often resulting in poor frame quality and motion artifacts. In this study, we introduce a novel dual-approach methodology for video frame interpolation that effectively addresses these complexities. Our method consists of two key components: a Region of Motion (RoM) loss and self-attention mechanisms. The RoM loss is designed to spotlight significant movements within frames. This is achieved by employing feature-matching techniques that assign tailored weights during the training process, ensuring that areas of intense motion are given priority. This is facilitated by the computation of optical flow, which identifies crucial feature points and highlights regions of significant motion for targeted enhancement. Our method incorporates self-attention mechanisms to maintain inter-frame continuity while emphasizing the unique attributes of individual frames. The self-attention scores reduce motion discrepancies and enhance the distinctiveness and texture quality of each frame. We validate the efficacy of our approach through extensive evaluations on benchmark datasets, including Vimeo-90K, Middlebury, UCF101, and SNU-Film.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"614 ","pages":"Article 128728"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224014991","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Video frame interpolation is particularly challenging when dealing with large and non-linear object motions, often resulting in poor frame quality and motion artifacts. In this study, we introduce a novel dual-approach methodology for video frame interpolation that effectively addresses these complexities. Our method consists of two key components: a Region of Motion (RoM) loss and self-attention mechanisms. The RoM loss is designed to spotlight significant movements within frames. This is achieved by employing feature-matching techniques that assign tailored weights during the training process, ensuring that areas of intense motion are given priority. This is facilitated by the computation of optical flow, which identifies crucial feature points and highlights regions of significant motion for targeted enhancement. Our method incorporates self-attention mechanisms to maintain inter-frame continuity while emphasizing the unique attributes of individual frames. The self-attention scores reduce motion discrepancies and enhance the distinctiveness and texture quality of each frame. We validate the efficacy of our approach through extensive evaluations on benchmark datasets, including Vimeo-90K, Middlebury, UCF101, and SNU-Film.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.