{"title":"O2Flow:对象感知光流估计","authors":"Ziyi Liu, Jiawei Wang, Haixu Bi, Hongmin Liu","doi":"10.1016/j.patrec.2025.06.023","DOIUrl":null,"url":null,"abstract":"<div><div>Optical flow estimation is a fundamental task in computer vision that aims to obtain dense pixel motion of adjacent frames. It is widely considered a low-level vision task. Existing methods mainly focus on capturing local or global matching clues for pixel motion modeling, while neglecting object-level information. However, human perception of motion is closely linked to high-level object understanding. To introduce the object awareness into the optical flow estimation pipeline, we propose an Object-aware Optical Flow Estimation framework (O<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>Flow) comprising two branches: the Object Awareness (OA) Branch and the Flow Prediction (FP) Branch. Specifically, the FP-branch serves the basic optical flow prediction function, while the OA-branch is designed to capture the object-level information, guided by an auxiliary moving object prediction task. Extensive experimental results demonstrate that our method significantly enhances optical flow estimation performance in these challenging regions. Compared with the other two-view methods, O<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>Flow achieves state-of-the-art results on the Sintel and KITTI-2015 benchmarks.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"197 ","pages":"Pages 58-64"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"O2Flow: Object-aware optical flow estimation\",\"authors\":\"Ziyi Liu, Jiawei Wang, Haixu Bi, Hongmin Liu\",\"doi\":\"10.1016/j.patrec.2025.06.023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optical flow estimation is a fundamental task in computer vision that aims to obtain dense pixel motion of adjacent frames. It is widely considered a low-level vision task. Existing methods mainly focus on capturing local or global matching clues for pixel motion modeling, while neglecting object-level information. However, human perception of motion is closely linked to high-level object understanding. To introduce the object awareness into the optical flow estimation pipeline, we propose an Object-aware Optical Flow Estimation framework (O<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>Flow) comprising two branches: the Object Awareness (OA) Branch and the Flow Prediction (FP) Branch. Specifically, the FP-branch serves the basic optical flow prediction function, while the OA-branch is designed to capture the object-level information, guided by an auxiliary moving object prediction task. Extensive experimental results demonstrate that our method significantly enhances optical flow estimation performance in these challenging regions. Compared with the other two-view methods, O<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>Flow achieves state-of-the-art results on the Sintel and KITTI-2015 benchmarks.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"197 \",\"pages\":\"Pages 58-64\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S016786552500251X\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016786552500251X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Optical flow estimation is a fundamental task in computer vision that aims to obtain dense pixel motion of adjacent frames. It is widely considered a low-level vision task. Existing methods mainly focus on capturing local or global matching clues for pixel motion modeling, while neglecting object-level information. However, human perception of motion is closely linked to high-level object understanding. To introduce the object awareness into the optical flow estimation pipeline, we propose an Object-aware Optical Flow Estimation framework (OFlow) comprising two branches: the Object Awareness (OA) Branch and the Flow Prediction (FP) Branch. Specifically, the FP-branch serves the basic optical flow prediction function, while the OA-branch is designed to capture the object-level information, guided by an auxiliary moving object prediction task. Extensive experimental results demonstrate that our method significantly enhances optical flow estimation performance in these challenging regions. Compared with the other two-view methods, OFlow achieves state-of-the-art results on the Sintel and KITTI-2015 benchmarks.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.