{"title":"Rethinking interactive image matting as incremental Gaussian process regression problems","authors":"Bingjie Guo, Wenhui Huang","doi":"10.1016/j.knosys.2025.114410","DOIUrl":null,"url":null,"abstract":"<div><div>Interactive Image Matting (IIM) aims to predict alpha mattes through user interaction. Traditional methods often depend on user experience to interact at the regions where the alpha matte are inaccurate. However, regions with inaccurate model predictions do not necessarily correspond to areas of high model uncertainty, so these methods are unable to effectively reduce model uncertainty, resulting in low interaction efficiency. To address this issue, we observe a commonality between IIM tasks and Gaussian Process (GP) regression: the former predicts alpha values of unlabeled pixels based on user-labeled information, while the latter predicts observations of unknown data based on known data and provides uncertainty estimation for predictions. Based on this observation, we model IIM as an incremental GP regression problem and propose a novel IIM paradigm, IIM-GP. First, IIM-GP is the first model to incrementally utilize model-predicted uncertainty to guide user interaction and update matting results, significantly enhancing interaction efficiency and prediction reliability. Second, an incremental update strategy is implemented within the GP framework, overcoming traditional GP models’ inefficiency in updating results for IIM tasks. Additionally, IIM-GP employs a strategy of selecting <span><math><mi>p</mi></math></span> inducing points from <span><math><mi>n</mi></math></span> labeled pixels to perform variational inference on GP, reducing computational complexity from <span><math><mrow><mi>O</mi><mo>(</mo><msup><mi>n</mi><mn>3</mn></msup><mo>)</mo></mrow></math></span> to <span><math><mrow><mi>O</mi><mo>(</mo><mi>n</mi><msup><mi>p</mi><mn>2</mn></msup><mo>)</mo></mrow></math></span> (<span><math><mrow><mi>p</mi><mo>≪</mo><mi>n</mi></mrow></math></span>). Comprehensive experiments on five widely-used datasets (Composition-1k, AIM-500, Distinctions-646, HIM2K and AM-2K) demonstrate that IIM-GP achieves competitive performance.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"329 ","pages":"Article 114410"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125014492","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Interactive Image Matting (IIM) aims to predict alpha mattes through user interaction. Traditional methods often depend on user experience to interact at the regions where the alpha matte are inaccurate. However, regions with inaccurate model predictions do not necessarily correspond to areas of high model uncertainty, so these methods are unable to effectively reduce model uncertainty, resulting in low interaction efficiency. To address this issue, we observe a commonality between IIM tasks and Gaussian Process (GP) regression: the former predicts alpha values of unlabeled pixels based on user-labeled information, while the latter predicts observations of unknown data based on known data and provides uncertainty estimation for predictions. Based on this observation, we model IIM as an incremental GP regression problem and propose a novel IIM paradigm, IIM-GP. First, IIM-GP is the first model to incrementally utilize model-predicted uncertainty to guide user interaction and update matting results, significantly enhancing interaction efficiency and prediction reliability. Second, an incremental update strategy is implemented within the GP framework, overcoming traditional GP models’ inefficiency in updating results for IIM tasks. Additionally, IIM-GP employs a strategy of selecting inducing points from labeled pixels to perform variational inference on GP, reducing computational complexity from to (). Comprehensive experiments on five widely-used datasets (Composition-1k, AIM-500, Distinctions-646, HIM2K and AM-2K) demonstrate that IIM-GP achieves competitive performance.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.