AMICO: Amodal Instance Composition

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference Pub Date : 2022-10-11 DOI:10.48550/arXiv.2210.05828

Peiye Zhuang, D. Demandolx, Ayush Saraf, Xuejian Rong, Changil Kim, Jia-Bin Huang

引用次数: 0

Abstract

Image composition aims to blend multiple objects to form a harmonized image. Existing approaches often assume precisely segmented and intact objects. Such assumptions, however, are hard to satisfy in unconstrained scenarios. We present Amodal Instance Composition for compositing imperfect -- potentially incomplete and/or coarsely segmented -- objects onto a target image. We first develop object shape prediction and content completion modules to synthesize the amodal contents. We then propose a neural composition model to blend the objects seamlessly. Our primary technical novelty lies in using separate foreground/background representations and blending mask prediction to alleviate segmentation errors. Our results show state-of-the-art performance on public COCOA and KINS benchmarks and attain favorable visual results across diverse scenes. We demonstrate various image composition applications such as object insertion and de-occlusion.

查看原文本刊更多论文

AMICO:模态实例合成

图像合成的目的是将多个物体混合在一起，形成一个和谐的图像。现有的方法通常假设精确分割和完整的对象。然而，在不受约束的情况下，这些假设很难得到满足。我们提出了模态实例合成(Amodal Instance Composition)，用于将不完美(潜在的不完整和/或粗分割)的对象合成到目标图像上。我们首先开发了物体形状预测和内容补全模块来合成模态内容。然后，我们提出了一个神经合成模型来无缝地混合物体。我们的主要技术新颖之处在于使用单独的前景/背景表示和混合掩码预测来减轻分割错误。我们的研究结果在公共COCOA和KINS基准测试中显示了最先进的性能，并在不同的场景中获得了良好的视觉效果。我们演示了各种图像合成应用，如对象插入和去遮挡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference

自引率

0.00%

发文量