|国家预印本平台
首页|Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning

Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning

Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning

来源:Arxiv_logoArxiv
英文摘要

Occlusion Boundary Estimation (OBE) identifies boundaries arising from both inter-object occlusions and self-occlusion within individual objects, distinguishing intrinsic object edges from occlusion-induced contours to improve scene understanding and 3D reconstruction capacity. This is closely related to Monocular Depth Estimation (MDE), which infers depth from a single image, as occlusion boundaries provide critical geometric cues for resolving depth ambiguities, while depth priors can conversely refine occlusion reasoning in complex scenes. In this paper, we propose a novel network, MoDOT, that first jointly estimates depth and OBs. We propose CASM, a cross-attention multi-scale strip convolution module, leverages mid-level OB features to significantly enhance depth prediction. Additionally, we introduce an occlusion-aware loss function, OBDCL, which encourages sharper and more accurate depth boundaries. Extensive experiments on both real and synthetic datasets demonstrate the mutual benefits of jointly estimating depth and OB, and highlight the effectiveness of our model design. Our method achieves the state-of-the-art (SOTA) on both our proposed synthetic datasets and one popular real dataset, NYUD-v2, significantly outperforming multi-task baselines. Besides, without domain adaptation, results on real-world depth transfer are comparable to the competitors, while preserving sharp occlusion boundaries for geometric fidelity. We will release our code, pre-trained models, and datasets to support future research in this direction.

Lintao Xu、Yinghao Wang、Chaohui Wang

计算技术、计算机技术

Lintao Xu,Yinghao Wang,Chaohui Wang.Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning[EB/OL].(2025-05-27)[2025-06-14].https://arxiv.org/abs/2505.21231.点此复制

评论