Paper Title
Learning Occlusion-Aware Coarse-to-Fine Depth Map for Self-supervised Monocular Depth Estimation
Paper Authors
Paper Abstract
Self-supervised monocular depth estimation, which aims to learn scene depth from single images in a self-supervised manner, has received much attention recently. Despite recent efforts in this field, how to learn accurate scene depth and alleviate the negative influence of occlusions in self-supervised depth estimation remains an open problem. To address this problem, we first empirically analyze the effects of the continuous and discrete depth constraints that are widely used in the training processes of many existing works. Then, inspired by this analysis, we propose a novel network, called OCFD-Net, that learns an Occlusion-aware Coarse-to-Fine Depth map for self-supervised monocular depth estimation. Given an arbitrary training set of stereo image pairs, the proposed OCFD-Net not only employs a discrete depth constraint to learn a coarse-level depth map, but also employs a continuous depth constraint to learn a scene depth residual, resulting in a fine-level depth map. In addition, an occlusion-aware module is designed within the proposed OCFD-Net, improving the ability of the learned fine-level depth map to handle occlusions. Experimental results on KITTI demonstrate that the proposed method outperforms comparative state-of-the-art methods on seven commonly used metrics in most cases. Moreover, experimental results on Make3D demonstrate the cross-dataset generalization ability of the proposed method under four commonly used metrics. The code is available at https://github.com/ZM-Zhou/OCFD-Net_pytorch.
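To make the coarse-to-fine idea in the abstract concrete, below is a minimal, hypothetical PyTorch sketch of how a coarse depth map built from discrete depth hypotheses might be refined by a continuous residual into a fine-level depth map. The function name `coarse_to_fine_depth`, the log-space discretization, and the residual scaling are assumptions made for illustration only, not the authors' implementation; the official code is in the linked repository.

```python
import math

import torch
import torch.nn.functional as F


def coarse_to_fine_depth(logits, residual, d_min=1.0, d_max=80.0):
    """Compose a fine-level depth map from discrete and continuous predictions.

    logits:   (B, K, H, W) per-pixel scores over K discrete depth hypotheses
              (the part trained with a discrete depth constraint).
    residual: (B, 1, H, W) continuous scene depth residual
              (the part trained with a continuous depth constraint).
    """
    num_bins = logits.shape[1]

    # Hypothetical discretization: K depth candidates spaced uniformly in log space.
    bins = torch.logspace(
        math.log10(d_min), math.log10(d_max), num_bins, device=logits.device
    ).view(1, num_bins, 1, 1)

    # Coarse-level depth: probability-weighted average of the discrete hypotheses.
    probs = F.softmax(logits, dim=1)
    coarse_depth = (probs * bins).sum(dim=1, keepdim=True)

    # Fine-level depth: the coarse depth refined by the continuous residual.
    # Scaling the residual by the log-space bin width is purely an illustrative choice.
    log_bin_width = (math.log(d_max) - math.log(d_min)) / (num_bins - 1)
    fine_depth = coarse_depth * torch.exp(residual * log_bin_width)
    return coarse_depth, fine_depth


# Toy usage with random tensors standing in for network outputs.
logits = torch.randn(2, 64, 96, 320)
residual = torch.tanh(torch.randn(2, 1, 96, 320))
coarse, fine = coarse_to_fine_depth(logits, residual)
print(coarse.shape, fine.shape)  # both torch.Size([2, 1, 96, 320])
```

In this sketch the discrete branch pins the depth to a plausible hypothesis range, while the continuous residual adjusts it smoothly between neighboring hypotheses, which mirrors the coarse-to-fine decomposition described in the abstract; the occlusion-aware module mentioned there is not modeled here.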