Boosting Monocular Metric Depth Estimation via Bokeh Rendering 文章

ArXiv CS.CV2026-05-26NEWSen作者: Hangwei Zhang, Armando Fortes, Tianyi Wei, Xingang Pan

摘要

arXiv:2512.12425v2 Announce Type: replace Abstract: Bokeh rendering and depth estimation share a fundamental optical connection, yet existing methods fail to fully exploit this reciprocity. Conventional bokeh pipelines rely heavily on noisy depth maps that inevitably introduce visual artifacts. Conversely, existing monocular depth models typically follow two flawed paradigms. Generative diffusion-based frameworks often lack consistent metric scale. Meanwhile, feed-forward metric depth models frequently fail in textureless or distant regions where defocus blur can provide geometric information. We propose BokehDepth, a two-stage framework that treats synthetic defocus as a supervision-free geometric signal. In the first stage, a physically grounded generative model produces calibrated bokeh stacks from a single sharp input without requiring prior depth input.