Abstract
Multi-frame depth estimation methods typically assume a static scene, which fails when dynamic objects are present. We address this limitation by jointly reasoning about camera motion and per-pixel object motion, enabling reliable depth recovery in dynamic real-world scenes. See the project page for full details, code, and results.