We introduce the spatial and temporal attention mechanism behind DUSt3R, enabling robust dynamic object segmentation.
This segmentation efficiently disentangles object and camera motion, enabling DUSt3R to robustly reconstruct 4D.
4D Reconstruction
Static Scene
Dynamic Object
Results are downsampled for efficient online rendering