DiCeption: diffusion generalist for multiple dense tasks.
Trains a single DiT to do monodepth, normals, segmentation, point-prompted segmentation.
Strong zero-shot performance, even beating specialists like DepthAnythingv2, albeit with 28 diffusion steps at inference
12 months ago