DiCeption: diffusion generalist for multiple dense tasks.
Trains a single DiT to do monodepth, normals, segmentation, point-prompted segmentation.
Strong zero-shot performance, even beating specialists like DepthAnythingv2, albeit with 28 diffusion steps at inference
over 1 year ago