Johannes Schusterbauer 2 months ago
๐ค What if you could generate an entire image using just one continuous token?
๐ก It works if we leverage a self-supervised representation!
Meet RepTok๐ฆ: A generative model that encodes an image into a single continuous latent while keeping realism and semantics. ๐งต ๐