 
 Johannes Schusterbauer 14 days ago
 ๐ค What if you could generate an entire image using just one continuous token?
๐ก It works if we leverage a self-supervised representation!
Meet RepTok๐ฆ: A generative model that encodes an image into a single continuous latent while keeping realism and semantics. ๐งต ๐