Johannes Schusterbauer
@joh-schb.bsky.social
📤 309
📥 327
📝 19
PhD Student @ CompVis group, LMU Munich Working on diffusion & flow models🫶
pinned post!
🤔 What if you could generate an entire image using just one continuous token? 💡 It works if we leverage a self-supervised representation! Meet RepTok🦎: A generative model that encodes an image into a single continuous latent while keeping realism and semantics. 🧵 👇
14 days ago
1
9
5
reposted by
Johannes Schusterbauer
Pingchuan Ma
13 days ago
I’m thrilled to share that I’ll present two first-authored papers at
#ICCV2025
🌺 in Honolulu together with
@mgui7.bsky.social
! 🏝️ (Thread 🧵👇)
1
5
4
reposted by
Johannes Schusterbauer
Stefan Baumann
16 days ago
🤔 What happens when you poke a scene — and your model has to predict how the world moves in response? We built the Flow Poke Transformer (FPT) to model multi-modal scene dynamics from sparse interactions. It learns to predict the 𝘥𝘪𝘴𝘵𝘳𝘪𝘣𝘶𝘵𝘪𝘰𝘯 of motion itself 🧵👇
1
23
9
Looking forward to attending
#CVPR2025
in Nashville next week 🎸🎶
@mgui7.bsky.social
and I will be presenting our latest work: 🌊 Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
5 months ago
1
3
1
Sunrise in the office after the
#ICCV
deadline night with
@mgui7.bsky.social
🚀
8 months ago
1
13
2
reposted by
Johannes Schusterbauer
CompVis - Computer Vision and Learning LMU Munich
9 months ago
www.youtube.com/watch?v=bCy6...
Building a New Foundation Model (Björn Ommer) | DLD25
YouTube video by DLD Conference
https://www.youtube.com/watch?v=bCy6TDktTUw
0
10
2
reposted by
Johannes Schusterbauer
Jan-Hendrik Müller
10 months ago
Over 60 German universities and research institutions announced their departure from X today.
0
77
15
reposted by
Johannes Schusterbauer
Pingchuan Ma
10 months ago
🤔 When combining vision-language models (VLMs) with large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks? 🤨 Interested? Check out our latest work at
#AAAI25
: 💻Code and 📝Paper at:
github.com/CompVis/DisCLIP
🧵👇
1
15
8
Congrats to
@frankfundel.bsky.social
for publishing this work at WACV🔥 Has been a pleasure to jointly work on this topic with such a talented master's student🤗 Looking forward to seeing what comes next!🚀
11 months ago
0
4
0
Awesome work from some colleagues cleaning up diffusion features!🚀
11 months ago
0
6
0
reposted by
Johannes Schusterbauer
Sander Dieleman
11 months ago
IMO VQGAN is why GANs deserve the NeurIPS test of time award. Suddenly our image representations were an order of magnitude more compact. Absolute game changer for generative modelling at scale, and the basis for latent diffusion models.
Taming Transformers for High-Resolution Image Synthesis
Designed to learn long-range interactions on sequential data, transformers continue to show state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no inductive bias tha...
https://arxiv.org/abs/2012.09841
2
104
17
reposted by
Johannes Schusterbauer
Anton Obukhov
11 months ago
Check out my GenAI starter pack!
go.bsky.app/BT1bRvZ
0
10
3
reposted by
Johannes Schusterbauer
Stefan Baumann
11 months ago
After many years, our lab finally has a social media presence at
@compvis.bsky.social
! 🥳 Give it a follow, we have some amazing research on generative computer vision coming soon!
0
19
2
reposted by
Johannes Schusterbauer
Nick Stracke
11 months ago
me right now..
4
48
3
reposted by
Johannes Schusterbauer
Sander Dieleman
12 months ago
In a gratuitous attempt to acquire more followers myself 😁, I've made a start on a "starter pack". Hopefully as more people from 🐦 make it over to 🦋, we can extend this a bit. Suggestions welcome! I've noticed not all accounts seem to be eligible to be added, anyone know what's up with that? 🤔
34
125
47