Alexander Kolesnikov
@kolesnikov.ch
📤 794
📥 71
📝 20
reposted by
Alexander Kolesnikov
Andreas Steiner
10 months ago
Looking for a small or medium sized VLM? PaliGemma 2 spans more than 150x of compute! Not sure yet if you want to invest the time 🪄finetuning🪄 on your data? Give it a try with our ready-to-use "mix" checkpoints: 🤗
huggingface.co/blog/paligem...
🎤
developers.googleblog.com/en/introduci...
0
19
7
With some delay, JetFormer's *prequel* paper is finally out on arXiv: a radically simple ViT-based normalizing flow (NF) model that achieves SOTA results in its class. Jet is one of the key components of JetFormer, deserving a standalone report. Let's unpack: 🧵⬇️
12 months ago
2
42
8
Paligemma2 is out! Bigger models, better results. For the best experience, do not forget to finetune. Congrats Paligemma2 team!
add a skeleton here at some point
about 1 year ago
0
13
1
Ok, it is yesterdays news already, but good night sleep is important. After 7 amazing years at Google Brain/DM, I am joining OpenAI. Together with
@xzhai.bsky.social
and
@giffmana.ai
, we will establish OpenAI Zurich office. Proud of our past work and looking forward to the future.
about 1 year ago
9
117
17
reposted by
Alexander Kolesnikov
Sander Dieleman
about 1 year ago
In
arxiv.org/abs/2303.00848
,
@dpkingma.bsky.social
and
@ruiqigao.bsky.social
had suggested that noise augmentation could be used to make other likelihood-based models optimise perceptually weighted losses, like diffusion models do. So cool to see this working well in practice!
add a skeleton here at some point
0
52
11
The answer has just dropped:
bsky.app/profile/kole...
add a skeleton here at some point
about 1 year ago
2
15
2
I always dreamed of a model that simultaneously 1. optimizes NLL of raw pixel data, 2. generates competitive high-res. natural images, 3. is practical. But it seemed too good to be true. Until today! Our new JetFormer model (
arxiv.org/abs/2411.19722
) ticks on all of these. 🧵
add a skeleton here at some point
about 1 year ago
2
37
5
you reached the end!!
feeds!
log in