Spyros Gidaris
@spyrosgidaris.bsky.social
Senior Research Scientist at Valeo.ai (@valeoai.bsky.social)
reposted by
Spyros Gidaris
Gilles Puy
1 day ago
Update: ResearchGate has investigated the case, and, as far as I can see, all the suspicious papers (~200) have now been removed. Many thanks to the @researchgate.bsky.social team!
1
3
2
Three papers accepted to #NeurIPS2025 (one spotlight)! Awesome works in generative modeling, multi-token prediction, and future prediction. Congratulations to all collaborators! @nasosger.bsky.social, sta8is.bsky.social, @nicolabourbaki.bsky.social, @ikakogeorgiou.bsky.social & N. Komodakis!
3 days ago
1
10
1
reposted by
Spyros Gidaris
Gilles Puy
10 days ago
Discovered that our RangeViT paper keeps being cited in what might be LLM-generated papers. Number of citations increased rapidly in the last weeks. Too good to be true. Papers popped up on different platforms, but mainly on ResearchGate with ~80 papers in just 3 weeks. [1/]
1
5
7
reposted by
Spyros Gidaris
Andrei Bursuc
2 months ago
1/ Can open-data models beat DINOv2? Today we release Franca, a fully open-sourced vision foundation model. Franca with ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, DINOv2 on various benchmarks, setting a new standard for open-source research.
2
84
24
reposted by
Spyros Gidaris
Andrei Bursuc
3 months ago
1/ New & old work on self-supervised representation learning (SSL) with ViTs: MOCA ☕ - Predicting Masked Online Codebook Assignments w/ @spyrosgidaris.bsky.social, O. Simeoni, A. Vobecky, @matthieucord.bsky.social, N. Komodakis, @ptrkprz.bsky.social #TMLR #ICLR2025
Grab a ☕ & brace for a story & a 🧵
1
23
5
reposted by
Spyros Gidaris
Sophia Sirko-Galouchenko 🇺🇦
3 months ago
1/n New paper out - accepted at #ICCV2025! Introducing DIP: unsupervised post-training that enhances dense features in pretrained ViTs for dense in-context scene understanding. Below: Low-shot in-context semantic segmentation examples. DIP features outperform DINOv2!
1
19
8
reposted by
Spyros Gidaris
Paul Couairon
3 months ago
Thrilled to introduce JAFAR, a lightweight, flexible, plug-and-play module that upsamples features from any Foundation Vision Encoder to any desired output resolution (1/n)
Paper: arxiv.org/abs/2506.11136
Project Page: jafar-upsampler.github.io
Github: github.com/PaulCouairon...
1
26
6
reposted by
Spyros Gidaris
Giorgos Kordopatis-Zilos
3 months ago
Are you at @cvprconference.bsky.social? Come by our poster! Sat 14/6, 10:30-12:30, Poster #395, ExHall D
0
17
9
I am at #CVPR2025 this week in Nashville! Presenting "Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers" on multi-modal semantic future prediction. Come discuss! Fri 13 Jun 10:30-12:30, poster #345
bsky.app/profile/sta8...
3 months ago
0
6
2
reposted by
Spyros Gidaris
Thodoris Kouzelis
5 months ago
1/n Introducing ReDi (Representation Diffusion): a new generative approach that leverages a diffusion model to jointly capture low-level image details (via VAE latents) and high-level semantic features (via DINOv2) 🧵
1
21
4
reposted by
Spyros Gidaris
Andrei Bursuc
6 months ago
The @valeoai.bsky.social team is presenting a few exciting works @iclr-conf.bsky.social this year on masked generative transformers, adaptation of VLMs, self-supervised representation learning, neural solvers. #iclr2025 Check them out
0
8
1
reposted by
Spyros Gidaris
lebellig
7 months ago
Nice research work from @nicolabourbaki.bsky.social et al. Enhances latent generative models by regularizing the VAE's latent space with an equivariance loss. The finetuning process is straightforward + demonstrates improvements in just 5 epochs!
arxiv.org/abs/2502.09509
github.com/zelaki/eqvae
0
9
1
reposted by
Spyros Gidaris
Andrei Bursuc
7 months ago
Still mesmerized by this work and its results: a mid-to-end driving agent trained with self-play on just 8 maps on 1.6B km of driving (9500 years of subjective driving experience) smashes in off-the-shelf manner all existing benchmarks (nuPlan, CARLA, Waymax)
0
6
4
reposted by
Spyros Gidaris
Andrei Bursuc
7 months ago
EQ-VAE: Such a simple & cool trick to regularize multiple kinds of autoencoders: align reconstruction of transformed latents w/ the corresponding transformed inputs. REPA: 4x training speedup; MaskGIT: 2x training speedup; DiT-XL/2: 7x faster convergence. Kudos @nicolabourbaki.bsky.social et al.
0
9
2
reposted by
Spyros Gidaris
Eugene Vinitsky
7 months ago
The things I've found hardest about research have all been non-technical: maintaining confidence and self-esteem, not abandoning the work when it's too hard or stressful, finding time to learn new things. In comparison, the technical parts are much easier
5
60
10
reposted by
Spyros Gidaris
David Picard
7 months ago
Just a quick note that following requests, we trained a 512px version of our Coherence-Aware Diffusion model (CVPR'24) and updated the paper on arxiv: arxiv.org/abs/2405.20324
It has a package and pretrained models!
nicolas-dufour.github.io/cad.html
github.com/nicolas-dufo...
2
23
6
reposted by
Spyros Gidaris
Thodoris Kouzelis
7 months ago
1/n If you're working on generative image modeling, check out our latest work! We introduce EQ-VAE, a simple yet powerful regularization approach that makes latent representations equivariant to spatial transformations, leading to smoother latents and better generative models.
1
18
9
reposted by
Spyros Gidaris
8 months ago
1/n Excited to share our latest work: DINO-Foresight, a new framework for predicting the future states of scenes using Vision Foundation Model features! Links to the arXiv and GitHub
2
20
4
reposted by
Spyros Gidaris
Andrei Bursuc
8 months ago
This amazing team ❤️
1
19
3
reposted by
Spyros Gidaris
Andrei Bursuc
8 months ago
Thrilled to announce our workshop on Embodied Intelligence for Autonomous Systems on the Horizon @cvprconference.bsky.social featuring a crazy line-up of speakers and challenges. Mark it in your agendas and also in your registration #cvpr2025
opendrivelab.com/cvpr2025/wor...
0
26
6