Thibaut Loiseau
@thibautloiseau.bsky.social
π€ 410
π₯ 274
π 26
PhD Student at IMAGINE (ENPC) Working on camera pose estimation thibautloiseau.github.io
pinned post!
1/13 π Introducing our latest work on improving relative camera pose regression with a novel pre-training approach Alligat0R (
arxiv.org/abs/2503.07561
)!
@gbourmaud.bsky.social
@vincentlepetit.bsky.social
7 months ago
4
20
5
reposted by
Thibaut Loiseau
Nicolas Dufour
about 2 months ago
π DinoV3 just became the new go-to backbone for geoloc! It outperforms CLIP-like models (SigLip2, finetuned StreetCLIP)β¦ and thatβs shocking π€― Why? CLIP models have an innate advantage β they literally learn place names + images. DinoV3 doesnβt.
1
46
15
reposted by
Thibaut Loiseau
Imagine-ENPC
4 months ago
Some of our IMAGINE members at
#CVPR2025
0
34
7
reposted by
Thibaut Loiseau
Vincent Lepetit
4 months ago
I am heartbroken that I am not at the conference, but seeing what the government is doing to its people and the world, I simply couldn't go there.
1
21
6
reposted by
Thibaut Loiseau
Imagine-ENPC
5 months ago
Looking forward to
#CVPR2025
! We will present the following papers:
1
28
8
reposted by
Thibaut Loiseau
Nicolas Dufour
6 months ago
This is an idea I've had for a while, but wow, it's working way better than expected! π The model looks really promising, even though it's just 256px for now.
1
7
3
reposted by
Thibaut Loiseau
Lucas Ventura
6 months ago
Introducing Chapter-Llama
#CVPR2025
, a framework for π―π’πππ¨ ππ‘ππ©πππ«π’π§π using Large Language Models! π¬π¦ Check it out: π Paper:
arxiv.org/abs/2504.00072
π Project:
imagine.enpc.fr/~lucas.ventu...
π» Code:
github.com/lucas-ventur...
π€ Demo:
huggingface.co/spaces/lucas...
loading . . .
1
24
5
reposted by
Thibaut Loiseau
David Picard
7 months ago
π₯π₯π₯ CV Folks, I have some news! We're organizing a 1-day meeting in center Paris on June 6th before CVPR called CVPR@Paris (similar as NeurIPS@Paris) π₯πΎπ₯π· Registration is open (it's free) with priority given to authors of accepted papers:
cvprinparis.github.io/CVPR2025InPa...
Big π§΅π with details!
8
136
61
reposted by
Thibaut Loiseau
Imagine-ENPC
7 months ago
Starter pack including some of the lab members:
go.bsky.app/QK8j87w
add a skeleton here at some point
0
24
12
reposted by
Thibaut Loiseau
Johan Edstedt
7 months ago
Introducing DaD, Part 2, a pretty cool keypoint detector.
add a skeleton here at some point
5
31
7
1/13 π Introducing our latest work on improving relative camera pose regression with a novel pre-training approach Alligat0R (
arxiv.org/abs/2503.07561
)!
@gbourmaud.bsky.social
@vincentlepetit.bsky.social
7 months ago
4
20
5
reposted by
Thibaut Loiseau
Zhenjun Zhao
7 months ago
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression
@thibautloiseau.bsky.social
, Guillaume Bourmaud,
@vincentlepetit.bsky.social
tl;dr: CroCo based; pixel in 1st image->co-visible or occluded or outside FOV in 2nd image
arxiv.org/abs/2503.07561
0
15
4
reposted by
Thibaut Loiseau
Lucas Degeorge
7 months ago
π¨ News! π¨ We have released the models from our latest paper "How far can we go with ImageNet for text-to-image generation?" Check out the models on HuggingFace: π€
huggingface.co/Lucasdegeorg...
π
arxiv.org/abs/2502.21318
add a skeleton here at some point
1
14
4
reposted by
Thibaut Loiseau
Nicolas Dufour
7 months ago
Check out our latest work on Text-to-Image generation! We've successfully trained a T2I model using only ImageNet data by leveraging captioning and data augmentation.
add a skeleton here at some point
1
15
7
reposted by
Thibaut Loiseau
David Picard
7 months ago
π¨ New preprint! How far can we go with ImageNet for Text-to-Image generation? w.
@arrijitghosh.bsky.social
@lucasdegeorge.bsky.social
@nicolasdufour.bsky.social
@vickykalogeiton.bsky.social
TL;DR: Train a text-to-image model using 1000 less data in 200 GPU hrs! πhttps://arxiv.org/abs/2502.21318 π§΅π
2
67
23
π§© Excited to share our paper "RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges" (
arxiv.org/abs/2502.19955
) accepted to
#CVPR2025
! We created a benchmark that systematically evaluates image matching methods across well-defined geometric difficulty levels. π
7 months ago
2
19
7
reposted by
Thibaut Loiseau
Guillaume Astruc
10 months ago
π€ What if embedding multimodal EO data was as easy as using a ResNet on images? Introducing AnySat: one model for any resolution (0.2mβ250m), scale (0.3β2600 hectares), and modalities (choose from 11 sensors & time series)! Try it with just a few lines of code:
2
35
12
reposted by
Thibaut Loiseau
David Picard
10 months ago
We
@imagineenpc.bsky.social
are slowly but surely entering our proposals for master's degree internships here:
docs.google.com/document/d/1...
These are 6 months projects that typically correspond to the end-of-study project in the French curriculum. Probably more offers to come, check it regularly.
loading . . .
2025 IMAGINE Internships
2025 Internship proposals at IMAGINE IMAGINE is a top research group on computer vision and machine learning. It is part of the LIGM lab and hosted at Γcole des Ponts ParisTech (ENPC), about 25 min f...
https://docs.google.com/document/d/1g6WZptYAPf5CcTz5BQ3-bWSoW_Dz3SnJL5erxuikLNM/edit?usp=sharing
2
32
13
you reached the end!!
feeds!
log in