Vincent Tao Hu
@vtaohu.bsky.social
📤 935
📥 173
📝 3
LMU postdoc from Ommer-Lab, MCML junior member. UvA PhD, PKU
reposted by
Vincent Tao Hu
Pingchuan Ma
8 months ago
Our work received an invited talk at the Imageomics-AAAI-25 workshop of
#AAAI25
.
@vtaohu.bsky.social
will be representing us there. Without me being there, I still would like to share our poster with you :D We also have another oral presentation for DepthFM on March 1, 2:30 pm-3:45 pm.
0
3
1
reposted by
Vincent Tao Hu
Pingchuan Ma
10 months ago
🤔When combining Vision-language models (VLMs) with Large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks? 🤨Interested? Check out our latest work at
#AAAI25
: 💻Code and 📝Paper at:
github.com/CompVis/DisCLIP
🧵👇
1
15
8
reposted by
Vincent Tao Hu
Frank Fundel
11 months ago
Did you know you can distill the capabilities of a large diffusion model into a small ViT? ⚗️ We showed exactly that for a fundamental task: semantic correspondence📍 A thread 🧵👇
1
4
4
Your Diffusion Model is secretly an implicit timestep model, no matter discrete or continuous~
add a skeleton here at some point
11 months ago
0
6
0
reposted by
Vincent Tao Hu
Vladimir Yugay
11 months ago
Introducing “MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM”! We do SLAM with novel view synthesis capabilities on multiple simultaneously operating agents!
vladimiryugay.github.io/magic_slam/i...
1/7
loading . . .
3
51
18
you reached the end!!
feeds!
log in