Ramon
@noctrog.bsky.social
📤 51
📥 185
📝 10
PhD ML student in Switzerland. Prev intern at NVIDIA, Sony.
What is the true depth of an LLM? Together with @danielepal.bsky.social, @matpagliardini.bsky.social, M. Jaggi and @francois.fleuret.org we show that LLMs have a smaller effective depth that can be exploited to increase inference speeds on multi-GPU settings!
arxiv.org/abs/2502.02790
(1/N)
10 months ago