Very happy this paper got accepted to NeurIPS 2025 as a Spotlight! 😁
Main takeaway: In mechanistic interpretability, we need assumptions about how DNNs encode concepts in their representations (e.g., the linear representation hypothesis). Without them, we can claim any DNN implements any algorithm!
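A toy sketch of the argument (my own illustration, with made-up data, not code from the paper): if the "interpretation" mapping from hidden states to algorithm states is unrestricted, e.g. an arbitrary lookup table, it can always be constructed for any network and any target algorithm, so the implementation claim is vacuous. Restricting the map to a fixed class such as linear probes (the linear representation hypothesis) makes the claim falsifiable.

```python
# Toy illustration only: the network, the "algorithm", and the probe are
# all assumptions for this sketch, not the paper's setup.
import numpy as np

rng = np.random.default_rng(0)

# Arbitrary "network": random 2-d hidden states for 8 distinct inputs.
hidden_states = rng.normal(size=(8, 2))

# Arbitrary "algorithm": any sequence of abstract states we like.
target_states = ["s0", "s1", "s1", "s2", "s0", "s2", "s1", "s0"]

# Unrestricted interpretation: a lookup table from hidden state to
# algorithm state. It always "succeeds", whatever the network or algorithm.
lookup = {tuple(h.round(6)): s for h, s in zip(hidden_states, target_states)}
assert all(
    lookup[tuple(h.round(6))] == s
    for h, s in zip(hidden_states, target_states)
)  # trivially true -- carries no explanatory content

# Restricted interpretation: a linear probe must recover the algorithm's
# states from the hidden states. This can fail, so it carries content.
labels = np.array([int(s[1]) for s in target_states])
X = np.hstack([hidden_states, np.ones((len(hidden_states), 1))])  # add bias
onehot = np.eye(3)[labels]
W, *_ = np.linalg.lstsq(X, onehot, rcond=None)  # least-squares linear probe
accuracy = (np.argmax(X @ W, axis=1) == labels).mean()
print(f"linear-probe accuracy: {accuracy:.2f}")  # may well be below 1.0
```

The contrast is the point: the lookup-table "interpretation" is guaranteed to work, while the linear probe can genuinely fail, which is what gives an interpretability claim empirical bite.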