Two weeks ago I posted about our recent paper, which shows that to bind entities, LMs use three mechanisms: positional, lexical and reflexive.
We were curious how these mechanisms develop throughout training, so we evaluated their existence across OLMo checkpoints 👇
2 months ago