Charlie Snell
@seasnell.bsky.social
📤 380
📥 319
📝 8
PhD @berkeley_ai; prev SR @GoogleDeepMind. I stare at my computer a lot and make things
pinned post!
Can we predict emergent capabilities in GPT-N+1🌌 using only GPT-N model checkpoints, which have random performance on the task? We propose a method for doing exactly this in our paper “Predicting Emergent Capabilities by Finetuning”🧵
about 1 year ago
3
45
7
reposted by
Charlie Snell
Ted Underwood
about 1 year ago
Did you know that attention across the whole input span was inspired by the time-negating alien language in Arrival? Crazy anecdote from the latest Hard Fork podcast (by
@kevinroose.com
and
@caseynewton.bsky.social
). HT nwbrownboi on Threads for the lead.
19
247
70
Can we predict emergent capabilities in GPT-N+1🌌 using only GPT-N model checkpoints, which have random performance on the task? We propose a method for doing exactly this in our paper “Predicting Emergent Capabilities by Finetuning”🧵
about 1 year ago
3
45
7
you reached the end!!
feeds!
log in