Really enjoyed Goodfire’s neural geometry posts. A few thoughts. The current examples mostly focus on explicit variables, e.g., days of the week, and show how their geometry in activation space mirrors the geometry of the model’s outputs/behavior. (1/3)
add a skeleton here at some point
13 days ago