@rogergrosse.bsky.social
📤 1683
📥 77
📝 6
The Nintendo is closer in time to the first transistor than to today.
10 months ago
0
5
0
Conferences are basically a way for a group of people to temporarily have a lower opportunity cost on their time.
10 months ago
1
10
0
reposted by
Tomer Ullman
10 months ago
thinking of calling this "The Illusion Illusion" (more examples below)
60
1587
481
reposted by
Jonathan Lorraine
11 months ago
🚨 New #NeurIPS2025 paper “Training Data Attribution via Approximate Unrolling” 🚨 Introducing SOURCE: A method to understand how individual training examples influence neural net behavior, allowing us to make AI models more transparent and trustworthy! 📄 Full paper: openreview.net/pdf?id=3NaqG...
1
18
2
I have Claude filter my arXiv feed each day. It mostly works pretty well, except that it always hallucinates that "Studying LLM Generalization with Influence Functions" is in my feed and tells me I should read it.
11 months ago
3
9
0
Some very nice work from Cohere and UCL using influence functions to analyze math reasoning abilities in LLMs. Factual queries turn up docs containing the facts, but reasoning queries turn up similar cognitive strategies, suggesting generalization.
arxiv.org/abs/2411.12580
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
The capabilities and limitations of Large Language Models have been sketched out in great detail in recent years, providing an intriguing yet conflicting picture. On the one hand, LLMs demonstrate a g...
https://arxiv.org/abs/2411.12580
11 months ago
0
15
2