David Bau
@davidbau.bsky.social
📤 2113
📥 240
📝 117
Interpretable Deep Networks.
http://baulab.info/
@davidbau
The NDIF youtube talk series continues... Don't miss the fascinating talks on by Xu Pan and Josh Engels, on the NDIF youtube channel.
www.youtube.com/channel/UCaQ...
add a skeleton here at some point
2 days ago
0
4
1
In the wake of the Jimmy Kimmel firing: Do not underestimate the power of the truth. The truth is our superpower.
davidbau.com/archives/202...
loading . . .
davidbau.com The Truth is Our Superpower
https://davidbau.com/archives/2025/09/20/the_truth_is_our_superpower.html
2 days ago
0
4
1
reposted by
David Bau
Monday: Trump tries to fire Fed Governor Lisa Cook (first time in 111 years). Thursday: CDC chief dismissed, four top scientists resign. Discredit, dismiss, blame. History shows exactly where this three-step pattern leads.
25 days ago
1
4
1
Monday: Trump tries to fire Fed Governor Lisa Cook (first time in 111 years). Thursday: CDC chief dismissed, four top scientists resign. Discredit, dismiss, blame. History shows exactly where this three-step pattern leads.
25 days ago
1
4
1
This Friday NEMI 2025 is at Northeastern in Boston, 8 talks, 24 roundtables, 90 posters; 200+ attendees. Thanks to
goodfire.ai/
for sponsoring!
nemiconf.github.io/summer25/
If you can't make it in person, the livestream will be here:
www.youtube.com/live/4BJBis...
loading . . .
New England Mechanistic Interpretability Workshop
About:The New England Mechanistic Interpretability (NEMI) workshop aims to bring together academic and industry researchers from the New England and surround...
https://www.youtube.com/watch?v=4BJBisHk1UI
about 1 month ago
1
16
10
Announcing a deep net interpretability talk series! Every week you will find new talks on recent research in the science of neural networks. The first few are posted:
jackmerullo.bsky.social
, Roy Rinberg, and me. At the
@ndif-team.bsky.social
Youtube Channel:
www.youtube.com/@NDIFTeam
loading . . .
NDIF Team
We're a research computing project cracking open the mysteries inside large-scale AI systems. The NSF National Deep Inference Fabric consists of a unique combination of hardware and software that provides a remotely-accessible computing resource for scientists and students to perform detailed and reproducible experiments on large pretrained AI models, such as open large language models. We aim to make AI interpretability research more accessible through this channel by publishing lectures and educational content covering real interpretability research.
https://www.youtube.com/channel/UCaQPbDdnHO8RblfLHI_Tz6A
about 1 month ago
0
11
6
The New England Mechanistic Interpretability Workshop, NEMI 2025 is August 22 in Boston. Talks, posters, meals, discussion... Most of all, an excellent chance to chat about new ideas with other great researchers in the field! Help spread the word - register and repost -
bsky.app/profile/koy...
loading . . .
Koyena Pal (@koyena.bsky.social)
🚨 Registration is live! 🚨 The New England Mechanistic Interpretability (NEMI) Workshop is happening Aug 22nd 2025 at Northeastern University! A chance for the mech interp community to nerd out on how models really work 🧠🤖 🌐 Info: nemiconf.github.io/summer25/ 📝 Register: https://forms.gle/v4kJCweE3UUHUE81A
https://bsky.app/profile/koyena.bsky.social/post/3lsubposluc2s
3 months ago
0
12
2
The new "Lookback" paper from
@nikhil07prakash.bsky.social
contains a surprising insight... 70b/405b LLMs use double pointers, akin to C programmers' double (**) pointers. They show up when the LLM is "knowing what Sally knows Ann knows", i.e., Theory of Mind.
bsky.app/profile/nik...
loading . . .
@nikhil07prakash.bsky.social
How do language models track mental states of each character in a story, often referred to as Theory of Mind? We reverse-engineered how LLaMA-3-70B-Instruct handles a belief-tracking task and found something surprising: it uses mechanisms strikingly similar to pointer variables in C programming!
https://bsky.app/profile/nikhil07prakash.bsky.social/post/3lseltldos324
3 months ago
1
28
3
FRIENDS: American science is being decimated by Congress NOW. Your help is needed to fix this. The current DC plan PERMANENTLY slashes NSF, NIH, all science training. Money isn't redirected—it's gone. Please read+share what's happening
thevisible.net/posts/004-s...
4 months ago
1
5
1
Because of propaganda Americans do not understand what Rubio is doing with visas. "I gave you a visa to come and study," they think.
x.com/CitizenFree...
NO, he has not!! Please help explain to X how Rubio has stopped *ALL* student visas, and how it is killing US science.
4 months ago
1
6
0
When setting up my AI lab I faced a choice between Toronto and Boston. I chose Boston, my home and the world's best incubator for research talent. Here you can take a short stroll to meet with top minds in hundreds of fields from AI to astronomy, batteries to biotech.
4 months ago
1
12
0
Black Box, Blood Money Friday evening, an Italian tourist escaped a torturer in Manhattan who was after his crypto password. I asked Anthropic's Opus 4 to analyze and explain what the episode might teach us about AI. It critiqued my guidance, instead proposing a focus on VCs:
4 months ago
2
2
0
Please join me in celebrating the contributions of our international students, researchers, and visitors. Here is a reminder of what makes America unique, and why no other nation can touch USA's 420 Nobel prizes:
4 months ago
4
13
2
How to build AI leadership in the U.S? It is not about the chips. It is about the people! I spoke about AI interpretability at
ntird.gov/
last week. (NTIRD is the joint program between 23 federal agencies that coordinates government technology investments.)
4 months ago
2
5
1
My grandfather was a WW2 American Army veteran who became a proud cataloger at the Library of Congress. As a toddler I remember walking to his Library office from his A Street home, with a stop at the playground. The LOC has always been the jewel of America for me.
5 months ago
1
17
1
Leon Bottou's post ICLR thoughts are worth a read. He reminds us that modern AI is not just a product: it is a scientific wonder that we still do not understand.
leon.bottou.org/news/two_les...
loading . . .
news:two_lessons_from_iclr_2025 [leon.bottou.org]
https://leon.bottou.org/news/two_lessons_from_iclr_2025
5 months ago
1
21
3
In academia, we treat too many of our customs as "awards to be won" rather than obligations to the community. It seems wrong that "getting a paper through peer review" is seen by many like an award, a game to win, rather than a duty. Peer review forces writers and readers to *teach* each other.
add a skeleton here at some point
5 months ago
0
7
2
Insightful advice from M Gessen.
www.nytimes.com/2025/04/21/o...
Universities currently focus on competition—getting as many applicants as possible… real estate… endowments… rankings in the US News & World Report. They need to set all of that aside and focus on teaching as widely as possible.
loading . . .
Opinion | M. Gessen: ‘Trump Is Building a Mafia State’
“Nice university you got there. Shame if something happened to it.”
https://www.nytimes.com/2025/04/21/opinion/trump-universities-mafia-state.html
5 months ago
1
2
0
Government research programs like
#NAIRRPilot
are very different from industry efforts. Because just making smart technology products is not enough for leadership: We must nurture the community of human experts. And government builds those communities from the ground up ↘️
5 months ago
1
4
0
reposted by
David Bau
Garrett Wollman
5 months ago
"we must recognize that our superpower as academics is that nobody is in it for the money"
add a skeleton here at some point
0
7
1
Credibility, not capability. The most important thing we build in technology and academia is not capability, but credibility. It does not matter how fast we calculate, how smart we are, or the number of products or papers we make, if we cannot answer "Why should anybody believe anything we say?"
5 months ago
1
34
5
reposted by
David Bau
Clément Dumas
6 months ago
Very cool work that introduces "concept heads" that copy meanings from one token to another. I love that our activation-based analysis of multilingual representation has now additional insight from weight space analysis:
bsky.app/profile/sfeu...
add a skeleton here at some point
0
8
1
Sheridan asks whether the Dual Route Model of Reading that psychologists have observed in humans also appears in LLMs. In her brilliantly simple study of induction heads, she finds that it does! Induction has a Dual Route that separates concepts from literal token processing. Worth reading ↘️
add a skeleton here at some point
6 months ago
0
7
2
In my life I have paid a lot of tax. And every year, after participating in debates about how to spend it all—town, state, and country—I have been proud to write each tax check even when I disagree with the decisions. This is the first year I have had serious misgivings.
6 months ago
0
6
0
reposted by
David Bau
The need for interpretability to unlock innovation is not new. Contrast biology before and after Nirenberg 1962. Solving **interpretability**, decoding the mysterious information encoded in DNA, is the key to innovation in modern biology.
bsky.app/profile/rod...
6 months ago
1
3
1
reposted by
David Bau
The effects of transparency and interpretability in the AI innovation ecosystem are already visible today. EliahuHorwitz
arxiv.org/abs/2503.10633
HF viz shows it: the light blue explosion of SD innovation is from interpretable methods like
@rohitgandikota.bsky.social
Sliders
sliders.baulab.info
6 months ago
1
10
3
What will be the linchpin for AI dominance? Read our NSF/OSTP recommendations written with Goodfire's Tom McGrath
tommcgrath.github.io
, Transluce's Sarah Schwettmann
cogconfluence.com
, MIT's Dylan Hadfield-Menell
@dhadfieldmenell.bsky.social
TLDR; Dominance comes from **interpretability** 🧵 ↘️
6 months ago
1
22
9
Here is a litmus test for you. What is your choice? As AI becomes more powerful and begins to act with social awareness, with a theory of mind, then a theory of self, and beyond that when it begins to exhibit traits of consciousness, then what?
7 months ago
1
5
0
Today we launch a new open research community It is called ARBOR:
arborproject.github.io/
please join us.
bsky.app/profile/ajy...
7 months ago
1
15
7
DeepSeek R1 shows how important it is to be studying the internals of reasoning models. Try our code: Here
@canrager.bsky.social
shows a method for auditing AI bias by probing the internal monologue.
dsthoughts.baulab.info
I'd be interested in your thoughts.
loading . . .
https://dsthoughts.baulab
8 months ago
1
28
10
Can an AI make a painting with no paintings in its training data?
huggingface.co/spaces/rhfei...
This new demo of
@rhfeiyang.bsky.social
and Joanna Materzynska and
@rohitgandikota.bsky.social
's Art-Free Diffusion is worth a spin. You can use AI models that have seen no art, or just specific art!
loading . . .
Art-Free-Diffusion - a Hugging Face Space by rhfeiyang
Discover amazing ML apps made by the community
https://huggingface.co/spaces/rhfeiyang/Art-Free-Diffusion
8 months ago
0
12
1
reposted by
David Bau
NDIF Team
9 months ago
⏳ Final call! Applications for NDIF's 405b pilot program close tonight! Don’t miss the opportunity to experiment with Llama-405b and shape groundbreaking AI research infrastructure. Details below 🧵⬇️
1
4
2
What was the most important machine learning paper in 2024? My Famous Deep Learning Papers list (that I use in teaching) does not include any new ideas from the last year.
papers.baulab.info
Which single new paper would you add?
9 months ago
10
56
12
I'm at
#NeurIPS2024
Safe and Trustworthy Agents Workshop (West bldg Ballroom C) and will give a talk this morning at 9:40AM. Will talk about interpretability surprises and challenges, and respond to
@ilyasut.bsky.social
's thoughts on complex agentic AI. Come join!
9 months ago
0
10
0
Apply to the NDIF Summer Engineering Fellowship to come to Boston to work on the National Deep Inference Fabric, a cutting-edge infrastructure for research in large-scale AI. A really interesting project that crosses ML, HPC, Systems, PL, UX/Viz, research, and engineering.
ndif.us/fellowship.h...
add a skeleton here at some point
9 months ago
0
15
3
The Phase 2 NDIF Pilot is open for a short window. Apply now to get research capacity on Llama 405b. Deadline is December 31. It is not easy to crack open 405b for research, but NDIF solves the key engineering problems for you. Phase 1 powered several very interesting ICLR submissions...
add a skeleton here at some point
10 months ago
0
14
5
PhD Applicants: remember that the Northeastern Computer Science PhD application deadline is Dec 15. It's a terrific time to do a PhD, with so many interesting things happening in AI. Apply here:
www.khoury.northeastern.edu/apply/phd-ap...
loading . . .
PhD Apply - Khoury College of Computer Sciences
https://www.khoury.northeastern.edu/apply/phd-apply/
10 months ago
0
33
5
reposted by
David Bau
Maxwell Jones
10 months ago
Learning style from a single image is difficult, but what if you had access to an **image pair** instead? I’m excited to share our
#SIGGRAPHASIA2024
work PairCustomization, on customizing text-to-image models with a single image pair!! project page:
paircustomization.github.io
1/3
loading . . .
1
11
2
Do you need to copy art to make art? Hui Ren's and Joanna Materzynska's Art-Free Diffusion tests this question and lets you make "imitation-free" AI art Github:
github.com/rhfeiyang/ar...
Arxiv:
arxiv.org/abs/2412.00176
Website:
joaanna.github.io/art-free-dif...
X:
x.com/materzynska/...
loading . . .
GitHub - rhfeiyang/art-free-diffusion: Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"
Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge" - rhfeiyang/art-free-diffusion
https://github.com/rhfeiyang/art-free-diffusion
10 months ago
0
17
3
It's been a year since the last post. Now just back from EMNLP/BlackboxNLP 2024, which was as excellent and as interesting as last year. A journalist asked me "how quickly is interpretable ML changing"? My answer: We learn big new things about AI internals every month. It is moving very quickly.
10 months ago
1
32
2
Headed to Singapore for CoNLL, BlackboxNLP, EMNLP. Looking forward to learning about your interpretability research! Reach out if you'll be there and you'd like to chat.
almost 2 years ago
0
2
0
reposted by
David Bau
Rohit Gandikota
almost 2 years ago
Stuck on what Thanksgiving dish to cook? 🦃 Here are some AI generated ideas - by composing concepts like "cooked" and "fancy" dinner. 🍗🥧 You can also blend and explore “vegetarian”, “healthier diet”, and many more! Stay tuned for technical details! 🧵 Happy Holidays!🎄✨
@davidbau.bsky.social
0
7
1
My student Koyena Pal will be presenting some cool work at
#CoNLL2023
. Her Future Lens can look a single hidden state of an LLM and see what a transformer is planning, several tokens ahead. It can make reading the internal states much more intuitive!
x.com/kpal_koyena/...
almost 2 years ago
1
7
1
LLMs contain Function Vectors! Eric Todd has a really interesting new preprint on arxiv
functions.baulab.info
showing LLMs contain vector representations of functions that compose and apply in diverse contexts. Could be a powerful analysis tool. More in his twitter thread:
x.com/ericwtodd/st...
loading . . .
Function Vectors in Large Language Models
Understanding the internal computations of huge autoregressive transformer neural network language models during in-context learning.
https://functions.baulab.info/
almost 2 years ago
1
12
3
you reached the end!!
feeds!
log in