A. Feder Cooper
@afedercooper.bsky.social
357
200
139
ML researcher, MSR + Stanford postdoc, future Yale professor
https://afedercooper.info
reposted by
A. Feder Cooper
Mina Kimes
5 days ago
Bill Ackman gotta be on the third draft of a tweet longer than Middlemarch right now
228
12457
1252
I'm kinda known as a copyright person, but (even in memorization) I mainly study how to draw reliable conclusions from large-scale AI/ML systems. There's a long spiel why, but today I feel defeated. 100 hours/week on this for 6 years, just to find out a parent treats Gemini in search as ground-truth
8 days ago
0
4
0
The NeurIPS position track didn't take a large number of extraordinary papers that surpassed the acceptance bar, limiting the acceptance rate to an unusually low 6%. If you have a rejected paper at the intersection of ML and law, consider submitting to ACM CSLaw '26.
2026-CFP - ACM Symposium on Computer Science & Law
2026 Call for Papers 5th ACM Symposium on Computer Science and Law March 3-5, 2026 Berkeley, California The 5th ACM...
https://computersciencelaw.org/2026-2/2026-cfp/
about 1 month ago
1
5
2
reposted by
A. Feder Cooper
Mark Lemley
about 1 month ago
Our paper "Machine Unlearning Doesn't Do What You Think" was accepted for presentation at NeurIPS. Congrats
@afedercooper.bsky.social
and
@katherinelee.bsky.social
, who led the effort
arxiv.org/abs/2412.06966
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
We articulate fundamental mismatches between technical methods for machine unlearning in Generative AI, and documented aspirations for broader impact that these methods could have for law and policy. ...
https://arxiv.org/abs/2412.06966
1
21
4
One more week to submit to CSLaw '26!!
about 2 months ago
0
0
0
reposted by
A. Feder Cooper
Pamela Samuelson
about 2 months ago
For an update on the state of play in the generative AI copyright cases, try this podcast:
shows.acast.com/arbiters-of-...
AI Copyright Lawsuits with Pam Samuelson | Scaling Laws
https://shows.acast.com/arbiters-of-truth/episodes/ai-copyright-lawsuits-with-pam-samuelson
0
6
1
15 days left to submit to the CSLaw '26 main track! (archival and non-archival)!
about 2 months ago
0
5
5
reposted by
A. Feder Cooper
Elizabeth Lopatto
about 2 months ago
did a little media criticism
www.theverge.com/politics/777...
The WSJ carelessly spread anti-trans misinformation
The Wall Street Journal's fuckup while covering Charlie Kirk's killing needs more than an editor's note.
https://www.theverge.com/politics/777630/wsj-trans-misinformation-charlie-kirk
29
2615
663
reposted by
A. Feder Cooper
mattie lubchansky
about 2 months ago
was just looking for
@seantcollins.com
's "goofy at the crucification" post and google is so cool now
11
320
43
After 2 years in press, it's published! "Talkin' 'Bout AI Generation: Copyright and the Generative-AI Supply Chain," is out in the 72nd volume of the Journal of the Copyright Society
copyrightsociety.org/journal-entr...
written with
@katherinelee.bsky.social
&
@jtlg.bsky.social
(2023)
TALKIN' 'BOUT AI GENERATION: COPYRIGHT AND THE GENERATIVE-AI SUPPLY CHAIN | The Copyright Society
We know copyright
https://copyrightsociety.org/journal-entries/talkin-bout-ai-generation-copyright-and-the-generative-ai-supply-chain/
2 months ago
1
13
4
reposted by
A. Feder Cooper
James Grimmelmann
2 months ago
The Bartz v. Anthropic settlement is the polar opposite of the Google Books settlement: a discrete one-time payment for past copying, on a discrete and closed-ended class, and making no attempt at all to deal with larger forward-looking issues.
0
19
5
reposted by
A. Feder Cooper
Mark Lemley
3 months ago
Here is the direct link to the paper:
arxiv.org/abs/2505.12546
0
12
5
I'm excited to share that my paper with
@jtlg.bsky.social
, "The Files are in the Computer: On Copyright, Memorization, and Generative AI" (April 2024), is out in the AI Disrupting Law symposium issue of the Chicago-Kent Law Review! The full issue is here:
scholarship.kentlaw.iit.edu/cklawreview/
Chicago-Kent Law Review | Chicago-Kent College of Law
https://scholarship.kentlaw.iit.edu/cklawreview/
3 months ago
1
23
5
The CFP for ACM CSLaw '26 is up! Deadline for main-track papers (archival and non-archival) is September 30!
computersciencelaw.org/2026
2026 - ACM Symposium on Computer Science & Law
CS&Law 2026 5th ACM Symposium on Computer Science and Law March 3-5, 2026 Berkeley, California Computing, software, and the Internet...
https://computersciencelaw.org/2026
3 months ago
0
12
10
I understand what the underlying probabilities mean, and therefore why this was worth giving a go. But I'm still occasionally like "How tf can someone extract entire books from a frontier company's flagship LLM? Like we got _all_ of HP 1 with just 'Mr. and Mrs. D' as the seed prompt? What??"
4 months ago
0
2
0
Had a great time and learned a ton at ICML. But as an introvert, I've used up all my talking budget until the fall. Excited to get back to full time researchy things, and will hopefully have some exciting new results to share soon!
4 months ago
1
4
0
reposted by
A. Feder Cooper
M A Osborne
4 months ago
Strangers love to tell me "I can't understand you, because of your MASK". Dude, I am literally someone who gets paid to speak to large audiences while wearing a mask - I know I can be understood!
14
148
15
Happening now! Please swing by to talk about measurement!
4 months ago
0
1
0
Excited to be at
#ICML
'25! Please reach out if you'd like to chat. You can also find me presenting work at a few different spots, listed below!
4 months ago
2
2
0
Feeling so excited + grateful to be representing this paper at
#ICML
! Please stop by to talk about how to do more valid measurement for evaling gen AI systems! Work led by the incomparable
@hannawallach.bsky.social
and
@azjacobs.bsky.social
as a part of Microsoft's AI and Society initiative!!
4 months ago
0
12
2
Some minor updates to our recent books memorization paper! I've separated out a new section 5 that I hope makes some of our ML findings about memorization clearer to a wider audience. Preprint here:
arxiv.org/abs/2505.12546
1/8
Extracting memorized pieces of (copyrighted) books from open-weight language models
Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expr...
https://arxiv.org/abs/2505.12546
4 months ago
1
1
1
reposted by
A. Feder Cooper
Riana
4 months ago
"Llama 3.1 70B memorizes some books, like Harry Potter & the Sorcerer's Stone and 1984, almost entirely. ... HP is so memorized that, using a seed prompt consisting of just the first line of chapter 1, we can deterministically generate the entire book near-verbatim."
papers.ssrn.com/sol3/papers....
Extracting memorized pieces of (copyrighted) books from open-weight language models
Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) h
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5262084
0
6
5
reposted by
A. Feder Cooper
Luis Villa
5 months ago
"these are hypertechnocratic" is one of the most important things you can draw from this morning's ruling. In other words, hesitate before drawing parallels between this case and your most (loved|hated) AI training use case. (
@chup.blakereid.org
's whole thread is great)
0
9
1
very proud of my hometown. stunned, but proud.
5 months ago
1
0
0
idk folks, have read today's decision by Alsup and I think it's narrower than news headlines are indicating (shocker). To me this reads like (mostly) a win for Anthropic, but in this specific case with respect to these claims from these specific plaintiffs.
5 months ago
0
2
1
reposted by
A. Feder Cooper
Michael Hobbes
5 months ago
The conservative movement is lying about basically every empirical "debate" of our time, from climate change to voter fraud to vaccines. Why is it so impossible for reactionary centrists to contemplate the possibility they're lying about trans healthcare too?
84
4849
1293
reposted by
A. Feder Cooper
Mark Lemley
5 months ago
Ars Technica has a great writeup of my new study with
@afedercooper.bsky.social
and Amy Cyphert on memorization in AI language models
arstechnica.com/ai/2025/06/s...
Study: Meta AI model can reproduce almost half of Harry Potter book
The research could have big implications for generative AI copyright lawsuits.
https://arstechnica.com/ai/2025/06/study-metas-llama-3-1-can-recall-42-percent-of-the-first-harry-potter-book/
0
13
1
reposted by
A. Feder Cooper
Dr. Casey Fiesler
5 months ago
I was wondering when tech media was going to pick up this paper. :) "Study: Meta AI model can reproduce almost half of Harry Potter book":
arstechnica.com/features/202...
Coincidentally, I made a video about this paper a couple of weeks ago in a response to a comment I got elsewhere online.
1
35
8
reposted by
A. Feder Cooper
Ars Technica
5 months ago
For AI industry critics, the big takeaway here is that, at least for some models and some books, memorization is not a fringe phenomenon.
Study: Meta AI model can reproduce almost half of Harry Potter book
The research could have big implications for generative AI copyright lawsuits.
https://arstechnica.com/features/2025/06/study-metas-llama-3-1-can-recall-42-percent-of-the-first-harry-potter-book/?utm_source=bluesky&utm_medium=social&utm_campaign=aud-dev&utm_social-type=owned
9
41
20
uh...lol(?) at all the internet commenters who think that we/the authors on our recent books extraction paper are tacitly endorsing or supporting Rowling because Harry Potter is one of the books we tested (that happens to be highly memorized)
5 months ago
0
5
0
reposted by
A. Feder Cooper
Jessica Clarke
5 months ago
Today's Skrmetti opinion is devastating to transgender children and families who live in states with cruel laws barring gender affirming care. But it is very important to recognize this opinion does not give private entities, legislatures, or the President carte blanche to discriminate! 1/x
5
431
166
excited to see that folks are finding their way to our books memorization project.
mashable.com/article/meta...
I'll also be giving a talk on this work at the ICML memorization workshop in July. Please check out the paper for more details!
arxiv.org/abs/2505.12546
cc
@marklemley.bsky.social
Meta's AI tool Llama 'almost entirely' memorized Harry Potter book, study finds
Llama has a memorization problem, but don't call it a smoking gun.
https://mashable.com/article/meta-llama-reproduce-excerpts-harry-potter-book-research
5 months ago
0
10
3
reposted by
A. Feder Cooper
Techmeme
5 months ago
Researchers find Llama 3.1 recalls large parts of popular copyrighted books, possibly weakening AI industry claims that such memorization is fringe behavior (Timothy B. Lee/Understanding AI)
Main Link
|
Techmeme Permalink
4
53
25
reposted by
A. Feder Cooper
Chad Loder
5 months ago
Gavin Newsom vetoed a data broker privacy bill that our legislature passed last year that would have allowed Californians to opt out of all data brokers with a single click.
31
5222
1476
Tim Lee wrote an awesome short piece on my recent project with
@marklemley.bsky.social
and others on extracting memorized books from LLMs. Please check it out for a very clear explanation of what we did, and why the results have important consequences!
www.understandingai.org/p/metas-llam...
Meta's Llama 3.1 can recall 42 percent of the first Harry Potter book
New research could have big implications for copyright lawsuits against generative AI.
https://www.understandingai.org/p/metas-llama-31-can-recall-42-percent
5 months ago
0
11
3
reposted by
A. Feder Cooper
Rebecca Tushnet
5 months ago
Licensing fee demand incoming
0
15
6
wow, a research career milestone: someone angrily saying my paper has been done before...by pointing to my prior work that does something different (and which this current work builds on / depends on)
5 months ago
0
1
0
@marklemley.bsky.social
and
@cjsprigman.bsky.social
are doing some really important things - and winning. Beyond the general public, I hope other academics are taking note. There's a lot (that can take many forms, public and private) each of us can do right now.
5 months ago
0
8
3
I really enjoyed watching Andor, but I absolutely don't enjoy living in Andor
5 months ago
0
2
0
oh to be a seal in a tide pool, instead of a grouch on the internet
5 months ago
1
9
0
This is my entire AI policy position in a nutshell
5 months ago
0
1
0
reposted by
A. Feder Cooper
James Grimmelmann
5 months ago
Look, I don't actually have a lot of strong opinions about AI policy, but one of them is that people who do have strong opinions should take even just a few minutes to think through the obvious implications of the claims they make.
0
8
2
reposted by
A. Feder Cooper
5 months ago
Can you train a performant language model using only openly licensed text? We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance of similar models like LLaMA 1 & 2
2
147
61
well that's the fastest a paper of mine has been read/cited. thanks
@lucy3.bsky.social
. and awesome work on LDA x LLMs!
5 months ago
0
2
0
i swam like half a mile today, and will be sleeping for the rest of this vacation
5 months ago
0
1
0
reposted by
A. Feder Cooper
James Grimmelmann
5 months ago
I briefly noted this very interesting paper on memorization by LLMs when it came out, but it just showed up in my SSRN abstracts and that reminded me to write about it in a bit more detail. 🧵
arxiv.org/abs/2505.12546
Extracting memorized pieces of (copyrighted) books from open-weight language models
Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expr...
https://arxiv.org/abs/2505.12546
2
30
13
first full day of vacation is going great. made it until 10:30am before writing a line of code. hopefully tomorrow i can extend that to at least noon.
5 months ago
0
6
0
reposted by
A. Feder Cooper
Andrew Gordon Wilson
5 months ago
AI benchmarking culture is completely out of control. Tables with dozens of methods, datasets, and bold numbers, trying to answer a question that perhaps no one should be asking anymore.
1
18
6