Francis Bach
@bachfrancis.bsky.social
📤 2091
📥 14
📝 14
Researcher in machine learning
reposted by
Francis Bach
Eugene Berta
about 2 months ago
Still using temperature scaling? With
@dholzmueller.bsky.social
, Michael I. Jordan and
@bachfrancis.bsky.social
we argue that with well designed regularization, more expressive models like matrix scaling can outperform simpler ones across calibration set sizes, data dimensions, and applications.
1
5
2
Not all scaling laws are nice power laws. This month’s blog post: Zipf’s law in next-token prediction and why Adam (ok, sign descent) scales better to large vocab sizes than gradient descent:
francisbach.com/scaling-laws...
loading . . .
3 months ago
1
47
12
Jamais deux sans trois. Le département d’informatique de l’ENS toujours en forme. 131 km, 4500m de dénivelé, cette fois-ci avec du Beaufort aux ravitaillements! Bravo Olivier Cappé!
#LEtapeduTour
6 months ago
1
19
0
Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/
loading . . .
6 months ago
0
25
5
Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/
6 months ago
0
19
4
What if AI isn’t about building solo geniuses, but designing social systems? Michael Jordan advocates blending ML, economics, and uncertainty management to prioritize social welfare over mere prediction. A must-read rethink.
arxiv.org/abs/2507.062...
loading . . .
A Collectivist, Economic Perspective on AI
Information technology is in the midst of a revolution in which omnipresent data collection and machine learning are impacting the human world as never before. The word "intelligence" is being used as...
https://arxiv.org/abs/2507.06268v1
6 months ago
0
37
9
Big thanks to the COLT 2025 organizers for an awesome event in Lyon! Here are the slides from my keynote this morning in case you’re curious about the references I mentioned:
www.di.ens.fr/~fbach/fbach...
loading . . .
https://www.di.ens.fr/~fbach/fbach_optim_ml_2025_COLT_online.pdf
6 months ago
0
19
2
reposted by
Francis Bach
Julyan Arbel
7 months ago
Register at PAISS 1-5 Sept 2025 @inria_grenoble with very talented speakers this year 🙂
paiss.inria.fr
cc
@mvladimirova.bsky.social
1
8
4
reposted by
Francis Bach
Julien Mairal
8 months ago
The PAISS summer school is back with an incredible line of speakers (and more to come). Spread the word !
1
23
14
reposted by
Francis Bach
Académie des sciences
8 months ago
Épisode 5 de notre série "Les nouveaux visages de l’Académie des sciences" : La statistique, une science en son temps. Retrouvez la vidéo sur Youtube :
www.youtube.com/watch?v=2AhT...
1
13
5
reposted by
Francis Bach
Lénaïc Chizat
9 months ago
Announcing : The 2nd International Summer School on Mathematical Aspects of Data Science
mathsdata2025.github.io
EPFL, Sept 1–5, 2025 Speakers: Bach
@bachfrancis.bsky.social
Bandeira Mallat Montanari Peyré
@gabrielpeyre.bsky.social
For PhD students & early-career researchers Apply before May 15!
loading . . .
Mathematical Aspects of Data Science
Graduate Summer School - EPFL - Sept. 1-5, 2025
https://mathsdata2025.github.io
1
46
25
reposted by
Francis Bach
Académie des sciences
9 months ago
[NOUVELLE SERIE "LES NOUVEAUX VISAGES DE L'ACADEMIE DES SCIENCES] Episode n°1 : Anne Canteaut : une architecte de la cryptographie moderne
www.youtube.com/watch?v=xyGC...
1
9
3
reposted by
Francis Bach
Gabriel Peyré
10 months ago
Futur best seller!
2
37
6
Characterizing finely the decay of eigenvalues of kernel matrices: many people need it, but explicit references are hard to find. This blog post reviews amazing asymptotic results from Harold Widom (1963!) and proposes new non-asymptotic bounds.
francisbach.com/spectrum-ker...
10 months ago
0
50
6
The must-read introduction to PAC-Bayes!
add a skeleton here at some point
10 months ago
0
6
0
reposted by
Francis Bach
Académie des sciences
11 months ago
🔬✨ Journée des Femmes et des filles de science✨🔬 À travers leurs parcours inspirants et leurs engagements, les académiciennes, et toutes les femmes et filles de science façonnent la recherche d’aujourd’hui et de demain. Pour aller plus loin :
urls.fr/PK5xg9
loading . . .
https://urls.fr/PK5xg9
3
48
30
reposted by
Francis Bach
Quentin Berthet - 🚅 to EurIPS 2025 🇩🇰
11 months ago
Check out our paper, with Lawrence Stewart and
@bachfrancis.bsky.social
Link:
arxiv.org/abs/2502.02996
1/8
loading . . .
Building Bridges between Regression, Clustering, and Classification
Regression, the task of predicting a continuous scalar target y based on some features x is one of the most fundamental tasks in machine learning and statistics. It has been observed and...
https://arxiv.org/abs/2502.02996v1
1
9
2
reposted by
Francis Bach
JP Vert
11 months ago
If you're curious of what is "behind a term sheet", don't miss this account by the Cathay innovation team who led our recent series A at Bioptimus
medium.com/cathay-innov...
loading . . .
Behind the Term Sheet: Bioptimus’ $41M Series A
The GPT of Biology Raising the Bar of AI-Driven Scientific Research
https://medium.com/cathay-innovation/behind-the-term-sheet-bioptimus-41m-series-a-5a876c0d646e
0
6
1
An inspirational talk by Michael Jordan: a refreshing, deep, and forward-looking vision for AI beyond LLMs.
www.youtube.com/live/W0QLq4q...
11 months ago
2
27
1
reposted by
Francis Bach
Fabian Schaipp
11 months ago
Learning rate schedules seem mysterious? Why is the loss going down so fast during cooldown? Turns out that this behaviour can be described with a bound from *convex, nonsmooth* optimization. A short thread on our latest paper đźšž
arxiv.org/abs/2501.18965
loading . . .
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training
We show that learning-rate schedules for large model training behave surprisingly similar to a performance bound from non-smooth convex optimization theory. We provide a bound for the constant schedul...
https://arxiv.org/abs/2501.18965
2
31
6
reposted by
Francis Bach
Eugene Berta
11 months ago
Early stopping on validation loss? This leads to suboptimal calibration and refinement errors—but you can do better! With
@dholzmueller.bsky.social
, Michael I. Jordan, and
@bachfrancis.bsky.social
, we propose a method that integrates with any model and boosts classification performance across tasks.
4
18
9
reposted by
Francis Bach
JP Vert
about 1 year ago
I've been eagerly awaiting this book for years! At last, a standalone and meticulous exposition of the current mathematical principles of machine learning, adorned with beautiful proofs. Well done and thank you,
@bachfrancis.bsky.social
. Discover more here:
francisbach.com/my-book-is-o...
loading . . .
My book is (at last) out! – Machine Learning Research Blog
https://francisbach.com/my-book-is-out/
0
15
1
A happy author discovering the first hard copies
add a skeleton here at some point
about 1 year ago
3
97
17
reposted by
Francis Bach
My book is (at last) out, just in time for Christmas! A blog post to celebrate and present it:
francisbach.com/my-book-is-o...
about 1 year ago
2
142
38
My book is (at last) out, just in time for Christmas! A blog post to celebrate and present it:
francisbach.com/my-book-is-o...
about 1 year ago
2
142
38
reposted by
Francis Bach
Académie des sciences
about 1 year ago
Breaking news : L'Académie des sciences accueille 18 nouveaux membres dès 2025, avec une majorité féminine pour la 1ère fois depuis 1666 👩‍🔬 : un symbole fort pour la parité en science ! 💥 🔗 En savoir plus sur les nouveaux membres :
urlr.me/mntDHX
0
17
8
New opening! Post-doctoral position on relaxation methods for large-scale optimization and the management of electrical systems, in collaboration between EDF and Inria Saclay and Paris. See more details here:
laurentpfeiffer.github.io/postdoc/
loading . . .
| Laurent Pfeiffer
https://laurentpfeiffer.github.io/postdoc/
about 1 year ago
2
53
7
you reached the end!!
feeds!
log in