Max kuhn
@topepo.bsky.social
📤 4852
📥 297
📝 164
Writing modeling packages at
@posit.co
(née RStudio). Opinions are my own.
https://max-kuhn.org/
pinned post!
I last posted here about 6 months ago. Here's what I've been working on and/or thinking about.
#rstats
,
#statistics
,
#ml
package upkeep: we are doing major preventive maintenance on the tidymodels packages ("upkeep week!"). It's rote but very rewarding work. Error messages are 100x better. 1/3
loading . . .
Applied Machine Learning for Tabular Data
https://aml4td.org
about 1 year ago
1
27
4
reposted by
Max kuhn
Emil Hvitfeldt
12 days ago
We are excited to see that xgboost recently had a big CRAN release! We have worked hard on the tidymodels team to make sure you all have a smooth transition. Please yet us know if you are experiencing any issues with the releases
tidyverse.org/blog/2025/12...
#rstats
#tidymodels
loading . . .
tidymodels & xgboost
The tidymodels ecosystem is prepared for big xgboost CRAN release.
https://tidyverse.org/blog/2025/12/tidymodels-xgboost/
1
20
4
reposted by
Max kuhn
alex hayes
5 months ago
~~ making sense of academic statistics ~~ i wrote about the confusing relationship between statistics and data analysis, and also about how statistics relates to science
#statistics
#rstats
#datascience
www.alexpghayes.com/post/making-...
15
115
29
reposted by
Max kuhn
The R Data Scientist
14 days ago
Smoothed ROC Curves, Calculus and Curvature
https://rworks.dev/posts/roc-and-differential-geometry/
#rstats
#datascience
loading . . .
Smoothed ROC Curves, Calculus and Curvature
Summary: Smoothed ROC curves in R with monoH.FC splines, calculus, and curvature concepts for AUC and arc length.
https://rworks.dev/posts/roc-and-differential-geometry/
0
1
1
reposted by
Max kuhn
Emil Hvitfeldt
16 days ago
I'm excited to announce the newest release of {tidypredict}! This release brings for standardization to outputs, faster trees for parsing and prediction, and glmnet support
tidyverse.org/blog/2025/12...
#rstats
#tidymodels
loading . . .
tidypredict 1.0.0
tidypredict 1.0.0 brings faster computations for tree-based models, more efficient tree representations, glmnet model support, and a change in how random forests are handled.
https://tidyverse.org/blog/2025/12/tidypredict-1-0-0/
2
34
9
We’ve released two new tidymodels
#rstats
packages for feature selection: filter and important.
tidyverse.org/blog/2025/11...
loading . . .
Two New tidymodels Packages
Two new tidymodels packages focus on supervised feature selection.
https://tidyverse.org/blog/2025/11/two-new-tidymodels-packages/
about 1 month ago
1
39
7
reposted by
Max kuhn
Joe Kirincic
about 1 month ago
I’m not aware of an Arrow or Parquet format, but there is the ONNX format (see
onnx.ai
). Depending on the model, you could try Posit’s orbital project, which translates your model to SQL (see here
orbital.tidymodels.org
).
loading . . .
ONNX | Home
https://onnx.ai
0
0
2
reposted by
Max kuhn
The R Data Scientist
about 1 month ago
Jarl: just another R linter
https://r-consortium.org/posts/jarl-just-another-r-linter/
#rstats
#datascience
loading . . .
Jarl: just another R linter
Summary: Jarl is an executable Rust-based R linter that detects and fixes inefficient or buggy patterns in R code for CI and editors.
https://r-consortium.org/posts/jarl-just-another-r-linter/
0
6
1
reposted by
Max kuhn
Emil Hvitfeldt
about 1 month ago
We are excited to share a few updates happening in tidymodels in this blog post!
tidyverse.org/blog/2025/11...
loading . . .
Q3 2025 tidymodels digest
A summary of what has been going on for the tidymodels group in the mid 2025.
https://tidyverse.org/blog/2025/11/tidymodels-2025-q3/
0
8
1
reposted by
Max kuhn
Yani Bellini Saibene
about 1 month ago
🤖 Introducción a
#Tidymodels
Aprendé a usar tidymodels para crear y evaluar modelos de
#MachineLearning
en
#RStats
Con Francisco Cardozo y Edgar Ruiz 📅 1 Dic, 18:00–20:00 UTC-3 – Online 💵 Estudiantes USD 5 · Académicos USD 10 · Industria USD 15 🔗
www.eventbrite.com.ar/e/1962543491...
#LatinR2025
0
8
6
We're hiring an open-source
#python
developer focused on modeling APIs!
tidyverse.org/blog/2025/11...
#numpy
#scipy
#scikitlearn
loading . . .
Python Open-Source Developer
Posit is hiring a Python open-source developer to create more data analysis tools.
https://tidyverse.org/blog/2025/11/python-open-source-developer/
about 1 month ago
0
34
15
My
#rstats
talk for the R+AI conference, titled "Analyzing
#LLM
Evaluations," starts at 1:40 EST today.
rconsortium.github.io/RplusAI_webs...
about 1 month ago
0
9
0
I just gave a 20m overview of the
#TabPFN
deep learning model at R/Pharma. The slides are at:
topepo.github.io/2025-r-pharma/
(link to Quarto files on the second slide)
about 2 months ago
0
7
1
We've made a blog post about a new major version of the
#rstats
tune package! Two main changes: parallel processing frameworks and the ability to tune postprocessors.
tidyverse.org/blog/2025/11...
loading . . .
tune version 2.0.0
A new version of tune brings significant updates for model tuning and parallel processing.
https://tidyverse.org/blog/2025/11/tune-2/
about 2 months ago
0
21
5
reposted by
Max kuhn
Simon P. Couch
about 2 months ago
I'm excited to share side::kick(), an experimental open-source coding agent for RStudio built entirely in R. It can interact with your files, communicate with your active
#rstats
session, and run code. Check it out:
github.com/simonpcouch/...
loading . . .
3
57
14
reposted by
Max kuhn
Lionel Henry
2 months ago
I'm excited to share a new version of the Air formatter for
#rstats
, with support for tabular formatting! Super useful with `tibble::tribble()` calls or `data.table::fcase()`. It's currently experimental as we're looking to get feedback on the feature, so please let us know what you think.
loading . . .
7
138
33
Before
#NoKings
in Hartford CT...
loading . . .
2 months ago
1
19
4
Great news!
add a skeleton here at some point
2 months ago
0
3
0
reposted by
Max kuhn
Kelly Bodwin
3 months ago
Shannon's slides are always so unbelievably clear and helpful!!!
github.com/shannonpileg...
I'm having "Ohhhhh that's what that means" moments every 10 seconds here.
#positconf2025
add a skeleton here at some point
2
38
15
reposted by
Max kuhn
Kevin Baer
4 months ago
I'm all in on
@topepo.bsky.social
and co's new {important} and other variable importance/feature selection tools in tidymodels!
#rstats
1
7
1
reposted by
Max kuhn
Ella Kaye
4 months ago
Once again,
@davisvaughan.bsky.social
's extrachecks have saved me from a likely CRAN rejection for an upcoming
#RStats
package submission. Thanks Davis!
github.com/DavisVaughan...
loading . . .
GitHub - DavisVaughan/extrachecks
Contribute to DavisVaughan/extrachecks development by creating an account on GitHub.
https://github.com/DavisVaughan/extrachecks
3
39
8
reposted by
Max kuhn
Adam L
4 months ago
Simon Wood, the GOAT of generalized additive models & creator of the mgcv
#rstats
package, has an Annual Review of Statistics essay on GAMs, available open access
#statssky
#mlsky
www.annualreviews.org/content/jour...
0
89
42
reposted by
Max kuhn
Daniel Gutierrez
4 months ago
ML success ≠ Kaggle leaderboard. The real world rewards: - Clear explanations - Thoughtful metrics - Collaboration with domain experts A 0.01 lift in F1 score won’t save you if no one understands your model.
#DataSciene
#MachineLearning
#AI
#RStats
0
4
1
reposted by
Max kuhn
Posit
4 months ago
Announcing a new blog series on LLMs from
@veerle.hypebright.nl
! In Part 2, “Talking to LLMs: From Prompt to Response”, we get hands-on with LLM-powered apps. This guide is for
#Python
&
#RStats
users who want to go beyond the basics. Check it out here:
shiny.posit.co/blog/posts/s...
2
20
6
Slides from my
#rstats
talk “Measuring LLM Effectiveness” at
#dataconfAI
with
@simonpcouch.com
.
topepo.github.io/2025_NYR/
Video in about a month. Great conference!
4 months ago
0
28
7
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
4 months ago
🧠📊 3 days. 2 workshops. 20 talks. 1 amazing community.
#dataconfAI
is officially wrapped! Thanks for showing up with insights, ideas, inspiration, and curiosity. And to all who made it unforgettable—speakers, attendees, sponsors, and volunteers. See you at the next one! 🚀
0
0
2
reposted by
Max kuhn
Simon P. Couch
4 months ago
In working on an eval for an experimental tidymodels AI assistant, I realized that today's frontier LLMs know much more about
#rstats
tidymodels than I thought.
www.simonpcouch.com/blog/2025-08...
1
21
5
It's a lot of fun! Everyone gets something out of it. Plus,
@davisvaughan.bsky.social
always finds a great barista!
add a skeleton here at some point
4 months ago
1
6
0
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
4 months ago
Start off The NY Data Science & AI Conference w/ hands-on workshops on Aug 25 in NYC or online: 📊 Machine Learning in R w/ Max Kuhn 🤖 Intro to LLMs/AI w/ Daniel Chen 🎟️ Learn more & register at
dataconf.ai/nyc
#RStats
#AI
#Workshops
#databs
@topepo.bsky.social
@chendaniely.bsky.social
0
6
2
reposted by
Max kuhn
Emil Hvitfeldt
5 months ago
Excited to share my newest quarto revealjs plugin: imagemover Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience
github.com/EmilHvitfeld...
#quarto
loading . . .
6
205
60
reposted by
Max kuhn
Dr. U
5 months ago
Time to convert this into an LLM powered snippet using {chores} by
@simonpcouch.com
.
#useR2025
#rstats
add a skeleton here at some point
0
4
1
reposted by
Max kuhn
Kelly Bodwin
5 months ago
Welp, {chores} by
@simonpcouch.com
is an immediate install for sure. Basically it's {usethis} plus llm bundled into RStudio/Positron key encoding. Excited!!! 🧹🧺
#useR2025
#rstats
#couchverse
?
0
28
7
reposted by
Max kuhn
Lander Analytics
5 months ago
Don't miss out learning from the best, Max Kuhn!
@topepo.bsky.social
#dataBS
#Tidymodels
#MachineLearning
add a skeleton here at some point
0
2
1
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
5 months ago
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows! 📍Aug 25 in NYC or online 🎟️ & info:
dataconf.ai/nyc
#RStats
#Tidymodels
#MachineLearning
@topepo.bsky.social
0
7
3
We are super excited to have you join us for the day!
add a skeleton here at some point
5 months ago
0
4
0
Positron is definitely visually more than RStudio, and this is a helpful overview.
add a skeleton here at some point
5 months ago
1
8
1
reposted by
Max kuhn
Hannah Frick
5 months ago
The call for papers for LatinR 2025 (online) is now open! You can present in English, Spanish, or Portuguese 🗣️
#RStats
latinr.org/en/blog/en/2...
loading . . .
Call for papers – LatinR 2024
https://latinr.org/en/blog/en/2025-05-31-call-for-papers-2025.html
0
3
4
We've released 4 new chapters of Applied Machine Learning for Tabular Data. Includes: Bayesian optimization, feature selection, model comparisons, classification metrics, calibration,
#rstats
computing sections, and more
blog.aml4td.org/posts/2025-0...
loading . . .
Part 3 is Finished, Part 4 Started – Applied Predictive Modeling Blog
https://blog.aml4td.org/posts/2025-07-new-chapters/
5 months ago
1
49
8
reposted by
Max kuhn
Ilya Kashnitsky
5 months ago
A perfect URL does not exi... 😃
#rstats
🔗
rstats.wtf
1
28
5
Gotta catch 'em all! I want the Stacks Year of the Mango
add a skeleton here at some point
5 months ago
0
10
0
reposted by
Max kuhn
Emil Hvitfeldt
5 months ago
Slides from yesterdays talk "Don't be dense, embracing sparsity in tidymodels" are live if you are interested!
emilhvitfeldt.github.io/talk-slc-spa...
2
22
6
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
5 months ago
The NY Data Science & AI Conference kicks off w/ workshops on Aug 25! 🧠 Machine Learning in R w/ Max Kuhn (
@topepo.bsky.social
) 🤖 Intro to LLMs/AI w/ Daniel Chen Join us in NYC or online & level up before the main event! 🎟️ Tix & more info:
dataconf.ai/nyc
#DataScience
#RStats
#LLMs
#AI
#databs
0
6
5
We're happy to announce that there will be another
#rstats
Tidy Development Day after the 2025 posit::conf in Atlanta!
www.tidyverse.org/blog/2025/07...
loading . . .
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
https://www.tidyverse.org/blog/2025/07/tdd-2025/
5 months ago
1
29
12
I used this for bundling up tidymodels docs, and it was really easy to use. It also made a huge difference in the LLM quality.
add a skeleton here at some point
5 months ago
2
26
3
reposted by
Max kuhn
Posit
6 months ago
Announcing Orbital for Python! For Scikit-learn users, this tool transforms your ML pipelines into SQL queries, letting predictions run directly in your database without a
#Python
environment. Learn more:
posit.co/blog/introdu...
0
38
18
Super exciting! Related news: we’re hooking up mirai to tidymodels too.
add a skeleton here at some point
6 months ago
0
29
6
reposted by
Max kuhn
Måns Thulin
6 months ago
I really enjoyed attending and speaking at R/Medicine this year! I learned a lot. Huge thanks to the organisers! My talk "Bootstrap inference made easy" is now available online:
www.youtube.com/watch?v=EeAt...
#Rstats
#Statsky
loading . . .
Bootstrap inference made easy: p-values and confidence intervals in one line of code
YouTube video by R Consortium
https://www.youtube.com/watch?v=EeAtvWF3twA
0
7
2
reposted by
Max kuhn
Posit
6 months ago
Ever wonder how the
#tidyverse
came to be? 🤔
#TheTestSet's
first episode features
@hadley.nz
on his accidental empire of
#RStats
packages, bear encounters, and more! Stream it at
thetestset.co
, Spotify, or Apple Podcasts.
#DataAnalytics
#PodcastLaunch
1
74
29
reposted by
Max kuhn
Simon P. Couch
6 months ago
vitals, an R package for LLM evaluation, is now on
#rstats
CRAN!🧸 Specifically aimed at folks building with ellmer, the package will help you engineer prompts, choose models, and measure cost/latency/performance rigorously.
www.tidyverse.org/blog/2025/06...
loading . . .
Introducing vitals, a toolkit for evaluating LLM products in R
The first release of vitals, a package for large language model evaluation in R, just made it to CRAN.
https://www.tidyverse.org/blog/2025/06/vitals-0-1-0/
1
61
15
reposted by
Max kuhn
Tomasz Kalinowski
6 months ago
I’m very pleased to announce that a preview of the 3rd Edition of Deep Learning with R is now available!
mng.bz/X7xa
The book is 50% off through July 8!
loading . . .
Deep Learning with R, Third Edition
Deep learning from the ground up using R and the powerful Keras library!</b> Deep Learning with R, Third Edition</i> introduces deep learning from scratch with examples that use the R language and th...
https://mng.bz/X7xa
0
100
19
reposted by
Max kuhn
Henrik Bengtsson
6 months ago
The future package turns ten today 🥳 To celebrate, I’ll start a blog series covering recent improvements that set us up for new, exciting ways for writing concurrent
#RStats
- neater than what our trusty workhorses future.apply & furrr offer
www.jottr.org/2025/06/19/f...
#parallel
#futureverse
2
41
11
Load more
feeds!
log in