Max Kuhn
@topepo.bsky.social
📤 4874
📥 297
📝 167
Writing modeling packages at
@posit.co
(née RStudio). Opinions are my own.
https://max-kuhn.org/
pinned post!
I last posted here about 6 months ago. Here's what I've been working on and/or thinking about.
#rstats, #statistics, #ml
package upkeep: we are doing major preventive maintenance on the tidymodels packages ("upkeep week!"). It's rote but very rewarding work. Error messages are 100x better. 1/3
Applied Machine Learning for Tabular Data
https://aml4td.org
over 1 year ago
1
27
4
reposted by
Max kuhn
Davis Vaughan
1 day ago
Last week we released dplyr 1.2.0, but we left off something VERY important 🙂 `dplyr::if_else()` and `dplyr::case_when()` are now up to 30x faster and use 10x less memory! We dive into how we achieved these numbers in this new
#rstats
post!
tidyverse.org/blog/2026/02...
`dplyr::if_else()` and `dplyr::case_when()` are up to 30x faster
dplyr 1.2.0 comes with much faster and more memory efficient `if_else()` and `case_when()` functions!
https://tidyverse.org/blog/2026/02/dplyr-performance/
4
125
22
reposted by
Max kuhn
Nick Strayer
7 days ago
For more than a year I have been working on a brand new Jupyter Notebook editor for Positron. This is a ground-up build of a new Jupyter Notebook experience built to leverage all the knowledge and tools Posit/Positron brings to the data science table. 🧵#jupyter
1
26
8
reposted by
Max kuhn
Davis Vaughan
7 days ago
dplyr 1.2.0 is out now and we are SO excited!
- `filter_out()` for dropping rows
- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools
These are huge quality of life wins for
#rstats
!
tidyverse.org/blog/2026/02...
dplyr 1.2.0
dplyr 1.2.0 fills in some important gaps in dplyr's API: we've added a new complement to `filter()` focused on dropping rows, and we've expanded the `case_when()` family with three new recoding and re...
https://tidyverse.org/blog/2026/02/dplyr-1-2-0/
12
459
147
reposted by
Max kuhn
Rami Krispin
10 days ago
The hexagon here is priceless 😎
taf-society.github.io/caretForecast/
#rstats
#timeseries
caretForecast
Conformal Time Series Forecasting Using Machine Learning
https://taf-society.github.io/caretForecast/
1
17
3
reposted by
Max kuhn
Isabella Velásquez
16 days ago
Tomorrow at the Data Science Lab 🧪 we are hearing from the amazing
@theotheredgar.bsky.social
about the {mall} package: Run Natural Language Processing against your
#RStats
tibbles or
#Python
Polars DataFrames for sentiment analysis, text summaries, and more! Join us at 12 pm ET:
pos.it/dslab
1
18
3
reposted by
Max kuhn
The R Data Scientist
about 1 month ago
Using neural networks in R is still not obsolete in 2026
https://joshuamarie.com/posts/15-nn-r/
#rstats
#datascience
Using neural networks in R is still not obsolete in 2026
Summary: Explores R's neural networks in 2026, focusing on torch, tidymodels, and the kindling package for streamlined deep learning in R.
https://joshuamarie.com/posts/15-nn-r/
0
11
2
reposted by
Max kuhn
Davis Vaughan
about 1 month ago
I sent 200 pull requests using Claude Code and wrote about the experience. It's pretty wild! For dplyr releases, we send a PR any time we break an
#rstats
package. This release advances a lot of deprecated functions, triggering issues in many old packages!
blog.davisvaughan.com/posts/2026-0...
Semi-automating 200 Pull Requests with Claude Code – Davis Vaughan
https://blog.davisvaughan.com/posts/2026-01-09-claude-200-pull-requests/
6
61
14
reposted by
Max kuhn
Emil Hvitfeldt
about 2 months ago
We are excited to see that xgboost recently had a big CRAN release! We have worked hard on the tidymodels team to make sure you all have a smooth transition. Please let us know if you are experiencing any issues with the releases.
tidyverse.org/blog/2025/12...
#rstats
#tidymodels
tidymodels & xgboost
The tidymodels ecosystem is prepared for big xgboost CRAN release.
https://tidyverse.org/blog/2025/12/tidymodels-xgboost/
2
25
4
reposted by
Max kuhn
alex hayes
7 months ago
~~ making sense of academic statistics ~~ i wrote about the confusing relationship between statistics and data analysis, and also about how statistics relates to science
#statistics
#rstats
#datascience
www.alexpghayes.com/post/making-...
15
114
28
reposted by
Max kuhn
The R Data Scientist
2 months ago
Smoothed ROC Curves, Calculus and Curvature
https://rworks.dev/posts/roc-and-differential-geometry/
#rstats
#datascience
Smoothed ROC Curves, Calculus and Curvature
Summary: Smoothed ROC curves in R with monoH.FC splines, calculus, and curvature concepts for AUC and arc length.
https://rworks.dev/posts/roc-and-differential-geometry/
0
1
1
reposted by
Max kuhn
Emil Hvitfeldt
2 months ago
I'm excited to announce the newest release of {tidypredict}! This release brings standardization to outputs, faster trees for parsing and prediction, and glmnet support.
tidyverse.org/blog/2025/12...
#rstats
#tidymodels
tidypredict 1.0.0
tidypredict 1.0.0 brings faster computations for tree-based models, more efficient tree representations, glmnet model support, and a change in how random forests are handled.
https://tidyverse.org/blog/2025/12/tidypredict-1-0-0/
2
34
9
We’ve released two new tidymodels
#rstats
packages for feature selection: filter and important.
tidyverse.org/blog/2025/11...
Two New tidymodels Packages
Two new tidymodels packages focus on supervised feature selection.
https://tidyverse.org/blog/2025/11/two-new-tidymodels-packages/
3 months ago
1
39
7
reposted by
Max kuhn
Joe Kirincic
3 months ago
I’m not aware of an Arrow or Parquet format, but there is the ONNX format (see
onnx.ai
). Depending on the model, you could try Posit’s orbital project, which translates your model to SQL (see here
orbital.tidymodels.org
).
ONNX | Home
https://onnx.ai
0
0
2
reposted by
Max kuhn
The R Data Scientist
3 months ago
Jarl: just another R linter
https://r-consortium.org/posts/jarl-just-another-r-linter/
#rstats
#datascience
Jarl: just another R linter
Summary: Jarl is an executable Rust-based R linter that detects and fixes inefficient or buggy patterns in R code for CI and editors.
https://r-consortium.org/posts/jarl-just-another-r-linter/
0
6
1
reposted by
Max kuhn
Emil Hvitfeldt
3 months ago
We are excited to share a few updates happening in tidymodels in this blog post!
tidyverse.org/blog/2025/11...
Q3 2025 tidymodels digest
A summary of what has been going on for the tidymodels group in the mid 2025.
https://tidyverse.org/blog/2025/11/tidymodels-2025-q3/
0
8
1
reposted by
Max kuhn
Yani Bellini Saibene
3 months ago
🤖 Introduction to
#Tidymodels
Learn to use tidymodels to build and evaluate
#MachineLearning
models in
#RStats
With Francisco Cardozo and Edgar Ruiz 📅 Dec 1, 18:00–20:00 UTC-3 – Online 💵 Students USD 5 · Academics USD 10 · Industry USD 15 🔗
www.eventbrite.com.ar/e/1962543491...
#LatinR2025
0
8
6
We're hiring an open-source
#python
developer focused on modeling APIs!
tidyverse.org/blog/2025/11...
#numpy
#scipy
#scikitlearn
Python Open-Source Developer
Posit is hiring a Python open-source developer to create more data analysis tools.
https://tidyverse.org/blog/2025/11/python-open-source-developer/
3 months ago
0
34
15
My
#rstats
talk for the R+AI conference, titled "Analyzing
#LLM
Evaluations," starts at 1:40 EST today.
rconsortium.github.io/RplusAI_webs...
3 months ago
0
9
0
I just gave a 20m overview of the
#TabPFN
deep learning model at R/Pharma. The slides are at:
topepo.github.io/2025-r-pharma/
(link to Quarto files on the second slide)
3 months ago
0
7
1
We've made a blog post about a new major version of the
#rstats
tune package! Two main changes: parallel processing frameworks and the ability to tune postprocessors.
tidyverse.org/blog/2025/11...
tune version 2.0.0
A new version of tune brings significant updates for model tuning and parallel processing.
https://tidyverse.org/blog/2025/11/tune-2/
3 months ago
0
21
5
reposted by
Max kuhn
Simon P. Couch
3 months ago
I'm excited to share side::kick(), an experimental open-source coding agent for RStudio built entirely in R. It can interact with your files, communicate with your active
#rstats
session, and run code. Check it out:
github.com/simonpcouch/...
3
58
14
reposted by
Max kuhn
Lionel Henry
4 months ago
I'm excited to share a new version of the Air formatter for
#rstats
, with support for tabular formatting! Super useful with `tibble::tribble()` calls or `data.table::fcase()`. It's currently experimental as we're looking to get feedback on the feature, so please let us know what you think.
7
137
33
Before
#NoKings
in Hartford CT...
4 months ago
1
19
4
Great news!
4 months ago
0
3
0
reposted by
Max kuhn
Kelly Bodwin
5 months ago
Shannon's slides are always so unbelievably clear and helpful!!!
github.com/shannonpileg...
I'm having "Ohhhhh that's what that means" moments every 10 seconds here.
#positconf2025
2
38
15
reposted by
Max kuhn
Kevin Baer
5 months ago
I'm all in on
@topepo.bsky.social
and co's new {important} and other variable importance/feature selection tools in tidymodels!
#rstats
1
7
1
reposted by
Max kuhn
Ella Kaye
5 months ago
Once again,
@davisvaughan.bsky.social
's extrachecks have saved me from a likely CRAN rejection for an upcoming
#RStats
package submission. Thanks Davis!
github.com/DavisVaughan...
GitHub - DavisVaughan/extrachecks
Contribute to DavisVaughan/extrachecks development by creating an account on GitHub.
https://github.com/DavisVaughan/extrachecks
3
39
8
reposted by
Max kuhn
Adam L
5 months ago
Simon Wood, the GOAT of generalized additive models & creator of the mgcv
#rstats
package, has an Annual Review of Statistics essay on GAMs, available open access
#statssky
#mlsky
www.annualreviews.org/content/jour...
0
90
42
reposted by
Max kuhn
Daniel Gutierrez
5 months ago
ML success ≠ Kaggle leaderboard. The real world rewards:
- Clear explanations
- Thoughtful metrics
- Collaboration with domain experts
A 0.01 lift in F1 score won’t save you if no one understands your model.
#DataSciene
#MachineLearning
#AI
#RStats
0
4
1
reposted by
Max kuhn
Posit
5 months ago
Announcing a new blog series on LLMs from
@veerle.hypebright.nl
! In Part 2, “Talking to LLMs: From Prompt to Response”, we get hands-on with LLM-powered apps. This guide is for
#Python
&
#RStats
users who want to go beyond the basics. Check it out here:
shiny.posit.co/blog/posts/s...
2
20
6
Slides from my
#rstats
talk “Measuring LLM Effectiveness” at
#dataconfAI
with
@simonpcouch.com
.
topepo.github.io/2025_NYR/
Video in about a month. Great conference!
6 months ago
0
28
7
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
6 months ago
🧠📊 3 days. 2 workshops. 20 talks. 1 amazing community.
#dataconfAI
is officially wrapped! Thanks for showing up with insights, ideas, inspiration, and curiosity. And to all who made it unforgettable—speakers, attendees, sponsors, and volunteers. See you at the next one! 🚀
0
0
2
reposted by
Max kuhn
Simon P. Couch
6 months ago
In working on an eval for an experimental tidymodels AI assistant, I realized that today's frontier LLMs know much more about
#rstats
tidymodels than I thought.
www.simonpcouch.com/blog/2025-08...
1
19
4
It's a lot of fun! Everyone gets something out of it. Plus,
@davisvaughan.bsky.social
always finds a great barista!
6 months ago
1
6
0
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
6 months ago
Start off The NY Data Science & AI Conference w/ hands-on workshops on Aug 25 in NYC or online: 📊 Machine Learning in R w/ Max Kuhn 🤖 Intro to LLMs/AI w/ Daniel Chen 🎟️ Learn more & register at
dataconf.ai/nyc
#RStats
#AI
#Workshops
#databs
@topepo.bsky.social
@chendaniely.bsky.social
0
6
2
reposted by
Max kuhn
Emil Hvitfeldt
6 months ago
Excited to share my newest quarto revealjs plugin: imagemover. Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience.
github.com/EmilHvitfeld...
#quarto
6
205
60
reposted by
Max kuhn
Dr. U
6 months ago
Time to convert this into an LLM powered snippet using {chores} by
@simonpcouch.com
.
#useR2025
#rstats
0
4
1
reposted by
Max kuhn
Kelly Bodwin
6 months ago
Welp, {chores} by
@simonpcouch.com
is an immediate install for sure. Basically it's {usethis} plus llm bundled into RStudio/Positron key encoding. Excited!!! 🧹🧺
#useR2025
#rstats
#couchverse
?
0
28
7
reposted by
Max kuhn
Lander Analytics
6 months ago
Don't miss out learning from the best, Max Kuhn!
@topepo.bsky.social
#dataBS
#Tidymodels
#MachineLearning
0
2
1
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
6 months ago
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows! 📍Aug 25 in NYC or online 🎟️ & info:
dataconf.ai/nyc
#RStats
#Tidymodels
#MachineLearning
@topepo.bsky.social
0
7
3
We are super excited to have you join us for the day!
7 months ago
0
4
0
Positron is definitely visually more than RStudio, and this is a helpful overview.
7 months ago
1
8
1
reposted by
Max kuhn
Hannah Frick
7 months ago
The call for papers for LatinR 2025 (online) is now open! You can present in English, Spanish, or Portuguese 🗣️
#RStats
latinr.org/en/blog/en/2...
Call for papers – LatinR 2024
https://latinr.org/en/blog/en/2025-05-31-call-for-papers-2025.html
0
3
3
We've released 4 new chapters of Applied Machine Learning for Tabular Data. Includes: Bayesian optimization, feature selection, model comparisons, classification metrics, calibration,
#rstats
computing sections, and more
blog.aml4td.org/posts/2025-0...
Part 3 is Finished, Part 4 Started – Applied Predictive Modeling Blog
https://blog.aml4td.org/posts/2025-07-new-chapters/
7 months ago
1
49
8
reposted by
Max kuhn
Ilya Kashnitsky
7 months ago
A perfect URL does not exi... 😃
#rstats
🔗
rstats.wtf
1
28
5
Gotta catch 'em all! I want the Stacks Year of the Mango
7 months ago
0
10
0
reposted by
Max kuhn
Emil Hvitfeldt
7 months ago
Slides from yesterday's talk "Don't be dense, embracing sparsity in tidymodels" are live if you are interested!
emilhvitfeldt.github.io/talk-slc-spa...
2
22
6
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
7 months ago
The NY Data Science & AI Conference kicks off w/ workshops on Aug 25! 🧠 Machine Learning in R w/ Max Kuhn (
@topepo.bsky.social
) 🤖 Intro to LLMs/AI w/ Daniel Chen Join us in NYC or online & level up before the main event! 🎟️ Tix & more info:
dataconf.ai/nyc
#DataScience
#RStats
#LLMs
#AI
#databs
0
6
5
We're happy to announce that there will be another
#rstats
Tidy Development Day after the 2025 posit::conf in Atlanta!
www.tidyverse.org/blog/2025/07...
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
https://www.tidyverse.org/blog/2025/07/tdd-2025/
7 months ago
1
29
12
I used this for bundling up tidymodels docs, and it was really easy to use. It also made a huge difference in the LLM quality.
7 months ago
2
26
3