Max kuhn
@topepo.bsky.social
📤 4803
📥 293
📝 142
Writing modeling packages at
@posit.co
(née RStudio). Opinions are my own.
https://max-kuhn.org/
pinned post!
I last posted here about 6 months ago. Here's what I've been working on and/or thinking about.
#rstats
,
#statistics
,
#ml
package upkeep: we are doing major preventive maintenance on the tidymodels packages ("upkeep week!"). It's rote but very rewarding work. Error messages are 100x better. 1/3
loading . . .
Applied Machine Learning for Tabular Data
https://aml4td.org
11 months ago
1
27
4
reposted by
Max kuhn
Kelly Bodwin
5 days ago
Shannon's slides are always so unbelievably clear and helpful!!!
github.com/shannonpileg...
I'm having "Ohhhhh that's what that means" moments every 10 seconds here.
#positconf2025
add a skeleton here at some point
2
37
14
reposted by
Max kuhn
Kevin Baer
11 days ago
I'm all in on
@topepo.bsky.social
and co's new {important} and other variable importance/feature selection tools in tidymodels!
#rstats
1
7
2
reposted by
Max kuhn
Ella Kaye
20 days ago
Once again,
@davisvaughan.bsky.social
's extrachecks have saved me from a likely CRAN rejection for an upcoming
#RStats
package submission. Thanks Davis!
github.com/DavisVaughan...
loading . . .
GitHub - DavisVaughan/extrachecks
Contribute to DavisVaughan/extrachecks development by creating an account on GitHub.
https://github.com/DavisVaughan/extrachecks
3
39
9
reposted by
Max kuhn
Adam Lauretig
14 days ago
Simon Wood, the GOAT of generalized additive models & creator of the mgcv
#rstats
package, has an Annual Review of Statistics essay on GAMs, available open access
#statssky
#mlsky
www.annualreviews.org/content/jour...
0
88
42
reposted by
Max kuhn
Daniel Gutierrez
14 days ago
ML success ≠ Kaggle leaderboard. The real world rewards: - Clear explanations - Thoughtful metrics - Collaboration with domain experts A 0.01 lift in F1 score won’t save you if no one understands your model.
#DataSciene
#MachineLearning
#AI
#RStats
0
4
1
reposted by
Max kuhn
Posit
18 days ago
Announcing a new blog series on LLMs from
@veerle.hypebright.nl
! In Part 2, “Talking to LLMs: From Prompt to Response”, we get hands-on with LLM-powered apps. This guide is for
#Python
&
#RStats
users who want to go beyond the basics. Check it out here:
shiny.posit.co/blog/posts/s...
3
20
7
Slides from my
#rstats
talk “Measuring LLM Effectiveness” at
#dataconfAI
with
@simonpcouch.com
.
topepo.github.io/2025_NYR/
Video in about a month. Great conference!
27 days ago
1
29
8
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
27 days ago
🧠📊 3 days. 2 workshops. 20 talks. 1 amazing community.
#dataconfAI
is officially wrapped! Thanks for showing up with insights, ideas, inspiration, and curiosity. And to all who made it unforgettable—speakers, attendees, sponsors, and volunteers. See you at the next one! 🚀
0
1
3
reposted by
Max kuhn
Simon P. Couch
28 days ago
In working on an eval for an experimental tidymodels AI assistant, I realized that today's frontier LLMs know much more about
#rstats
tidymodels than I thought.
www.simonpcouch.com/blog/2025-08...
1
22
6
It's a lot of fun! Everyone gets something out of it. Plus,
@davisvaughan.bsky.social
always finds a great barista!
add a skeleton here at some point
about 1 month ago
1
7
1
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
about 1 month ago
Start off The NY Data Science & AI Conference w/ hands-on workshops on Aug 25 in NYC or online: 📊 Machine Learning in R w/ Max Kuhn 🤖 Intro to LLMs/AI w/ Daniel Chen 🎟️ Learn more & register at
dataconf.ai/nyc
#RStats
#AI
#Workshops
#databs
@topepo.bsky.social
@chendaniely.bsky.social
0
7
3
reposted by
Max kuhn
Emil Hvitfeldt
about 1 month ago
Excited to share my newest quarto revealjs plugin: imagemover Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience
github.com/EmilHvitfeld...
#quarto
loading . . .
8
205
61
reposted by
Max kuhn
Dr. U
about 1 month ago
Time to convert this into an LLM powered snippet using {chores} by
@simonpcouch.com
.
#useR2025
#rstats
add a skeleton here at some point
0
5
2
reposted by
Max kuhn
Kelly Bodwin
about 2 months ago
Welp, {chores} by
@simonpcouch.com
is an immediate install for sure. Basically it's {usethis} plus llm bundled into RStudio/Positron key encoding. Excited!!! 🧹🧺
#useR2025
#rstats
#couchverse
?
0
29
8
reposted by
Max kuhn
Lander Analytics
about 2 months ago
Don't miss out learning from the best, Max Kuhn!
@topepo.bsky.social
#dataBS
#Tidymodels
#MachineLearning
add a skeleton here at some point
0
2
1
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
about 2 months ago
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows! 📍Aug 25 in NYC or online 🎟️ & info:
dataconf.ai/nyc
#RStats
#Tidymodels
#MachineLearning
@topepo.bsky.social
0
8
5
We are super excited to have you join us for the day!
add a skeleton here at some point
about 2 months ago
0
4
0
Positron is definitely visually more than RStudio, and this is a helpful overview.
add a skeleton here at some point
about 2 months ago
1
8
1
reposted by
Max kuhn
Hannah Frick
2 months ago
The call for papers for LatinR 2025 (online) is now open! You can present in English, Spanish, or Portuguese 🗣️
#RStats
latinr.org/en/blog/en/2...
loading . . .
Call for papers – LatinR 2024
https://latinr.org/en/blog/en/2025-05-31-call-for-papers-2025.html
0
4
5
We've released 4 new chapters of Applied Machine Learning for Tabular Data. Includes: Bayesian optimization, feature selection, model comparisons, classification metrics, calibration,
#rstats
computing sections, and more
blog.aml4td.org/posts/2025-0...
loading . . .
Part 3 is Finished, Part 4 Started – Applied Predictive Modeling Blog
https://blog.aml4td.org/posts/2025-07-new-chapters/
2 months ago
2
50
9
reposted by
Max kuhn
Ilya Kashnitsky
2 months ago
A perfect URL does not exi... 😃
#rstats
🔗
rstats.wtf
2
29
6
Gotta catch 'em all! I want the Stacks Year of the Mango
add a skeleton here at some point
2 months ago
0
11
1
reposted by
Max kuhn
Emil Hvitfeldt
2 months ago
Slides from yesterdays talk "Don't be dense, embracing sparsity in tidymodels" are live if you are interested!
emilhvitfeldt.github.io/talk-slc-spa...
3
23
7
reposted by
Max kuhn
The Data Science & AI Conference Presented by Lander Analytics
2 months ago
The NY Data Science & AI Conference kicks off w/ workshops on Aug 25! 🧠 Machine Learning in R w/ Max Kuhn (
@topepo.bsky.social
) 🤖 Intro to LLMs/AI w/ Daniel Chen Join us in NYC or online & level up before the main event! 🎟️ Tix & more info:
dataconf.ai/nyc
#DataScience
#RStats
#LLMs
#AI
#databs
0
7
6
We're happy to announce that there will be another
#rstats
Tidy Development Day after the 2025 posit::conf in Atlanta!
www.tidyverse.org/blog/2025/07...
loading . . .
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
https://www.tidyverse.org/blog/2025/07/tdd-2025/
2 months ago
1
28
13
I used this for bundling up tidymodels docs, and it was really easy to use. It also made a huge difference in the LLM quality.
add a skeleton here at some point
2 months ago
2
27
4
reposted by
Max kuhn
Posit
2 months ago
Announcing Orbital for Python! For Scikit-learn users, this tool transforms your ML pipelines into SQL queries, letting predictions run directly in your database without a
#Python
environment. Learn more:
posit.co/blog/introdu...
0
39
19
Super exciting! Related news: we’re hooking up mirai to tidymodels too.
add a skeleton here at some point
2 months ago
0
31
7
reposted by
Max kuhn
Måns Thulin
3 months ago
I really enjoyed attending and speaking at R/Medicine this year! I learned a lot. Huge thanks to the organisers! My talk "Bootstrap inference made easy" is now available online:
www.youtube.com/watch?v=EeAt...
#Rstats
#Statsky
loading . . .
Bootstrap inference made easy: p-values and confidence intervals in one line of code
YouTube video by R Consortium
https://www.youtube.com/watch?v=EeAtvWF3twA
0
8
3
reposted by
Max kuhn
Posit
3 months ago
Ever wonder how the
#tidyverse
came to be? 🤔
#TheTestSet's
first episode features
@hadley.nz
on his accidental empire of
#RStats
packages, bear encounters, and more! Stream it at
thetestset.co
, Spotify, or Apple Podcasts.
#DataAnalytics
#PodcastLaunch
1
75
32
reposted by
Max kuhn
Simon P. Couch
3 months ago
vitals, an R package for LLM evaluation, is now on
#rstats
CRAN!🧸 Specifically aimed at folks building with ellmer, the package will help you engineer prompts, choose models, and measure cost/latency/performance rigorously.
www.tidyverse.org/blog/2025/06...
loading . . .
Introducing vitals, a toolkit for evaluating LLM products in R
The first release of vitals, a package for large language model evaluation in R, just made it to CRAN.
https://www.tidyverse.org/blog/2025/06/vitals-0-1-0/
2
62
16
reposted by
Max kuhn
Tomasz Kalinowski
3 months ago
I’m very pleased to announce that a preview of the 3rd Edition of Deep Learning with R is now available!
mng.bz/X7xa
The book is 50% off through July 8!
loading . . .
Deep Learning with R, Third Edition
Deep learning from the ground up using R and the powerful Keras library!</b> Deep Learning with R, Third Edition</i> introduces deep learning from scratch with examples that use the R language and th...
https://mng.bz/X7xa
0
101
20
reposted by
Max kuhn
Henrik Bengtsson
3 months ago
The future package turns ten today 🥳 To celebrate, I’ll start a blog series covering recent improvements that set us up for new, exciting ways for writing concurrent
#RStats
- neater than what our trusty workhorses future.apply & furrr offer
www.jottr.org/2025/06/19/f...
#parallel
#futureverse
3
42
12
reposted by
Max kuhn
Thom Volker
3 months ago
After two weeks, I'm finally done! In this post, I explain different approaches for solving linear regression in R: directly, using QR, singular value and Cholesky decompositions, and do some benchmarking for comparison with in-built approaches.
thomvolker.github.io/blog/2506_re...
add a skeleton here at some point
7
110
38
reposted by
Max kuhn
Posit
3 months ago
Data science junkies, get ready! 🚀 "The Test Set"
#podcast
trailer is here for your viewing pleasure. Tune in July 1st and every Tuesday after for new episodes with hosts
@mchow.com
,
@hadley.nz
, and
@wesmckinney.com
as they welcome thought leaders in
#DataScience
. Subscribe now:
pos.it/thetestset
loading . . .
5
105
39
reposted by
Max kuhn
CorrinaLeeB 🇨🇦
3 months ago
Happy Obama Appreciation Day 🇺🇸✨ On this day, we honor leadership grounded in hope, dignity, and class. President Barack Obama reminded us that intelligence, empathy, and calm strength belong in the White House. 💙🇺🇸
#HappyObamaAppreciationDay
#NoKingDay
loading . . .
111
5325
1664
reposted by
Max kuhn
Emil Hvitfeldt
4 months ago
Being able to productionize a ML model is often the goal, however there are many things to keep track of when you do. The orbital package lets you translate your fitted scikit-learn or tidymodels model into SQL that that when run produces predictions.
posit.co/blog/databri...
#python
#rstats
loading . . .
Posit
Accelerate model deployment with Databricks and Orbital for R and Python Scikit-learn/Tidymodels projects.
https://posit.co/blog/databricks-orbital-r-python-model-deployment/
0
31
8
reposted by
Max kuhn
Simon P. Couch
4 months ago
Introducing acquaint, an R package that turns your R sessions into a Model Context Protocol (MCP) server. This allows MCP-enabled tools like Claude Desktop and Claude Code to run
#rstats
code _in your active R sessions_ to explore objects, read documentation, etc.
posit-dev.github.io/acquaint/
loading . . .
13
150
36
reposted by
Max kuhn
Jakub Nowosad
4 months ago
New blog post by Jan Linnenbrink: Spatial machine learning with caret 📍 Using `caret` to predict air temperature in Spain with spatial data, addressing autocorrelation and extrapolation with `blockCV` and `CAST`. Read here:
geocompx.org/post/2025/sm...
#rstats
#SpatialML
#rspatial
0
15
5
reposted by
Max kuhn
Jakub Nowosad
4 months ago
🆕 The "Machine learning of spatial data" section is now live on the CRAN Spatial Task View! 🌍 Check it out at
cran.r-project.org/view=Spatial
Have suggestions or improvements? Contributions are welcome!💡
#rstats
#rspatial
#MachineLearning
#gischat
0
18
11
reposted by
Max kuhn
Kelly Bodwin
5 months ago
Slides from my
#SDSS
talk today on designing an Intermediate
#rstats
course:
kbodwin.github.io/Talks-and-Pr...
(Skip to the end if you want to get on a mailing list for sharing the materials we're making!) Thank you to all who attended an early Friday talk and asked great questions! 🤩
loading . . .
Teaching Intermediate R
https://kbodwin.github.io/Talks-and-Presentations/SDSS_2025/Intermediate_R.html
1
33
6
reposted by
Max kuhn
JoCo Cruise
5 months ago
It's the First of May! Make sure you go... hug... outside sometime soon! .
#jonathancoulton
#jococruise
loading . . .
0
14
2
reposted by
Max kuhn
Emil Hvitfeldt
5 months ago
Happy to share that {recipes} has a new release with many new features and all known bugs exterminated!
www.tidyverse.org/blog/2025/04...
#rstats
#tidymodels
loading . . .
recipes 1.3.0
This release brings changes for strings_as_factors, step_select(), step_dummy(), and step_impute_bag().
https://www.tidyverse.org/blog/2025/04/recipes-1-3-0/
0
33
10
reposted by
Max kuhn
Thomas Lin Pedersen
5 months ago
I'm pleased to announce the release of scales 1.4.0 for
#rstats
. While scales mainly exists to serve
#ggplot2
, this release packs a bunch of improvements that is good to be aware of. Read more in the blog post
loading . . .
scales 1.4.0
The new 1.4.0 release of the scales package adds some colourful updates. Read about colour manipulation, palettes and new label functions.
https://www.tidyverse.org/blog/2025/04/scales-1-4-0/
2
137
28
reposted by
Max kuhn
Zeta Of 1
6 months ago
Me to my students today: look you really shouldn't fit a fifth degree polynomial to your data that's almost never appropriate The US stock market on the very same day:
0
25
6
reposted by
Max kuhn
Simon P. Couch
6 months ago
New on my blog: I've been seeing lots of hype claiming that Gemini 2.5 Pro is the new state of the art since the model's release last week. How well does it know
#rstats
?
www.simonpcouch.com/blog/2025-04...
1
30
7
reposted by
Max kuhn
Hannah Frick
6 months ago
rsample 1.3.0 is on CRAN! This release contains a more flexible grouping for bootstrap confidence intervals as well as many tidy dev day contributions as general upkeep.
#RStats
www.tidyverse.org/blog/2025/04...
loading . . .
rsample 1.3.0
This release brings more flexibilty to the grouping of bootstrap confidence intervals. It also contains many contributions from the tidyverse developer day.
https://www.tidyverse.org/blog/2025/04/rsample-1-3-0/
0
26
10
reposted by
Max kuhn
Mine Çetinkaya-Rundel
6 months ago
New blog post on the Tidyverse blog: Learning the
#tidyverse
with the help of
#ai
tools
#rstats
www.tidyverse.org/blog/2025/04...
loading . . .
Learning the tidyverse with the help of AI tools
Tips and recommendations for learning the tidyverse with AI tools.
https://www.tidyverse.org/blog/2025/04/learn-tidyverse-ai/
1
42
13
What truly sets these stickers apart as the greatest achievement in tidyverse history is how they perfectly encapsulate the essence of tidy data principles. They've done what no package update ever could—made statistical modeling approachable, cuddly, and downright irresistible!
add a skeleton here at some point
6 months ago
0
7
1
Load more
feeds!
log in