@datascienceweekly.bsky.social
📤 55
📥 321
📝 25
Data Science Weekly - Issue 630, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 630
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-630?r=fsqv&utm_campaign=post&utm_medium=web
1 day ago
0
0
0
reposted by
Daniel Chen
10 days ago
hi
#python
#databs
folks! I'll be giving a talk at
#pydata
global today on how we can aim to use
#llm
in
#datascience
. Repository, code, slides are all up here:
github.com/chendaniely/...
loading . . .
GitHub - chendaniely/pydata-global-2025-llm: My talk for PyData Global 2025
My talk for PyData Global 2025. Contribute to chendaniely/pydata-global-2025-llm development by creating an account on GitHub.
https://github.com/chendaniely/pydata-global-2025-llm
0
7
1
reposted by
Russ Poldrack
4 days ago
First section of a new chapter on Data Management in my Better Code, Better Science series:
russpoldrack.substack.com/p/data-manag...
loading . . .
Data management
Better Code, Better Science: Chapter 7, Part 1
https://russpoldrack.substack.com/p/data-management
0
15
2
reposted by
Dr Abeba Birhane
4 days ago
I wrote this brief talk on why “augmenting diversity” with LLMs is empirically unsubstantiable, conceptually flawed, and epistemically harmful and a nice surprise to see the organisers have made it public
synthetic-data-workshop.github.io/papers/13.pdf
19
823
268
reposted by
John Paul Helveston
6 days ago
My intro to programming in
#rstats
class materials are all openly available
p4a.seas.gwu.edu
loading . . .
EMSE 4571: Intro to Programming for Analytics – Intro to Programming for Analytics
https://p4a.seas.gwu.edu/
1
9
1
reposted by
Maëlle Salmon
5 days ago
New Post on
@ropensci.org
: Better
#RStats
Code, Without Any Effort, Without Even AI Edited by
@etiennebacher.bsky.social
& Steffi LaZerte Read about: ✨ {lintr} for detecting lints ✨ Air for formatting code ✨ jarl for detecting+fixing lints ✨ {flir} for refactoring
ropensci.org/blog/2025/12...
loading . . .
Better Code, Without Any Effort, Without Even AI
Useful local, free, deterministic tools to improve your code
https://ropensci.org/blog/2025/12/15/better-code/
0
37
11
reposted by
Dr. Dominic Royé
5 days ago
🚨 New
#DataViz
post! I explore smart alternatives to a broken chart and highlight why avoiding bad practices matters. 👉 Which alternative do you prefer? Let us know in the comments!
#rstats
dominicroye.github.io/blog/2025-12...
loading . . .
Broken Chart: discover 9 visualization alternatives
Researcher in climate science at MBG-CSIC
https://dominicroye.github.io/blog/2025-12-14-broken-charts/
2
47
15
reposted by
Lynn Cherny
1 day ago
Herdling (the game) is really beatiful, has lots of cave paintings and shrines in the woods— strong rec.
store.steampowered.com/app/3047750/...
loading . . .
Herdling on Steam
Guide a herd of mysterious creatures on a stirring and beautiful journey into the mountains… and beyond.
https://store.steampowered.com/app/3047750/Herdling/
0
7
1
reposted by
Stephen Turner
1 day ago
Eleven quick tips for organizing a data cleaning challenge
journals.plos.org/ploscompbiol...
0
2
2
reposted by
rmoff 🏃♂️🫖🥓
2 days ago
Hey check it out - I've now got a Substack! It's just the same content as on my blog, but if you prefer getting an email then feel free to subscribe.
interestinglinks.substack.com/p/interestin...
(I've moved the publication domain, just in case you happen to have bookmarked the previous one…)
0
6
2
reposted by
Allen Downey
8 days ago
Thanks to my friends at @datascienceweekly for featuring Probably Overthinking It ... now available in paperback!
add a skeleton here at some point
1
5
1
reposted by
Julien Le Dem
8 days ago
In the past few years, we’ve seen a cambrian explosion of new columnar formats, challenging the hegemony of Parquet. Presumably, the design of yore is not going to cut it moving forward. I spent some time to understand a bit better how things actually changed.
sympathetic.ink/2025/12/11/C...
loading . . .
Column Storage for the AI Era
In the past few years, we’ve seen a cambrian explosion of new columnar formats, challenging the hegemony of Parquet: Lance, Fastlanes, Nimble, Vortex, AnyBlox, F3 (File Format for the Future). The thi...
https://sympathetic.ink/2025/12/11/Column-Storage-for-the-AI-era.html
1
30
5
Data Science Weekly - Issue 629, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 629
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-629?r=fsqv&utm_campaign=post&utm_medium=web
8 days ago
0
1
1
reposted by
juanitorduz
27 days ago
Here is the recording of my talk PyData Berlin 2025: Introduction to Stochastic Variational Inference with NumPyro Notebook:
juanitorduz.github.io/intro_svi/
youtu.be/wG0no-mUMf0?...
#pydata
#berlin
#bayes
loading . . .
Scaling Probabilistic Models with Variational Inference
YouTube video by PyData
https://youtu.be/wG0no-mUMf0?si=MOf5NdzqBLvaTMN9
0
12
4
reposted by
Andrew Heiss
10 days ago
Some closing thoughts for my students this semester on LLMs and learning
#rstats
datavizf25.classes.andrewheiss.com/news/2025-12...
13
324
131
reposted by
Paula Moraga
11 days ago
This semester I taught Spatial Data Science with
#rstats
Students analyzed areal, geostatistical & point pattern data, creating fantastic projects on disease mapping 🗺️ air pollution 🏭 crime 🚨 & species modeling 🐾 Book freely available: 👉
paulamoraga.com/book-spatial/
2
39
13
reposted by
Richard McElreath 🐈⬛
11 days ago
I'm teaching Statistical Rethinking again starting Jan 2026. This time with live lectures, divided into Beginner and Experienced sections. Will be a lot more work for me, but I hope much better for students. I will record lectures & all will be found at this link:
github.com/rmcelreath/s...
8
483
195
reposted by
Sydney Mathematical Research Institute (SMRI)
9 days ago
Gradient optimization methods: the benefits of instability — Peter Bartlett, UC Berkeley
www.youtube.com/watch?v=wEgT...
#MathSky
#SMRISeminar
loading . . .
Gradient optimization methods: the benefits of instability
YouTube video by Sydney Mathematical Research Institute - SMRI
https://www.youtube.com/watch?v=wEgTda8TY-M
0
15
6
reposted by
Nicola Rennie
10 days ago
Here's a little round up of my 2025 year in
#DataViz
featuring 💜 Some of my favourite
#RStats
charts 💜 A look back at 5 years of
#TidyTuesday
💜 Links to cool
#QuartoPub
and visualisation things I've seen this year Link:
nrennie.rbind.io/blog/year-in...
loading . . .
My year in data visualisation – Nicola Rennie
A round up of my projects this year, highlighting some of my favourite charts from 2025, and looking back on five years of TidyTuesday contributions.
https://nrennie.rbind.io/blog/year-in-data-viz-2025/
1
35
7
reposted by
Cynthia Dunlop
10 days ago
Why write engineering blogs? Here’s how some of our favorite bloggers responded to the question “Why did you start blogging and why do you continue?”
writethatblog.substack.com/p/why-write-...
1
23
7
reposted by
Andrea Lathrop
9 days ago
I have asked this, before, but is there anywhere else online that good Cog Sci/AI/ML discussions are ongoing, where I can at least eavesdrop academically? Twitter/X and Bluesky both feel kind of... sparse... and I might like a more concentrated topic focus, anyway.
3
6
3
Data Science Weekly - Issue 628, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 628
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-628?r=fsqv&utm_campaign=post&utm_medium=web
16 days ago
0
0
0
reposted by
David Keyes
19 days ago
Ever since we started making documents in Quarto and Typst, I've wanted to make EVERYTHING in Quarto and Typst. Curious to learn how make documents like these? Boy, do I have the blog post for you!
#rstats
rfortherestofus.com/2025/11/quar...
loading . . .
0
52
12
reposted by
Lynn Cherny
16 days ago
TIL
allmaps.org
- lots of great svg annotations on historical maps, and help with labeling
1
10
6
reposted by
Frank Corso
20 days ago
Came across GameShell via this week's
@datascienceweekly.bsky.social
. It's a terminal game that helps someone learn/practice shell commands. Pretty cool idea:
github.com/phyver/GameS...
loading . . .
GitHub - phyver/GameShell: a game to learn (or teach) how to use standard commands in a Unix shell
a game to learn (or teach) how to use standard commands in a Unix shell - phyver/GameShell
https://github.com/phyver/GameShell
0
1
1
Data Science Weekly - Issue 627, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 627
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-627?r=fsqv&utm_campaign=post&utm_medium=web
22 days ago
0
0
0
reposted by
Samuel Tingle
24 days ago
🚨📢 Incredibly proud to share our new paper and R package on non-linear regression modelling for medical professionals; making it simple! 🔗 Open access paper:
doi.org/10.1093/post...
#rstats
#nonlinear
#biostatistics
More details 🧵👇👇👇
1
7
2
reposted by
Jakub Nowosad
25 days ago
🔍 David O’Sullivan explores how to generate random points on the globe in R -- from uniform random to Halton sequences and blue-noise sampling. Read more:
dosull.github.io/posts/2025-0...
#RStats
#RSpatial
#GISchat
0
12
4
reposted by
nixCraft
23 days ago
a game to learn (or teach) how to use standard commands in a Unix or Linux shell. GameShell is available in English, French and Italian.
github.com/phyver/GameS...
loading . . .
GitHub - phyver/GameShell: a game to learn (or teach) how to use standard commands in a Unix shell
a game to learn (or teach) how to use standard commands in a Unix shell - phyver/GameShell
https://github.com/phyver/GameShell
0
61
14
reposted by
Alan Rogers
22 days ago
I taught (and co-taught) a course on human population genetics from 2000-2024. Having retired, I'm now making all the course materials public:
github.com/alanrogers/p...
#popgen
#evbio
loading . . .
GitHub - alanrogers/popgen: A course on population genetics
A course on population genetics. Contribute to alanrogers/popgen development by creating an account on GitHub.
https://github.com/alanrogers/popgen
4
256
92
reposted by
Ev Fedorenko
24 days ago
It has been so so fun to think with some of my favorite scientists about what it means to understand!
add a skeleton here at some point
0
54
10
reposted by
Alexander Reelsen
24 days ago
Building a Durable Execution Engine With SQLite Great article with accompanying Java code, that uses annotations, bytebuddy and sqlite to be able to resume jobs. Great to follow explanations plus code.
loading . . .
Building a Durable Execution Engine With SQLite
Lately, there has been a lot of excitement around Durable Execution (DE) engines. The basic idea of DE is to take (potentially long-running) multi-step workflows, such as processing a purchase order…
https://www.morling.dev/blog/building-durable-execution-engine-with-sqlite/
0
7
2
reposted by
Hackaday
23 days ago
One-Way Data Extraction For Logging On Airgapped Systems
loading . . .
One-Way Data Extraction For Logging On Airgapped Systems
Hackaday Article
https://hackaday.com/2025/11/27/one-way-data-extraction-for-logging-on-airgapped-systems/
3
82
12
reposted by
Adam Austin
23 days ago
Tis the season! Friendly annual reminder about my {secretsanta} 📦 for running your group's surprise gifting event. I didn't get a chance to test it out yet this year (Gmail auth changes sometimes) so please report any bugs.
github.com/ataustin/sec...
#RStats
loading . . .
GitHub - ataustin/secretsanta: he's making a list, sampling it twice
he's making a list, sampling it twice. Contribute to ataustin/secretsanta development by creating an account on GitHub.
https://github.com/ataustin/secretsanta
0
5
1
reposted by
rmoff 🏃♂️🫖🥓
23 days ago
⚡️ Writing an abstract for a lightning talk ⚡️
rmoff.net/2022/08/31/%...
loading . . .
⚡️ Writing an abstract for a lightning talk ⚡️
(src)
https://rmoff.net/2022/08/31/%EF%B8%8F-writing-an-abstract-for-a-lightning-talk-%EF%B8%8F/
0
4
1
reposted by
rmoff 🏃♂️🫖🥓
23 days ago
Wait what's this?! I've managed to pull my finger out to ship a list of **a whole lot of interesting links about stuff going on in the world of data for November**, a whole three days left until the end of the month!
rmoff.net/2025/11/26/i...
loading . . .
Interesting links - November 2025
https://rmoff.net/2025/11/26/interesting-links-november-2025
1
13
5
reposted by
Grant McDermott
23 days ago
Happy `tinyplot` v0.6.0 (codename "Thanksgiving") release day to all those that celebrate. Some new features, but mostly bug fixes and internal improvements.
grantmcdermott.com/tinyplot/NEW...
#rstats
1
25
6
Some thoughts on Math Textbooks and how to choose one:
open.substack.com/pub/kidswhol...
loading . . .
How Math Textbooks Are Written (and How to Choose One)
When learning at home with a math-loving kid, choose books written by people who clearly “give a damn.”
https://open.substack.com/pub/kidswholovemath/p/how-math-textbooks-are-written-and?r=2s5u8z&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
29 days ago
0
1
0
Data Science Weekly - Issue 626, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 626
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-626?r=fsqv&utm_campaign=post&utm_medium=web
29 days ago
0
1
0
reposted by
PyData Global
about 2 months ago
We received so many additional proposals for PyData Boston 2025 - THANK YOU! 🎉 PyData Boston 2025 will feature 3 days of talks, tutorials, lightning talks, Keynote sessions, and more. Connect with fellow members of the PyData community, and get your tickets now!
pydata.org/boston2025
0
3
1
reposted by
DuckDB
about 2 months ago
The PyData Amsterdam 2025 keynote “Minus Three Tier: Data Architecture Turned Upside Down” by
@hannes.muehleisen.org
is out now.
www.youtube.com/watch?v=DxwD...
loading . . .
KEYNOTE: Hannes Mühleisen - Data Architecture Turned Upside Down | PyData Amsterdam 2025
YouTube video by PyData
https://www.youtube.com/watch?v=DxwDaoUijTc
1
25
5
reposted by
Andrew Heiss
about 1 month ago
Here are all the assignments, btw, all CC-licensed for anyone to adapt from!
#rstats
#dataviz
github.com/andrewheiss/...
add a skeleton here at some point
0
40
8
reposted by
Stephen Turner
about 1 month ago
gggenomes: A Grammar of Graphics for Comparative Genomics
thackl.github.io/gggenomes/
#Rstats
loading . . .
A Grammar of Graphics for Comparative Genomics
An extension of ggplot2 for creating complex genomic maps. It builds on the power of ggplot2 and tidyverse adding new ggplot2-style geoms & positions and dplyr-style verbs to manipulate the…
https://thackl.github.io/gggenomes/
0
67
31
reposted by
Sung Kim
about 1 month ago
Meta has released the beta version of Pyrefly Pyrefly is a super fast, open source language server and typechecker for Python, first released in April 2025. Project:
pyrefly.org
Sandbox:
pyrefly.org/sandbox/
Repo:
github.com/facebook/pyr...
loading . . .
Pyrefly: A Fast Python Type Checker and Language Server | Pyrefly
https://pyrefly.org/
0
15
2
reposted by
Jenna Jordan
about 1 month ago
Excellent advice for job seekers from Abigail Haddad, succinctly summed up by an amazing tl;dr title: Make Things, Tell People
presentofcoding.substack.com/p/make-thing...
loading . . .
Make Things, Tell People
On side projects and finding work
https://presentofcoding.substack.com/p/make-things-tell-people
1
7
3
reposted by
Lynn Cherny
30 days ago
The online edition of
@hardmaru.bsky.social
new book on neuroevolution, Harnessing Creativity in AI Agent Design
neuroevolutionbook.com
loading . . .
- Neuroevolution
https://neuroevolutionbook.com
0
3
1
reposted by
Mickaël CANOUIL, Ph.D.
30 days ago
After years of working with Quarto, I've refined a set of editor settings that significantly improve document editing efficiency. My latest blog post shares these battle-tested configurations.
mickael.canouil.fr/posts/2025-1...
#Quarto
#VSCode
#Positron
#Productivity
#DataScience
loading . . .
Optimising VS Code and Positron for Quarto: Essential Settings for Better Editing – MCU
Discover the custom settings I use in VS Code and Positron to enhance my Quarto document editing workflow, from improved Git diffs to better visual guides for nested divs.
https://mickael.canouil.fr/posts/2025-11-20-quarto-editor-settings/
0
35
9
Thinking about Mathematical Maturity in Elementary School... Helping move kids from *doing math* to *thinking mathematically.* Some thoughts:
about 1 month ago
1
0
0
Data Science Weekly - Issue 625, by @DataSciNews
open.substack.com/pub/datascie...
loading . . .
Data Science Weekly - Issue 625
Curated news, articles and jobs related to Data Science, AI, & Machine Learning
https://open.substack.com/pub/datascienceweekly/p/data-science-weekly-issue-625?r=fsqv&utm_campaign=post&utm_medium=web
about 1 month ago
0
0
0
reposted by
Jakub Nowosad
about 1 month ago
tmap or ggplot2 for maps? 🗺️ David O’Sullivan breaks down the trade-offs in a blog post. URL:
dosull.github.io/posts/2024-1...
#RStats
#RSpatial
#Maps
#tmap
#ggplot2
loading . . .
tmap vs. ggplot2 for mapping – Geospatial Stuff
For me at least the choice between ggplot2 and tmap is an ongoing question. Here are my latest thoughts on the subject (with code).
https://dosull.github.io/posts/2024-11-16-tmap-vs-ggplot/tmap4-vs-ggplot2.html
0
37
19
Load more
feeds!
log in