rmoff 🏃♂️🫖🥓
@rmoff.net
📤 5047
📥 822
📝 718
Shitposting & Memes. Data & Stuff.
#dataBS
#trailrunning
🔗
https://rmoff.info
pinned post!
My most popular blog post is one I wrote about Kafka listener configuration. But the one that I am most proud of when people says it's been useful for them is when I wrote about trying to get the balance right when travel is part of your job:
rmoff.net/2019/02/09/t...
.
loading . . .
Travelling for Work, with Kids at Home
https://rmoff.net/2019/02/09/travelling-for-work-with-kids-at-home/
11 months ago
4
26
2
reposted by
rmoff 🏃♂️🫖🥓
DuckDB
5 days ago
🚀 We released version 0.3 of the DuckLake specification and the DuckDB ducklake extension today. It includes interoperability with Iceberg, support for geometry types and more. Check the announcement blog for more details
ducklake.select/2025/09/17/d...
0
35
12
reposted by
rmoff 🏃♂️🫖🥓
Lukas Eder
6 days ago
"5 HoUrS oF TrIaL AnD ErRoR SaVeS YoU 10 MiNuTeS oF ReAdInG ThE DoCuMeNtAtIoN." The documentation:
2
20
2
reposted by
rmoff 🏃♂️🫖🥓
DuckDB
7 days ago
📈 DuckDB 1.4.0 is out! This is our first LTS release which comes with *one year of community support*. It also supports database encryption, the MERGE SQL statement and Iceberg writes. For more details, read the announcement blog post at
duckdb.org/2025/09/16/a...
0
53
25
reposted by
rmoff 🏃♂️🫖🥓
Hannes Mühleisen
7 days ago
We're testing a new distribution channel for
@duckdb.org
:
#docker
images. For now they live at `hfmuehleisen/duckdb`, feel free to test them out. And yes, hell got a little colder today.
hub.docker.com/r/hfmuehleis...
loading . . .
https://hub.docker.com/r/hfmuehleisen/duckdb
0
21
3
I've been trying to learn more about some of the concepts in the
#AI
space, and writing things up as I go. Here's my third blog post in the series, looking at
#RAG
with some hands-on examples.
rmoff.net/2025/09/12/s...
Check out the other articles, covering MCP & Models:
rmoff.net/categories/s...
loading . . .
Stumbling into AI: Part 3—RAG
A short series of notes for myself as I learn more about the AI ecosystem as of September 2025. The driver for all this is understanding more about Apache Flink’s Flink Agents project, and…
https://rmoff.net/2025/09/12/stumbling-into-ai-part-3rag/
8 days ago
0
6
0
Blog Writing for Developers
rmoff.net/2023/07/19/b...
loading . . .
Blog Writing for Developers
Writing is one of the most powerful forms of communication, and it’s useful in a multitude of roles and contexts. As a blog-writing, documentation-authoring, twitter-shitposting DevEx engineer I…
https://rmoff.net/2023/07/19/blog-writing-for-developers/
11 days ago
0
5
1
I really enjoyed this talk from Paul Iusztin at QCon London. Covers lots of concepts around like
#LLMs
,
#RAG
, and
#Agents
and explains them well.
www.infoq.com/presentation...
loading . . .
The Data Backbone of LLM Systems
Drawing from his 8 years of experience in AI, Paul Iusztin breaks down the core components of a scalable architecture, emphasizing the importance of RAG. He shares practical patterns, including the Fe...
https://www.infoq.com/presentations/llm-data-code-model-prompt/
12 days ago
1
4
0
Write more blog articles, not fewer (Don’t leave the scraps on the cutting floor)
rmoff.net/2025/03/11/w...
loading . . .
Write more blog articles, not fewer (Don’t leave the scraps on the cutting floor)
Some would say that the perfect blog article takes the reader on a journey on in which the development process looks like this:
https://rmoff.net/2025/03/11/write-more-blog-articles-not-fewer-dont-leave-the-scraps-on-the-cutting-floor/
12 days ago
0
7
1
Love it - someone used SQL to build… a DOOM-like multiplayer shooter 😁
cedardb.com/blog/doomql/
loading . . .
Building a DOOM-like multiplayer shooter in pure SQL
CedarDB is a database system that delivers unmatched performance for transactions and analytics, from small writes to handling billions of rows. Built on cutting-edge research to power today’s tools…
https://cedardb.com/blog/doomql/
13 days ago
0
4
1
Part 2 is out! In which I try to get my head around LLMs and the like
rmoff.net/2025/09/08/s...
add a skeleton here at some point
14 days ago
0
7
1
A new blog post, in which I write up my learning about MCP 👇
rmoff.net/2025/09/04/s...
19 days ago
0
5
1
PopSQL Moves to Limited Support :-(
popsql.com/blog/popsql-...
loading . . .
PopSQL Moves to Limited Support — Here’s What You Need to Know
PopSQL will be moving to limited support.
https://popsql.com/blog/popsql-limited-support
19 days ago
0
0
1
Apache Kafka 4.1.0 released
kafka.apache.org/blog#apache_...
loading . . .
Apache Kafka
Apache Kafka: A Distributed Streaming Platform.
https://kafka.apache.org/blog#apache_kafka_410_release_announcement
19 days ago
0
3
0
reposted by
rmoff 🏃♂️🫖🥓
Matt Webb 🌸🌼🌸
20 days ago
A 1,000 episodes to explore Here's a map on my unofficial archive site, Braggoscope
www.braggoscope.com/explore
0
9
6
Now I wish I'd watched this before I tried figuring it out for myself 😅 Tim Berglund nails it in this crystal-clear explanation of MCP:
www.youtube.com/watch?v=FLpS...
add a skeleton here at some point
19 days ago
1
1
1
🏃♂️ having too much fun learning about MCP :)
20 days ago
0
4
1
Replacing a cache service with a database - blag
avi.im/blag/2025/db...
loading . . .
Replacing a cache service with a database - blag
Why do we use caches at all? Can databases fully replace them?
https://avi.im/blag/2025/db-cache/
21 days ago
0
7
0
Oracle Autonomous Database & Confluent Tableflow: Real-Time Kafka Analytics Without ETL
blogs.oracle.com/datawarehous...
loading . . .
https://blogs.oracle.com/datawarehousing/post/adb-confluent-tableflow-iceberg-without-etl
22 days ago
0
2
0
Sunny Days Are Warm: Why LinkedIn Rewards Mediocrity
www.elliotcsmith.com/linkedin-tox...
loading . . .
Sunny Days Are Warm: Why LinkedIn Rewards Mediocrity
I, like many people, find LinkedIn particularly annoying. I like the premise of it, don’t get me wrong, a resume you don’t need to update all that often seems cool. Unfortunately though, its turned…
https://www.elliotcsmith.com/linkedin-toxic-mediocrity/
about 1 month ago
2
5
0
Brad Stulberg - Motivation is Overrated: Here’s What Works Instead
bstulberg.medium.com/motivation-i...
loading . . .
Motivation is Overrated: Here’s What Works Instead
On the power of showing up and behavioral activation
https://bstulberg.medium.com/motivation-is-overrated-heres-what-works-instead-7c5744efd82f#bypass
about 1 month ago
0
2
0
✍️ Blogged: Interesting Links - August 2025 🗂️ Topics cover
#ApacheKafka
,
#ApacheIceberg
,
#ApacheFlink
, Stream Processing,
#DataEngineering
, RDBMS, and some stuff that I just found generally interesting :) 👉
rmoff.net/2025/08/21/i...
#dataBS
loading . . .
Interesting links - August 2025
https://rmoff.net/2025/08/21/interesting-links-august-2025/
about 1 month ago
0
5
1
Details of how DataDog built a new time series storage in Rust, replacing five previous generations of platform, giving 60x ingest speed and 5x faster queries
www.datadoghq.com/blog/enginee...
about 1 month ago
0
7
0
Ostensibly about the adoption of StarRocks at Fresha, this blog post goes into a bunch of interesting detail about data platform architecture, migration strategy, data serving tiers, and more - recommended!
freedium.cfd/https://medi...
loading . . .
https://freedium.cfd/https://medium.com/fresha-data-engineering/how-we-accidentally-became-one-of-uks-first-starrocks-production-pioneers-7db249f10010
about 1 month ago
1
5
0
srsly, probably 50% of the articles with interesting titles are just AI-generated crap. Smells include 🫵all✅the💽emojis • bullet • lists and then crappy ascii-art diagrams. (I guess this list'll get fed back into LLMs so they learn how to produce this shit better…)
add a skeleton here at some point
about 1 month ago
1
1
0
Preparing this month's Interesting Links (
rmoff.net/categories/i...
) and jfc there is so much shit on Medium. Some of it is human-generated shit, and more and more it's AI slop. Unfortunately there is the odd gem in there amongst the swill so it'd be remiss not to keep trawling it.
loading . . .
Categories • rmoff's random ramblings
https://rmoff.net/categories/interesting-links/
about 1 month ago
0
4
1
✍️ Blogged: Kafka to Iceberg - Exploring the Options
rmoff.net/2025/08/18/k...
#dataBS
about 1 month ago
1
16
5
Vortex (
vortex.dev
) is now a Linux Foundation project
www.linuxfoundation.org/press/lf-ai-...
loading . . .
LF AI & Data Foundation Hosts Vortex Project to Power High Performance Data Access for AI and Analytics
Contributed by SpiralDB, Vortex is an extensible, next-generation columnar storage format designed for building high-performance, future-proof data systems
https://www.linuxfoundation.org/press/lf-ai-data-foundation-hosts-vortex-project-to-power-high-performance-data-access-for-ai-and-analytics
about 2 months ago
0
4
1
Building Reproducible ML Systems with Apache Iceberg and SparkSQL: Open Source Foundations
www.infoq.com/articles/rep...
loading . . .
Building Reproducible ML Systems with Apache Iceberg and SparkSQL: Open Source Foundations
Traditional data lakes aregreat for storing massive amounts of stuff, but they're terrible at the transactional guarantees and versioning that ML workloads desperately need. Apache Iceberg and…
https://www.infoq.com/articles/reproducible-ml-iceberg/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=AI%2C+ML+%26+Data+Engineering
about 2 months ago
0
2
0
Erica Pisani—A First-Timer’s Guide to Curating a Technical Conference Track
www.infoq.com/articles/gui...
loading . . .
A First-Timer’s Guide to Curating a Technical Conference Track
One first-time track host shares the process, constraints, and takeaways from building a track from scratch at QCon London 2025.
https://www.infoq.com/articles/guide-curating-technical-conference-track/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=AI%2C+ML+%26+Data+Engineering
about 2 months ago
0
1
0
Richard van der Hoff - How
matrix.org
discovered, and recovered from, Postgres corruption
matrix.org/blog/2025/07...
loading . . .
Matrix.org
Matrix, the open protocol for secure decentralised communications
https://matrix.org
about 2 months ago
0
3
0
I should go on vacation more often—whilst I was away
#ApacheFlink
2.1 was been released!
flink.apache.org/2025/07/31/a...
about 2 months ago
1
1
1
✍️ Blogged: Interesting Links - July 2025 👉 Lots of Iceberg this month, plus Kafka, Flink, several interesting papers, and 9x examples of data in action at companies including Peloton, Cloudflare, Datadog, and Stifel 📚
rmoff.net/2025/07/18/i...
loading . . .
Interesting links - July 2025
https://rmoff.net/2025/07/18/interesting-links-july-2025/
2 months ago
0
6
1
reposted by
rmoff 🏃♂️🫖🥓
Data Behind the Scenes Conference
2 months ago
The weekend is here! Perfect time to submit your
#dataBS
talk. We want your lessons-learned stories from data work, everything from "my data pipeline: the unsung hero" to "how we got that AI system working" to "what we learned when it fell over." Details and sign-up:
bit.ly/dataBSconf-cfs
loading . . .
Data Behind the Scenes Conf - Call for Speakers
What This Conference Is About "Data, Behind the Scenes" is a (free) online-only, single track conference centered on the real stories of data work from the folks in the trenches. We’re not here for th...
https://bit.ly/dataBSconf-cfs
0
13
10
Ignore the clickbaity listicle title - there are some really good points in this article from Bernd Wessely
freedium.cfd/https://medi...
2 months ago
0
4
0
I currently run my blog using Hugo and GitHub Pages. Does anyone know of a way to bolt on a substack-esque email subscription option? I want to retain ownership of content, but seem to remember someone talking about something that did this. Sound familiar,
@ssp.sh
perhaps?
2 months ago
1
2
1
check it out, I wrote a blog 👇 😁 (it's showing how you can use SQL in Flink to build streaming ETL, with the resulting data ending up as Iceberg tables courtesy of Tableflow.)
www.confluent.io/blog/streami...
#dataBS
loading . . .
Streaming ETL Pipelines With Flink® SQL Apache Iceberg™
Follow this demo for building streaming ETL pipelines with Confluent Cloud using Flink® SQL to transform data and Tableflow to create Iceberg tables for analysis.
https://www.confluent.io/blog/streaming-etl-flink-tableflow/
2 months ago
0
6
1
Details of Peloton's Data Infrastructure, including Kafka, Hudi, and Debezium
hudi.apache.org/blog/2025/07...
loading . . .
Modernizing Data Infrastructure at Peloton Using Apache Hudi | Apache Hudi
Peloton re-architected its data platform using Apache Hudi to overcome snapshot delays, rigid service coupling, and high operational costs. By adopting CDC-based ingestion from PostgreSQL and…
https://hudi.apache.org/blog/2025/07/15/modernizing-datainfra-peloton-hudi/
2 months ago
0
10
1
Maths is hard, part 94.
2 months ago
1
4
0
reposted by
rmoff 🏃♂️🫖🥓
Confluent
2 months ago
Building a streaming data pipeline? Step one: make sense of the data you’re working with.
@rmoff.net
walks through how to explore and validate messy real-world data from
#ApacheKafka
in
#ApacheIceberg
using Tableflow. Read it now 👉
cnfl.io/4jrtjrD
loading . . .
Data Exploration with Tableflow, Apache Iceberg, and Trino
This blog post demonstrates how to use Tableflow to easily transform Kafka topics into queryable Iceberg tables using UK Environment Agency sensor data as a data source.
https://cnfl.io/4jrtjrD
0
4
2
✍️ I'm continuing my learning journey with
#ApacheIceberg
, taking a look at why housekeeping is necessary:
rmoff.net/2025/07/14/k...
I also learnt a little bit about
#ApachePolaris
, and even got chance to try out Nimtable :)
#dataBS
loading . . .
Keeping your Data Lakehouse in Order: Table Maintenance in Apache Iceberg
https://rmoff.net/2025/07/14/keeping-your-data-lakehouse-in-order-table-maintenance-in-apache-iceberg/
2 months ago
1
7
0
Debezium 3.2.0.Final Released
debezium.io/blog/2025/07...
2 months ago
1
8
4
Improving Debezium performance, by Vojtěch Juránek
debezium.io/blog/2025/07...
loading . . .
Improving Debezium performance
Debezium is an open source distributed platform for change data capture. Start it up, point it at your databases, and your apps can start responding to all of the inserts, updates, and deletes that…
https://debezium.io/blog/2025/07/07/quick-perf-check/
2 months ago
0
4
0
Every time I use a notebook like Jupyter I remember how damn powerful they are for learning and developing something. So much of our tooling is just functional and necessary, but notebooks… notebooks are FUN as well as powerful :)
2 months ago
2
10
0
Brandolini lifting heavy today.
3 months ago
0
2
0
reposted by
rmoff 🏃♂️🫖🥓
DuckDB
3 months ago
🚀 The DuckDB 1.3.2 bugfix release is out! 📦 The Python and CLI clients are already on the latest version, while the rest will follow in the coming days. 🔖 See the detailed change log at
github.com/duckdb/duckd...
0
25
7
How Atlassian migrated four million JIRA database instances from RDS for Postgres to Aurora
www.atlassian.com/blog/atlassi...
loading . . .
Migrating the Jira Database Platform to AWS Aurora - Work Life by Atlassian
Explore how Atlassian successfully migrated four million Jira databases to AWS Aurora, overcoming unique technical and operational challenges at massive scale. Learn how the team balanced…
https://www.atlassian.com/blog/atlassian-engineering/migrating-jira-database-platform-to-aws-aurora
3 months ago
0
3
0
reposted by
rmoff 🏃♂️🫖🥓
Andy Pavlo
3 months ago
At last
@abigalekim.bsky.social
's paper is out! Its the most complete eval of DB extensions/plugins ever. We analyze PostgreSQL, MySQL, MariaDB, SQLite, DuckDB, Redis. TLDR: Postgres extns ecosystem is fraught with footguns. Other DBMSs have fewer extns but less problems. DuckDB has cleanest API.
add a skeleton here at some point
1
67
14
reposted by
rmoff 🏃♂️🫖🥓
DuckDB
3 months ago
📢 DuckLake 0.2 is out! We added new features to the specification and improved support in the DuckDB ducklake extension. See the announcement blog post at
duckdb.org/2025/07/04/d...
.
2
42
13
✍️ Blogged: Writing to
#ApacheIceberg
using the
#ApacheKafka
Kafka Connect Iceberg sink
rmoff.net/2025/07/04/w...
#dataBS
3 months ago
1
16
1
An all-time classic conference talk:
www.destroyallsoftware.com/talks/wat
3 months ago
0
9
2
Load more
feeds!
log in