Reuben Bond
@rbn.bsky.social
2984
246
114
Distributed Systems & databases person. Works at Microsoft on Orleans & Aspire
reposted by
Reuben Bond
Andy Pavlo
23 days ago
Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next-generation open-source data file format is out. F3 is a future-proof file format that avoids the mistakes of Parquet. Paper: db.cs.cmu.edu/papers/2025/... Code: github.com/future-file-...
4
67
26
reposted by
Reuben Bond
Chris
about 1 month ago
Why, hello there! Thanks @rbn.bsky.social :D
0
2
1
reposted by
Reuben Bond
Carlos Baquero
7 months ago
In the last week I probably explained Rateless Set Reconciliation to a dozen other scientists. What an amazing paper and result, and already one year old.
6
51
8
What's a good Discord server for people building things in the AI/LLM space?
10 months ago
2
7
0
reposted by
Reuben Bond
Heidi Howard
10 months ago
Interesting workshop and a lovely community, would very much recommend!
1
8
7
reposted by
Reuben Bond
Peter Kraft
11 months ago
Hi everyone! I'm a co-founder @dbos.dev, where we're building a serverless platform for highly reliable applications. I love conversations about databases, distributed systems, and anything technical. Thanks @qianli.dev for introducing me to Bluesky, and looking forward to meeting people here! 🦋
0
31
6
reposted by
Reuben Bond
Justin
11 months ago
NULL BITMAP: in honour of DSQL's release last week I wrote about what the point of having extremely precise trustworthy clocks is
buttondown.com/jaffray/arch...
What are the Magical Clocks for?
At Re:Invent last week, AWS announced DSQL, their new serverless SQL database. As a fan of distributed SQL databases I have been enjoying reading about the...
https://buttondown.com/jaffray/archive/what-are-the-magical-clocks-for/
2
44
11
CRDTs fit a system model that has hardly any overlap with datacenter-based applications. If I'm wrong, please point to datacenter-based apps which have benefited from the application of CRDTs
11 months ago
9
13
3
reposted by
Reuben Bond
Alex Miller
11 months ago
New blog post on the fun new hardware advancements which databases can leverage for great gains, and why the cloud means it doesn't matter that they exist.
transactional.blog/b...
3
53
21
reposted by
Reuben Bond
Qian Li
11 months ago
ACM Symposium on Cloud Computing (SoCC) is next week! Lots of great papers on serverless, cloud, and AI/ML systems. I appreciate the push on diversity: both keynote speakers are women, plus a women in systems meetup! (This was my first time as a program committee member)
acmsocc.org/2024/index.h...
2024 ACM Symposium on Cloud Computing
https://acmsocc.org/2024/index.html
0
13
1
reposted by
Reuben Bond
ahmet alp balkan
12 months ago
We've now published a blog about how LinkedIn runs large stateful apps with a generic operator on Kubernetes. www.linkedin.com/blog/enginee... Don't forget to come to our #KubeCon session on Thursday.
1
21
7
reposted by
Reuben Bond
Clemens Vasters 🇪🇺🇩🇪
12 months ago
I made a substantial update to my "messaging and eventing resources" page, adding the CNCF xRegistry project and related subprojects, pending proposals for Apache Avro, a section on Microsoft Fabric RTI, and a new talk recording.
github.com/clemensv/mes...
GitHub - clemensv/messaging: Resource collection for messaging and eventing
Resource collection for messaging and eventing. Contribute to clemensv/messaging development by creating an account on GitHub.
https://github.com/clemensv/messaging
0
16
4
reposted by
Reuben Bond
Steve Yackey
12 months ago
Spent some time trying out #dotnet Aspire last night. Definitely impressed so far. I was surprised how easy it was to pull in non-dotnet workloads too.
1
4
1
Distributed Systems & #DataBS folks, please welcome @lalithsuresh.bsky.social! Lalith authored my favorite paper on group membership, RAPID. Instead of building strongly consistent membership atop weakly consistent membership (e.g., Raft + SWIM), RAPID delivers consistency from the get-go. www.usenix.org/system/files...
12 months ago
2
43
11
CASPaxos is a Paxos variant which implements a linearizable (compare and set) register without needing logs. It's the simplest practical consensus algorithm, IMO. I wrote more about it here:
reubenbond.github.io/posts/caspaxos
reublog - CASPaxos
reublog
https://reubenbond.github.io/posts/caspaxos
12 months ago
0
21
5
reposted by
Reuben Bond
Vivek Ranjan
12 months ago
Over-communicate if you're remote. And map your communication style and message to the audience.
1
7
3
Scan RAM without trashing your CPU caches by leveraging non-temporal instructions, Memory Type Range Registers, or cache line flushing. Obscure, but perhaps useful for minimizing the impact of garbage collectors / LSM compaction.
dl.acm.org/doi/pdf/10.1...
12 months ago
1
7
2
reposted by
Reuben Bond
Qian Li
12 months ago
Hello friends! I'm so excited to join Bluesky here -- it feels like a cleaner and safer social platform. The first thing I did was to follow this instruction to set my domain as my handle. It's super simple!
bsky.social/about/blog/4...
How to set your domain as your handle - Bluesky
Using a domain as your handle helps with account identity, verification, and portability. Here's how to set your domain as your handle.
https://bsky.social/about/blog/4-28-2023-domain-handle-tutorial
1
66
9
reposted by
Reuben Bond
Apurva Mehta
12 months ago
Is it the end of the road for RocksDB in stream processing? Disaggregated state is the clearly superior architecture, with @responsive.dev investing heavily in SlateDB.io while Flink 2.0 has forked RocksDB. Here's why we've bet on SlateDB for Kafka Streams: www.responsive.dev/blog/why-sla...
If not RocksDB, then what? Why SlateDB is the right choice for Stream Processing.
Why we think SlateDB is the right foundation for a state storage service for Kafka Streams.
https://www.responsive.dev/blog/why-slatedb-for-kafka-streams
1
18
8
Disaggregated storage pushes us to reexamine past ideas in a new light. Phil Bernstein gives middle-tier distributed transactions another look in this paper: sites.computer.org/debull/A19ju... What other examples come to mind?
12 months ago
1
17
4
Databases built on disaggregated storage (blobs) leverage reconfiguration rather than quorums. You don't need Paxos in your database if the storage layer guarantees linearizability and durability already.
12 months ago
2
19
4
reposted by
Reuben Bond
Kaivalya Apte
12 months ago
Talking about Database Internals, I want to let y'all know that I just released another internals episode with the one and only @gwenshap.bsky.social If you want to know why Nile Postgres was created and how it supports multitenant SaaS, tune in NOW youtu.be/WZ7t0vqGKoo
0
19
9
Data Replication Design Spectrum by @alexmillerdb.bsky.social (transactional.blog/blog/2024-da...) pairs well with Categorizing How Distributed Databases Utilize Consensus Algorithms by Adam Prout (medium.com/@adamprout/c...). Both deserve a read if you are interested in distributed databases.
Categorizing How Distributed Databases Utilize Consensus Algorithms
Distributed databases generally fall into two camps when it comes to architectures for maintaining high availability (HA) [1]. Both…
https://medium.com/@adamprout/categorizing-how-distributed-databases-utilize-consensus-algorithms-492c8ff9e916
12 months ago
1
26
6
Orleans can automatically migrate frequently communicating grains to the same hosts, reducing network calls. Here's an app running on a real cluster showing grains (small circles) being migrated between hosts to eliminate network calls (red lines).
12 months ago
4
25
4
Stateful services: low latency, efficiency, scalability - pick three - Atul Adya (Databricks): hpts.ws/papers/2024/... Atul previously built Slicer at Google. Dicer is an evolution of that for Databricks. This presentation at HPTS resonated strongly with me - if only we had a recording.
https://hpts.ws/papers/2024/2024_session8_adya.pptx
12 months ago
1
3
1
Papers would be better if they omitted the part where they rub an LLM on the problem. I see two reasons why it's done: for novelty and to satisfy the authors' bosses.
12 months ago
0
1
0
reposted by
Reuben Bond
Christian Corsano
about 1 year ago
Orleans is such a backend secret-sauce 🔥 Awesome work!
0
8
3
A new feature in Orleans enables it to seamlessly balance already-active grains across your cluster as it scales or rolls. In this demo, 100K grains are activated on a single node. The cluster is scaled to 10 nodes and the grains are gradually spread across them until everything is balanced.
about 1 year ago
0
9
4
Around 70% of traffic in Azure is RDMA. Azure Storage is powered by RDMA between clients and frontends and within the clusters to improve I/O perf & save CPU
over 2 years ago
1
2
0
Language is both liberating and constraining. It lets us convey ideas but it also shapes what we can think. We can fumble our way into inventing new bits of language by giving names to concepts, allowing us to think new thoughts and express them. Naming things is difficult.
over 2 years ago
1
0
0
Lots of accounts here are mass following, presumably using a script. Does Bluesky want to prevent that kind of thing?
over 2 years ago
0
0
0
I'm Reuben, I work at Microsoft on the .NET team, building Orleans. I'm interested in distributed systems, databases, and making cloud app dev accessible to all developers
over 2 years ago
0
6
1