Andy Pavlo
@andypavlo.bsky.social
📤 5157
📥 59
📝 100
Associate Prof. of Databases @ Carnegie Mellon.
reposted by
Andy Pavlo
Yaroslav Tkachenko
21 days ago
Found a love letter to
@andypavlo.bsky.social
at
#PgConf
in Vancouver
0
18
1
reposted by
Andy Pavlo
CMU Database Group
23 days ago
Prof. Andy Pavlo Wins the 2026 IEEE TCDE Ramez Elmasri Outstanding Database Education Award:
db.cs.cmu.edu/2026/05/prof...
Prof. Andy Pavlo Wins 2026 IEEE TCDE Ramez Elmasri Outstanding Database Education Award - Carnegie Mellon Database Group
Pittsburgh, PA – The Carnegie Mellon Database Research Group is proud to... Read More +
https://db.cs.cmu.edu/2026/05/prof-andy-pavlo-wins-2026-ieee-tcde-ramez-elmasri-outstanding-database-education-award/
0
19
1
I'm excited to announce CMU-DB's DJ Cache ran the whole
@cmu.edu
circuit by winning the "Most Dank DJ" award again! This semester was the most vicious bracket ever seen in the cut. Every set was a straight-up street fight on the aux. Congratulations!
about 1 month ago
0
18
0
reposted by
Andy Pavlo
SIGMOD/PODS Conference
about 1 month ago
#SIGMOD2026
Research Track Honorable Mention: F3: The Open-Source Data File Format for the Future, by Xinyu Zeng, Ruijun Meng, Martin Prammer,
@wesmckinney.com
,
@pateljm.bsky.social
,
@andypavlo.bsky.social
, and Huanchen Zhang
2026.sigmod.org/sigmod_award...
#ACM
#sigmodresearchtrack
#sigmodawards
https://2026.sigmod.org/sigmod_awards.shtml
1
3
1
We now offer
@db.cs.cmu.edu
's Database Systems course offline to incarcerated students across US prisons. No WiFi, completely free. Locked in by the system, freed by the lock manager:
db.cs.cmu.edu/2026/05/cmud...
Thanks to
@convex.dev
for helping make sure the database game is for everybody!
about 1 month ago
0
18
0
reposted by
Andy Pavlo
CMU Database Group
about 1 month ago
Today's Postgres vs. World Seminar Speaker: Dr. Xiangyao Yu (
@xiangyaoyu.bsky.social
) will present the architecture of the Sirius GPU-native database system. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Sirius: A GPU-Native SQL Engine (Xiangyao Yu) - Carnegie Mellon Database Group
GPUs have evolved into powerful, cost-efficient engines for general-purpose parallel compute. This... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-sirius-a-gpu-native-sql-engine-xiangyao-yu/
0
5
3
reposted by
Andy Pavlo
CMU Database Group
about 2 months ago
Today's Postgres vs. World Seminar Speaker: Steve Schirripa (MSE'00) will present how
@villagesql.bsky.social
is injecting new life into MySQL via extensions. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
The Extensibility Tax: Decisions, Principles, and Lessons from a Year of Teaching MySQL New Tricks (Steve Schirripa) - Carnegie Mellon Database Group
What does it take to add custom data types and indexes to... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-villagesql-steve-schirripa/
0
3
3
The founders of FloeDB (Mark Cusack + Kurt Westerfeld) gave an interesting
@db.cs.cmu.edu
talk on their new Iceberg-compatible query engine. Two key takeaways from their talk: 1️⃣ Floe is a hard fork of Yellowbrick. 2️⃣ Floe is building a "catalog-of-catalogs"
www.youtube.com/watch?v=Kq3c...
Floe: A SQL Compute Service for the Data Lakehouse (Kurt Westerfeld + Mark Cusack)
YouTube video by CMU Database Group
https://www.youtube.com/watch?v=Kq3csHJqgJQ
about 2 months ago
2
9
0
reposted by
Andy Pavlo
CMU Database Group
about 2 months ago
Today's Postgres vs. World Seminar Speaker: Sugu Sougoumarane (ex-Vitess/PlanetScale) will present the architecture of
@supabase.com
's Multigres horizontal sharding system for Postgres. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Multigres: Bringing Horizontal Scaling and Enterprise Operations to PostgreSQL (Sugu Sougoumarane) - Carnegie Mellon Database Group
Multigres is an adaptation of Vitess for PostgreSQL that enables horizontal scaling... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-multigres-sugu-sougoumarane/
1
6
2
reposted by
Andy Pavlo
CMU Database Group
2 months ago
Today's Postgres vs. World Seminar Speaker: Tyler F. Cloutier will explain why and how they embedded an MMORPG game engine inside of the
@spacetimedb.bsky.social
database system. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Inverting the Backend: Why We Built our MMORPG Inside a Database (Tyler Cloutier) - Carnegie Mellon Database Group
This talk explores what happens when you move all the infrastructure and... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-spacetimedb-tyler-cloutier/
0
6
1
reposted by
Andy Pavlo
CMU Database Group
2 months ago
Today's Postgres vs. World Seminar Speaker: Dr. Marcel Kornacker will present the design & implementation of the Pixeltable multi-modal DBMS based on PostgreSQL. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Pixeltable: A DBMS for Multimodal AI Applications (Marcel Kornacker) - Carnegie Mellon Database Group
Pixeltable is a novel OLTP DBMS specifically designed for multimodal AI applications.... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-pixeltable-marcel-kornacker/
0
2
1
Thanks to Google BigTable for providing prizes to the top 3
@db.cs.cmu.edu
students with the fastest projects in our Database Systems course this semester! They have to implement a buffer pool, B+Tree, query executors, and MVCC. Students are encouraged to profile+benchmark to get better rankings.
3 months ago
0
15
1
This email showed up two weeks ago. It is super sketchy so I didn't respond. They followed up again but I'm not meeting them. Andy and the police don't mix. 👑 Remember Rule #9 from Biggie's Ten Crack Commandments: If you aren't being arrested, stay away from police
youtu.be/ot9NT9W0Fog
3 months ago
1
14
1
reposted by
Andy Pavlo
CMU Database Group
3 months ago
Today's Postgres vs. World Seminar Speaker: Filip Obradovic will present the wild architecture of TonicDB. It runs directly on hardware without an OS via a shim layer / unikernel. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
TonicDB: Databases without an OS? Meet QuinineHM (Filip Obradovic) - Carnegie Mellon Database Group
We spent years optimizing database internals, only to have our performance eaten... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-tonicdb-filip-obradovic/
0
6
2
reposted by
Andy Pavlo
CMU Database Group
3 months ago
Today's Postgres vs. World Seminar Speaker: Hari Krishna Sunder will present the architecture of the
@yugabytedb.bsky.social
distributed DBMS based on Postgres. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
YugabyteDB: Distributed PostgreSQL for Modern Apps (Hari Krishna Sunder) - Carnegie Mellon Database Group
YugabyteDB is an AI-ready, multimodal, distributed PostgreSQL database. It effectively bridges the... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-yugabytedb-hari-krishna-sunder/
0
7
3
reposted by
Andy Pavlo
CMU Database Group
3 months ago
Today's Postgres vs. World Seminar Speaker: Simon Eskildsen will present the architecture of the
@turbopuffer.bsky.social
search engine. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
turbopuffer: Object Storage-native Database for Search (Simon Eskildsen) - Carnegie Mellon Database Group
turbopuffer is an object storage-native search engine. It puffs data into a... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-turbopuffer-simon-eskildsen/
0
5
5
reposted by
Andy Pavlo
CMU Database Group
4 months ago
Today's Postgres vs. World Seminar Speaker: Adam Prout will present the architecture of Microsoft Azure HorizonDB database service based on PostgreSQL. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
HorizonDB: Co‑Designing PostgreSQL and Azure for Cloud‑Native OLTP (Adam Prout) - Carnegie Mellon Database Group
Azure HorizonDB is a new PostgreSQL service that improves the OLTP performance... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-horizondb-adam-prout/
0
8
1
reposted by
Andy Pavlo
CMU Database Group
4 months ago
Today's Postgres vs. World Seminar Speaker: Marek Galovic will present the TopK document + vector search engine system. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
TopK: Billion-Scale Hybrid Retrieval from the Ground Up (Marek Galovic) - Carnegie Mellon Database Group
TopK is a search engine built from the ground up for unstructured... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-topk-marek-galovic/
0
6
2
reposted by
Andy Pavlo
CMU Database Group
4 months ago
Today's Postgres vs. World Seminar Speaker: Marc Brooker (
@marcbrooker.bsky.social
) will present the architecture of Amazon's Aurora DSQL Postgres-compatible serverless OLTP DBMS. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Aurora DSQL: Serverless, Scalable, Global OLTP (Marc Brooker) - Carnegie Mellon Database Group
Amazon Aurora DSQL is a distributed SQL database, designed to make it... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-aurora-dsql-marc-brooker/
0
14
3
reposted by
Andy Pavlo
CMU Database Group
4 months ago
Today's Postgres vs. World Seminar Speaker: Tyler Akidau + Adam Symanski will present the architecture of
@redpanda.com
's Oxla database system. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/pg-vs...
Redpanda Oxla or: Why Your Hashmaps are Secretly Wrecking Your Performance (Tyler Akidau + Adam Symanski) - Carnegie Mellon Database Group
In this talk, we’ll first give an overview of the Oxla analytical... Read More +
https://db.cs.cmu.edu/events/pg-vs-world-redpanda-oxla-tyler-akidau-adam-symanski/
0
5
2
Spring 2026 Seminar Series: PostgreSQL vs. The World
db.cs.cmu.edu/seminars/spr...
First talk on Mon Feb 2nd @ 4:30pm EST. We will alternate between a speaker from either a Postgres DBMS or a non-Postgres DBMS. Open to the public over Zoom. All videos available on YouTube afterwards.
PostgreSQL vs. The World Seminar Series - Carnegie Mellon Database Group
Every major cloud vendor now offers an enhanced, opinionated PostgreSQL-compatible database management... Read More +
https://db.cs.cmu.edu/seminars/spring2026/
4 months ago
1
22
2
Thanks to Bohan Zhang for hosting me at OpenAI yesterday. Lots of
@db.cs.cmu.edu
alum are thriving there. Plus the Rockset squad rolled up. It was the nicest tech office I've visited in my life. It was like a classy law firm but with an insane number of ex-FBI bouncers at the front entrance.
5 months ago
2
15
1
Congratulations to the 2026 CIDR prize awardees! Tianyu Li → Gong Show Winner; Fuheng Zhao → Database Quiz Winner. They each received a rare signed print of "The Birth of the Database Messiah" (estimated insurance value $12,000).
5 months ago
0
23
0
I recently came across this database system in my travels:
medium.com/@sschepis/i-...
The title immediately raises my BS alarms. They claim to "teleport data" via "quantum mechanical principles".
5 months ago
1
12
2
I don't want to cook such an early stage company but I think these people are trying to sell MMAP as a service?!? No technical details except it appears to be a MMAP buffer pool. I also don't know why their system is "fully ACID" but RocksDB is not?
ryjoxdemo.com/solutions/ed...
5 months ago
2
11
0
At least this scam company remembered to sanitize their database inputs before sending out their spam...
5 months ago
0
28
0
I've posted my latest recap of the world of databases:
www.cs.cmu.edu/~pavlo/blog/...
All the hot topics from the last year: • More Postgres action! • MCP for everyone! • MongoDB gets litigious with FerretDB! • File formats! • Market movements! • The richest person in the history of the world!
Databases in 2025: A Year in Review
The world tried to kill Andy off but he had to stay alive to talk about what happened with databases in 2025.
https://www.cs.cmu.edu/~pavlo/blog/2026/01/2025-databases-retrospective.html
5 months ago
1
76
31
Congratulations to the #1 ranked
@db.cs.cmu.edu
PhD student Wan Shen Lim (
@wslim.bsky.social
) for successfully passing his doctoral defense. Wan has been working on hard AF database research with me for the last *nine* years at CMU (undergrad+grad). He also hates chickens.
6 months ago
3
30
1
reposted by
Andy Pavlo
David Andersen
6 months ago
in a tiny job update: I'm taking over as co-director of CMU's parallel data lab (PDL). in a bad news update: I just used the phrase "align with CMU's brand strategy" unironically in an email to the administration. might need an intervention...
2
18
1
reposted by
Andy Pavlo
CMU Database Group
6 months ago
Today's Future Data Systems Seminar Speaker: Jark Wu from AlibabaCloud will present an overview of Apache Fluss. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Apache Fluss: A Streaming Storage for Real-Time Lakehouse - Carnegie Mellon Database Group
Modern data lakehouses promise unified batch and streaming processing, yet their storage... Read More +
https://db.cs.cmu.edu/events/future-data-apache-fluss-a-streaming-storage-for-real-time-lakehouse/
0
7
2
Do you like databases? Do you want to hear two database professors rant about them? Do you need one of those professors to have a Turing Award for databases? If yes, then join Mike Stonebraker and me next Wed Dec 10 @ 1:00pm EST for database hot takes:
www.dbos.dev/webcast-2025...
2025 in Review with Mike Stonebraker and Andy Pavlo
Webcast Dec 10: DBMS researchers Mike Stonebraker (MIT / DBOS) and Andy Pavlo (CMU) discuss which data and CS trends are heating up or cooling down heading into 2026.
https://www.dbos.dev/webcast-2025-in-review-with-mike-stonebraker-and-andy-pavlo
6 months ago
3
76
20
reposted by
Andy Pavlo
Conor Power
6 months ago
There is still time to register for CIDR 2026 in Santa Cruz! If you need a roommate for the conference, there is also a spreadsheet you can use to find someone!
www.cidrdb.org/cidr2026/reg...
CIDR 2026 - Registration
The 16th Conference on Innovative Data Systems Research (CIDR 2026), Registration Information
https://www.cidrdb.org/cidr2026/registration.html
0
7
5
reposted by
Andy Pavlo
CMU Database Group
6 months ago
Today's Future Data Systems Seminar Speaker: Prashant Singh from Snowflake will present the Apache Polaris re-implementation of the Iceberg REST catalog API. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] From Storage Formats to Open Governance: The Evolution to Apache Polaris - Carnegie Mellon Database Group
As organizations build their data lakehouses on Apache Iceberg, the primary challenge... Read More +
https://db.cs.cmu.edu/events/futuredata-apache-polaris/
0
5
2
reposted by
Andy Pavlo
CMU Database Group
7 months ago
Today's Future Data Systems Seminar Speaker: Jeremy Taylor (
@refset.bsky.social
) will present the architecture of the XTDB (
@xtdb.com
) time-traveling database system. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Reconstructing History with XTDB - Carnegie Mellon Database Group
XTDB is a SQL database that challenges long held assumptions about how... Read More +
https://db.cs.cmu.edu/events/futuredata-reconstructing-history-with-xtdb/
0
4
4
reposted by
Andy Pavlo
CMU Database Group
7 months ago
Today's Future Data Systems Seminar Speaker: Benjamin Wagner🇩🇪 will present
@firebolthq.bsky.social
's native support for low-latency queries on Apache Iceberg tables. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Why Powering User Facing Applications on Iceberg is Hard - Carnegie Mellon Database Group
Firebolt is a Postgres compliant analytical database built for low-latency, high-concurrency analytics.... Read More +
https://db.cs.cmu.edu/events/future-data-firebolt/
0
13
3
reposted by
Andy Pavlo
CMU Database Group
7 months ago
Today's Future Data Systems Seminar Speaker: Cheng Chen will present how
@mooncakelabs.bsky.social
extends PostgreSQL to support Apache Iceberg. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Mooncake: Real-Time Apache Iceberg Without Compromise - Carnegie Mellon Database Group
Apache Iceberg is great for large-scale analytics, but it was built for... Read More +
https://db.cs.cmu.edu/events/futuredata-mooncake/
0
5
2
reposted by
Andy Pavlo
Sam Arch
7 months ago
Great idea to compare plans across different systems using rows processed. A good yardstick, but slower sort-based plans from Postgres + MSSQL process fewer rows than faster hash-based plans from DuckDB. Postgres rows scanned also seem underreported. Nice to see some competition with ClickBench.
0
3
2
New database leaderboard from Yellowbrick ranks the quality of DBMS optimizer estimates and plans. They only evaluate TPC-H for now and report results for Postgres + DuckDB + MSSQL:
sql-arena.com/components/p...
Repo:
github.com/sql-arena/db...
LinkedIn Group:
www.linkedin.com/groups/15775...
7 months ago
1
14
3
reposted by
Andy Pavlo
CMU Database Group
7 months ago
Today's Future Data Systems Seminar Speaker: Ryan Johnson (CMU PhD'10) will present
@deltalakeoss.bsky.social
's internal architecture and how it supports multi-statement transactions. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Multi-statement Transactions in the Databricks Lakehouse - Carnegie Mellon Database Group
The data lake architecture originally focused on self-standing tables in cloud storage,... Read More +
https://db.cs.cmu.edu/events/futuredata-deltalake/
0
4
2
reposted by
Andy Pavlo
CMU Database Group
8 months ago
Today's Future Data Systems Seminar Speaker: Joyo Victor will present
@singlestore.com
's "Bottle Service" meta-data system that supports database branching, change-data-capture, and Apache Iceberg. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Storage Metadata for Modern Cloud Databases - Carnegie Mellon Database Group
In modern database architecture, separating compute from storage unlocks powerful capabilities. Our... Read More +
https://db.cs.cmu.edu/events/futuredata-singlestore
0
3
3
Lots of database action this week. Yes, I have a new start-up
@sydht.ai
with my PhD students
@wslim.bsky.social
+
@17zhangw.bsky.social
using LLMs to optimize almost everything in PostgreSQL.
@datadictum.bsky.social
posted a new article on our approach:
www.theregister.com/2025/10/22/c...
Researchers tout vector-based automated tuning in PostgreSQL
Researchers say 'Proto-X' fine-tunes databases automatically, delivering multifold performance boosts
https://www.theregister.com/2025/10/22/cmu_proto_x_postgres/
8 months ago
2
16
2
reposted by
Andy Pavlo
ScyllaDB
8 months ago
Day 2 of
#P99CONF
is here! The
#ScyllaDB
Lounge opens at 8:00 am PST, and then we get things started with keynotes from
@dorlaor.bsky.social
and
@andypavlo.bsky.social
. Don't forget that all registrants receive Instant Access to the sessions once the conference ends.
www.p99conf.io?latest_sfdc_...
0
3
2
reposted by
Andy Pavlo
CMU Database Group
8 months ago
Today's Future Data Systems Seminar Speaker: Ian Cook (
@ian.columnar.tech
) will present
@columnar.tech
's work on Apache Arrow's database connectivity API (ADBC). ADBC is available in modern DBMSs. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Where We're Going, We Don't Need Rows: Columnar Data Connectivity with ADBC - Carnegie Mellon Database Group
ADBC (Arrow Database Connectivity) is Apache Arrow’s answer to ODBC and JDBC:... Read More +
https://db.cs.cmu.edu/events/futuredata-where-were-going-we-dont-need-rows-columnar-data-connectivity-with-adbc/
0
15
9
reposted by
Andy Pavlo
CMU Database Group
8 months ago
Today's Future Data Systems Seminar Speaker: Will Manning (
@willmanning.com
) will present
@spiraldb.com
's Vortex file format. Vortex is now a
@linuxfoundation.org
project. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Vortex: LLVM for File Formats - Carnegie Mellon Database Group
Apache Parquet revolutionized columnar storage after its initial release in 2013, but... Read More +
https://db.cs.cmu.edu/events/futuredata-vortex/
0
4
4
reposted by
Andy Pavlo
Andrew Lamb
8 months ago
BTW if anyone wants a good intro to database storage / Log structured storage (aka LSM trees)
@db.cs.cmu.edu
lecture this fall is a good one:
www.youtube.com/watch?v=2_sT...
#05 - Log-Structured Database Storage ✸ SingleStore Database Talk (CMU Intro to Database Systems)
YouTube video by CMU Database Group
https://www.youtube.com/watch?v=2_sTdS4h-bY
0
17
4
reposted by
Andy Pavlo
Artem Krylysov
8 months ago
MMAP is incredibly fast when the dataset fits in memory, but it slows to a crawl when it doesn't, especially if the workload is mostly random point lookups. Speaking as someone who built an MMAP-based key-value store before :) Obligatory paper from
@andypavlo.bsky.social
db.cs.cmu.edu/mmap-cidr2022/
0
9
2
reposted by
Andy Pavlo
CMU Database Group
8 months ago
Today's Future Data Systems Seminar Speaker: Jordan Tigani (
@jrdntgn.bsky.social
) will present how
@motherduck.com
supports modern workloads with DuckLake. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] DuckLake: Learning from Cloud Data Warehouses to Build a Robust "Lakehouse" - Carnegie Mellon Database Group
When building scalable data systems, it is easy to focus on the... Read More +
https://db.cs.cmu.edu/events/future-data-ducklake-learning-from-cloud-data-warehouses-to-build-a-robust-lakehouse/
0
13
6
Our SIGMOD paper with our friends at Tsinghua +
@wesmckinney.com
+
@pateljm.bsky.social
on creating a next-generation open-source data file format is out. F3 is a future-proof file format that avoids the mistakes of Parquet. 📄 Paper:
db.cs.cmu.edu/papers/2025/...
📁 Code:
github.com/future-file-...
8 months ago
4
70
26
reposted by
Andy Pavlo
CMU Database Group
8 months ago
Today's Future Data Systems Seminar Speaker: Vinoth Chandar will present the internals of Apache Hudi and his work at Onehouse. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] Apache Hudi: A Database Layer over Cloud Storage for Fast Mutations and Efficient Queries - Carnegie Mellon Database Group
Data lakes emerged as a way to store vast amounts of data... Read More +
https://db.cs.cmu.edu/events/futuredata-apache-hudi/
0
4
1
reposted by
Andy Pavlo
CMU Database Group
9 months ago
Today's Future Data Systems Seminar Speaker: Russell Spitzer will present the internals of Apache Iceberg's query planner and execution engine. Zoom talk open to public at 4:30pm ET. YouTube video available after:
db.cs.cmu.edu/events/futur...
[Future Data] An Extremely Technical Overview of how the Apache Iceberg™ Planning Implementation Actually Works - Carnegie Mellon Database Group
What are you trying to tell me? That I can read data... Read More +
https://db.cs.cmu.edu/events/futuredata-apache-iceberg/
0
8
5