Vignesh Chandramohan
@vigneshc.bsky.social
📤 1234
📥 87
📝 40
Stream processing, data infra, Table formats.
https://vigneshc.github.io/
A short post on Iceberg deletions:
vigneshc.github.io/blog/iceberg...
Iceberg Deletes — Equality Deletes and V3 Deletion Vectors
A short walkthrough of Apache Iceberg deletes: equality deletes, V3 deletion vectors, and streaming considerations.
https://vigneshc.github.io/blog/iceberg-deletion/
28 days ago
0
3
0
Exploring Apache Iceberg and SlateDB formats - with a repo link for additional exploration.
datapapers.substack.com/p/exploring-...
Exploring Table Formats - Iceberg & SlateDB
What is common between Apache Iceberg, SlateDB, and object-store-first table formats? Explore with real examples.
https://datapapers.substack.com/p/exploring-table-formats-iceberg-and
2 months ago
0
8
1
Short talk on Iceberg use cases at last week's Seattle Iceberg meetup.
youtu.be/F7qpOVVnxek?...
Iceberg Use Cases at DoorDash
YouTube video by Apache Iceberg™ Meetup
https://youtu.be/F7qpOVVnxek?si=ZpmT_-kTWFlcgrEl
8 months ago
0
2
0
reposted by
Vignesh Chandramohan
Chris
8 months ago
If you find yourself in SF next week, @almog.xyz is talking about SlateDB at the SF Systems Meetup on Wednesday!
SF Systems Meetup: Databases and Stateful Apps · Luma
The SF Systems Meetup is back with a pair of talks giving us a peek at the future of state management! This month, we're excited to have talks from Almog Gavra…
https://luma.com/e7feg2i6
0
17
4
1/ Nice read:
medium.com/@xiafan/time...
Software engineer growth in the AI era:
* Build composable tools.
* Assume non-deterministic outcomes from a group of AI agents.
* Understand how LLMs work at a deeper level, the way you would learn to read a query plan.
Time to Adapt and Become a Better Engineer
A colleague recently asked if I think AI agents could do all the jobs humans do. My answer was an unequivocal yes. Given the same sensory…
https://medium.com/@xiafan/time-to-adapt-and-become-a-better-engineer-7059b7295f21
9 months ago
1
2
0
reposted by
Vignesh Chandramohan
Chris
10 months ago
Just added
sqlync.com
to SlateDB's adopters list! They're building a streaming system that speaks MQTT or PostgreSQL across millions of connected users and devices. 🤯
SQLync
https://sqlync.com
0
5
1
reposted by
Vignesh Chandramohan
Chris
about 1 year ago
Insane amount of SlateDB work going on:
- snapshot reads
- split/merge DBs (zero copy)
- deterministic simulation testing
And someone just pushed Python bindings in a PR! 🤯
0
10
3
My Data Council talk on SlateDB.
youtu.be/gcTRXZeKbNg?...
Internals of SlateDB: An Embedded Key Value Store Built On Object Storage
YouTube video by Data Council
https://youtu.be/gcTRXZeKbNg?si=bmjfgQxZjr_BFx1r
about 1 year ago
0
22
3
reposted by
Vignesh Chandramohan
Chris
about 1 year ago
SlateDB 0.6.0 is out!
github.com/slatedb/slat...
Highlights include a hybrid cache (using Foyer), a lot of internal cleanup, and more groundwork for transactions. Oh, and put performance jumped ~80% for write-heavy workloads :)
slatedb.io/performance/...
0
8
1
reposted by
Vignesh Chandramohan
Chris
about 1 year ago
Today marks SlateDB’s one year anniversary! It’s been a lot of fun. Thanks to @rohanpd.bsky.social, @flaneur2024.bsky.social, @almog.ai, @vigneshc.bsky.social, @paulbutler.org, Jason Gustafson, David Moravek, and many others for joining the project. 😀
SlateDB - An embedded storage engine built on object storage | SlateDB
https://SlateDB.io
0
16
6
reposted by
Vignesh Chandramohan
Commonhaus Foundation
about 1 year ago
Commonhaus is 1! 🎂 14 projects, solid foundations, and more on the way. If you believe in light governance, shared care, and thoughtful support for open source, come see what we’re building.
www.commonhaus.org/activity/253...
🎂 Commonhaus Turns One — A Look Back, and the Road Ahead
Commonhaus Foundation celebrates its first anniversary and lays down expectations for its future
https://www.commonhaus.org/activity/253.html
0
31
20
reposted by
Vignesh Chandramohan
Mike Driscoll
about 1 year ago
Yo SF Bay Area #databs crew, want to talk lakehouses at a real Lake House? :) Next week after Data Council, join the founders of @clickhouse.com, @motherduck.com, @startreedata.bsky.social, and @tobikodata.com to talk real-time databases and next-generation ETL.
www.rilldata.com/events/data-...
1
10
3
reposted by
Vignesh Chandramohan
Chris
over 1 year ago
SlateDB 0.5.0 is out!
Features:
- Checkpoints
- Clones
- Read only client
- Split/merge database foundation
- TTL filtering on reads
- Last version with breaking byte format changes
By the numbers:
- 62 commits
- 2 new contributors
- 10 total contributors
github.com/slatedb/slat...
Release v0.5.0 · slatedb/slatedb
What's Changed Refactor Block Tests to Use Table-Driven Test Cases by @samsond in #410 Update await calls in README.md by @criccomini in #425 chore: Apply table driven test for sst.rs by @jeffreyl...
https://github.com/slatedb/slatedb/releases/tag/v0.5.0
2
21
4
datapapers.substack.com/p/building-c...
New post.
Building composable data systems: Why, How and Standards
Standards improve interoperability. Reusable libraries built around standards drive adoption. In this post, we explore key papers and real-world examples.
https://datapapers.substack.com/p/building-composable-systems-why-how
over 1 year ago
0
4
1
The DEBS conference hosts a grand challenge every year. This year's challenge is detecting outliers in a stream of images from laser powder bed fusion. The challenge involves submitting a Kubernetes app (constraint: 2 cores, 8 GB). Interesting to try if you have the time!
2025.debs.org/call-for-gra...
CALL FOR GRAND CHALLENGE SOLUTIONS
DEBS2025
https://2025.debs.org/call-for-grand-challenge-solutions/
over 1 year ago
0
1
0
reposted by
Vignesh Chandramohan
Diptanu Choudhury
over 1 year ago
Python Folks - which data/workflow engine has the best developer experience for packaging code? We have looked into - Modal, Beam, Airflow, Flyte, AWS Lambda, Prefect, Dagster and Spark. Haven’t seen any approach which is fast, reliable and intuitive.
6
10
2
What are some papers or blogs about data quality challenges? I see tools like Great Expectations and table format features like 'check constraints' in Delta. I don't yet see data quality as a first-class property of catalogs. Found this, are there others?
journalofbigdata.springeropen.com/articles/10....
Big data quality framework: a holistic approach to continuous quality management - Journal of Big Data
Big Data is an essential research area for governments, institutions, and private agencies to support their analytics decisions. Big Data refers to all about data, how it is collected, processed, and ...
https://journalofbigdata.springeropen.com/articles/10.1186/s40537-021-00468-0
over 1 year ago
1
3
1
Great talk by Binwei Yang on Apache Gluten last week.
youtu.be/GWTj3INSzPg?...
Apache Gluten moves execution of Spark operators to native backends like Velox, accelerating query performance. It has basic Iceberg support too!
github.com/apache/incub...
Big Data Bellevue: Apache Gluten: Accelerating SparkSQL with Spark on Velox
YouTube video by BDB
https://youtu.be/GWTj3INSzPg?si=B4jkZjA6NOsjUtqk
over 1 year ago
0
1
0
reposted by
Vignesh Chandramohan
Chris
over 1 year ago
SlateDB 0.4.0 is out!
Features:
- Range scans
- No DynamoDB needed for S3
- Nightly perf tests
- Merge operator groundwork
- GC improvements
By the numbers:
- 57 commits
- 5 new contributors
- 11 total contributors
github.com/slatedb/slat...
3
25
4
reposted by
Vignesh Chandramohan
Chris
over 1 year ago
SlateDB is now part of @commonhaus-fdn.bsky.social! I think we might be the first non-JVM project. Looking forward to more projects joining us here. Our experience with the foundation has been excellent so far.
github.com/commonhaus/f...
SlateDB would like to join Commonhaus · commonhaus foundation · Discussion #213
Project information Project name: SlateDB Project website: https://slatedb.io Code repository: https://github.com/slatedb/slatedb License: Apache 2.0 Do you have authority to represent this project...
https://github.com/commonhaus/foundation/discussions/213
2
29
7
reposted by
Vignesh Chandramohan
Ryanne Dolan
over 1 year ago
The 3 reasons to backfill:
1) new pipeline, who dis?
2) logic changed, recompute
3) whoops downstream lost everything. resend it.
3
13
6