@flaneur2024.bsky.social
๐ค 1075
๐ฅ 201
๐ 140
maintainer of SlateDB loves Rust, Datasys, Cloud Infra, AI
https://flaneur2020.github.io
family mooncake ๐
3 days ago
0
2
0
reposted by
Sung Kim
3 days ago
He will give practical advice, and concrete criteria to consider, when choosing research projects, and making professional decisions, in these last few years before AGI."
docs.google.com/presentation...
loading . . .
Advice for a young investigator in the first and last days of the Anthropocene
Advice for a (young) investigator in the first and last days of the Anthropocene Jascha Sohl-Dickstein Anthropic Title: Advice for a young investigator in the first and last days of the Anthropocene A...
https://docs.google.com/presentation/d/1qVFDW8qT4CC4E_2TSVevrDbZ_Z9Utu_I1z0-ISLwZts/edit?pli=1&slide=id.p#slide=id.p
0
5
1
๐๐๐
add a skeleton here at some point
5 days ago
0
3
0
reposted by
Sung Kim
10 days ago
You already know the answer, but itโs nice that someone put in the manual effort to create a benchmark Can AI file your taxes? No "TaxCalcBench: Evaluating Frontier Models on the Tax Calculation Task" Paper:
www.arxiv.org/abs/2507.16126
Repo:
github.com/column-tax/t...
loading . . .
GitHub - column-tax/tax-calc-bench: Code & data for TaxCalcBench
Code & data for TaxCalcBench. Contribute to column-tax/tax-calc-bench development by creating an account on GitHub.
https://github.com/column-tax/tax-calc-bench
1
14
1
reposted by
David Calavera
18 days ago
Job update: a couple of weeks ago, I joined
@tensorlake.ai
full time. Iโm having a lot of fun building the product with
@diptanu.bsky.social
and the rest of this wonderful team. We have a few open positions if youโd like to work with us:
www.linkedin.com/jobs/search/...
1
8
5
reposted by
Chris
23 days ago
On to new things! "Ingest, query, and share telemetry data with your engineers and customers at a fraction of the cost."
add a skeleton here at some point
1
6
1
reposted by
Redowan Delowar
26 days ago
Regarding goroutine & unbuffered channel interaction, found myself repeating this multiple times. Maybe it's time to write it down for reference. Early return + unbuffered send = goroutine leak.
rednafi.com/go/early_ret...
#golang
loading . . .
Early return and goroutine leak
At work, a common mistake I notice when reviewing candidatesโ home assignments is how they wire goroutines to channels and then return early. The pattern usually looks like this: start a few goroutin...
https://rednafi.com/go/early_return_and_goroutine_leak/
2
7
2
reposted by
Chris
28 days ago
1/ SlateDB v0.8 is now available! This is release includes OpenDAL object store support, serializable snapshot isolation, first-class Go bindings, Python binding improvements, deterministic simulation tests, performance improvements, and tons of bug fixes. Details below. ๐
1
7
3
reposted by
Chris
28 days ago
2/ Snapshot isolation โ
@flaneur2024.bsky.social
has been hard at work on snapshots/transactions. 0.8 now has `DbSnapshots`, which provide a consistent point-in-time DB view. Sequence numbers are now core to SlateDB and will be used for many features (including transactions) going forward.
1
3
1
it seems there's no gold-standard benchmark for coding models at the moment. imho, the only way to know if one is any good is to throw a (real world) task at it and see if it can handle it smoothly ๐ค.
26 days ago
1
2
0
OpenDAL can now use object_store as its backend service ๐ฅณ! this allows users to leverage the out-of-box primitives from the OpenDAL operator, such as chunking / parallel fetching and caching on user-provided object_store instances
github.com/apache/opend...
loading . . .
feat: allow using object_store as opendal's backend by flaneur2020 ยท Pull Request #6283 ยท apache/opendal
Which issue does this PR close? Closes #6171. Rationale for this change with allowing object_store as opendal's backend, we can leverage opendal's advanced operation like parallel fetchin...
https://github.com/apache/opendal/pull/6283
about 1 month ago
1
3
0
in the age of vibe coding, the measure of a top programmer should be how long they can get Claude Code to work continuously without human intervention. if you can get Claude to code independently for a full 24-hour workday, you deserve top dollar in the market ๐ค
about 1 month ago
1
1
0
reposted by
Sung Kim
about 2 months ago
z.ai's
slime It is built with native SGLang integration, carrying its full inference optimizations straight into training, is an LLM post-training framework for RL scaling, providing two core capabilities:
1
3
2
SlateDB now has snapshot support! so the next step will soon be (possibly SSI?) transactions ๐
github.com/slatedb/slat...
loading . . .
feat: add DbSnapshot by flaneur2020 ยท Pull Request #688 ยท slatedb/slatedb
this pr aims to add the API for creating snapshot. a DbSnapshot contains all the read-only operations in a Db object. it records the seqnum at the moment when the DbSnapshot is created, and always ...
https://github.com/slatedb/slatedb/pull/688
about 2 months ago
0
6
2
reposted by
v
about 2 months ago
I went through DuckDB's WAL, and it does everything I was asking for in my blog post: 1. Per record checksum 2. Explicit error on checksum failure 3. Configurable behavior 4. Partial recovery 5. Safe truncation of the WAL only when WAL contents are checkpointed
add a skeleton here at some point
1
22
9
yesterday a friend told me a narrative about why ResNet works so well๏ผ he said that some patterns can already be learned by a shallow, 3~4 layer FFN. when we naively stack more layers, the extra layers are forced to learn an `identity` function ๐ค
about 2 months ago
1
0
0
recently Kimi K2 has become my most-used LLM for chatting. imo, the best part is that it does not think ๐ฎ. while thinking is useful to improve the final result, fast fact retrieval is often more valueable for me than more 'correct' answers.
about 2 months ago
0
2
0
made a Homebrew formula for FizzBee ๐, now we can install fizzbee with: brew tap fizzbee-io/fizzbee brew install fizzbee
github.com/fizzbee-io/h...
loading . . .
GitHub - fizzbee-io/homebrew-fizzbee: A hombrew formula for fizzbee
A hombrew formula for fizzbee. Contribute to fizzbee-io/homebrew-fizzbee development by creating an account on GitHub.
https://github.com/fizzbee-io/homebrew-fizzbee
2 months ago
2
4
1
reposted by
Polar Signals
3 months ago
๐Writing our Rust database from scratch gave us a golden opportunity to implement deterministic simulation testing as a core feature. Learn about the architecture that gives us full control over DST ingredients. ๐ง
www.polarsignals.com/blog/posts/2...
loading . . .
Deterministic Simulation Testing in Rust: A Theater Of State Machines
How we designed our database for complete control over concurrency, time, randomness, and failure injection.
https://www.polarsignals.com/blog/posts/2025/07/08/dst-rust
0
22
11
reposted by
v
2 months ago
cacm.acm.org/practice/sys...
loading . . .
Systems Correctness Practices at Amazon Web Services โ Communications of the ACM
http://cacm.acm.org/practice/systems-correctness-practices-at-amazon-web-services/
0
1
2
reposted by
Chris
2 months ago
Whew. This one was some work. It's still very simple, but already has found some interesting stuff.
loading . . .
Add deterministic simulation tests to SlateDB by criccomini ยท Pull Request #694 ยท slatedb/slatedb
A slatedb-dst package is added in this PR. The package contains dst.rs, which contains a deterministic simulator that runs random operations against a SlateDB instance and verifies that the DB has ...
https://github.com/slatedb/slatedb/pull/694
0
13
2
i found what vibecode really does for me is help me jump in the first line of code earlier instead of overthinking everything.. i heard INTJ people tend to have trouble taking action until they've got everything perfectly planned out..
2 months ago
2
2
0
i have a bias that the more powerful a programmer is, the more fully they recognize the capabilities and potential of AI programming ๐ค
antirez.com/news/154
loading . . .
Coding with LLMs in the summer of 2025 (an update) - <antirez>
https://antirez.com/news/154
2 months ago
0
1
0
this is obviously the use case where io_uring best showcase its capabilities ๐ฒ
tangled.sh/@rockorager....
loading . . .
@rockorager.dev/lsr
ls but with io_uring
https://tangled.sh/@rockorager.dev/lsr
3 months ago
0
4
1
reposted by
Chris
3 months ago
So.. Pierre Barre just built an NFS server implementation on SlateDB. ZeroFS is today days old, but it's going to get wild. Bottomless storage and checkpoints mean instant FS snapshots (zero overhead).
github.com/Barre/zerofs
loading . . .
GitHub - Barre/zerofs: ZeroFS: The S3FS that does not suck.
ZeroFS: The S3FS that does not suck. Contribute to Barre/zerofs development by creating an account on GitHub.
https://github.com/Barre/zerofs
6
30
6
excited to have finally gotten this RFC merged in SlateDB ๐. the implementation around this has already moved a lot
github.com/slatedb/slat...
loading . . .
docs: Add rfc about Transaction by flaneur2020 ยท Pull Request #260 ยท slatedb/slatedb
fixes #248 this doc is still in WIP, but the discussion is open here. TLDR of this doc: Snapshot (with sequence number in keys) and WriteBatch are MUST before working on the Transaction feature SS...
https://github.com/slatedb/slatedb/pull/260
3 months ago
0
3
1
reposted by
Sung Kim
3 months ago
They research "What Makes a Base Language Model Suitable for RL?" or why RL on LLaMA consistently underperforms Qwen. Their key insights: - High-quality math data is key to RL scaling. - QA-style data helps, but it depends on task similarity. - Instruction data boosts QAโs effectiveness.
1
17
3
reposted by
Chris
4 months ago
Insane amount of SlateDB work going on: - snapshot reads - split/merge DBs (zero copy) - deterministic simulation testing And someone just pushed Python bindings in a PR! ๐คฏ
0
9
3
the essence of vibe coding seems to be about being a good rubber duck for LLMs. it's essentially just another form of rubber duck debugging ๐ค.
4 months ago
1
1
0
just noticed that we can't simply use std::sync::Mutex within a tokio task when there's some .await points living in its guard, because MutexGuard is marked as !Send, but the tokio tasks are possible to get scheduled across different threads ๐ฎ... we must use tokio::sync::Mutex at this situation..
4 months ago
3
3
0
reposted by
Chris
5 months ago
Responsive talkin about their storage engine for Kafka streams. (Built on SlateDB!)
add a skeleton here at some point
0
5
1
reposted by
David Calavera
5 months ago
How Hugging Face migrated terabytes of model data from Git LFS to their own storage backend, called Xet:
huggingface.co/blog/xet-on-...
loading . . .
Xet is on the Hub
Weโre on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/blog/xet-on-the-hub
2
7
5
i'd heard Volcano is widely used for running AI workloads on k8s. my initial quick look only showed me that it added batch processing support to k8s. batch processing - simply a task queue to run tasks one by one, why bother such complexities?
5 months ago
2
1
1
reposted by
Tanel Poder
5 months ago
0x.tools
xCapture v3: Linux Performance Analysis with Modern eBPF and DuckDB ๐๐
tanelpoder.com/posts/xcaptu...
1
29
12
ูฉ( 'ฯ' )ู
add a skeleton here at some point
5 months ago
0
3
0
opinion: a physical printer is the best read it later app ๐ค
6 months ago
0
3
0
reposted by
Qian Li
6 months ago
Highly recommend this!
add a skeleton here at some point
0
5
1
reposted by
Chris
6 months ago
We need more of this in the discourse: AI is normal technology. I actually disagree with the worldview articulated here, but it's closer to the truth than the superintelligence/singularity view IMHO.
loading . . .
AI as Normal Technology
A new paper that we will expand into our next book
https://www.aisnakeoil.com/p/ai-as-normal-technology?utm_source=post-email-title&publication_id=1008003&post_id=161317202&utm_campaign=email-post-title&isFreemail=true&r=9s9ch&triedRedirect=true&utm_medium=email
0
8
2
this weekend's paper reading is 'Sarathi: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills', which discusses Chunked Prefills.
6 months ago
1
1
0
reposted by
Dan McKinley
6 months ago
I have some thoughts about
boringtechnology.club
for you, as the work turns ten years old this weekend.
loading . . .
Choose Boring Technology
How to be old, for young people.
https://boringtechnology.club
2
31
9
reposted by
Sung Kim
6 months ago
Microsoft's Playwright, that enables reliable end-to-end testing for modern web apps (
playwright.dev
) built a Playwright MCP (
github.com/microsoft/pl...
) that provides browser automation using Playwright. This enables LLMs to interact with web pages through structured accessibility snapshots.
loading . . .
Fast and reliable end-to-end testing for modern web apps | Playwright
Cross-browser end-to-end testing for modern web apps
https://playwright.dev/
1
48
7
reposted by
Sung Kim
6 months ago
Fastrace provides a production-ready solution with seamless ecosystem integration, out-of-box OpenTelemetry support, and a more straightforward API that works naturally with the existing logging infrastructure.
fast.github.io/blog/fastrac...
loading . . .
Fastrace: A Modern Approach to Distributed Tracing in Rust | FastLabs / Blog
We develop fast Rust crates and release them fast.
https://fast.github.io/blog/fastrace-a-modern-approach-to-distributed-tracing-in-rust/
0
5
2
reposted by
Matt Butcher
7 months ago
WASI p3 is almost here!
dev.to/fermyon/look...
loading . . .
Looking Ahead to WASIp3
By: Joel Dice WebAssembly System Interface Preview 3 (or WASIp3 for short) is the next major release...
https://dev.to/fermyon/looking-ahead-to-wasip3-5aem
0
41
8
realized the difference of MCP server and the Function Calling in ChatGPT is that MCP is an protocol that interacts with the local clients, it communicates with the client with stdio. so it makes it possible to interact with the softwares installed on the desktop like Blender 3D.
7 months ago
1
1
0
recently i learned a trick about using LLM is: let LLM to write a better prompt for me, then use the generated prompt for work. ๐ค the generated prompt often has a more comprehensive consideration for the task ๐ค
7 months ago
1
3
0
reposted by
Chris
7 months ago
After nearly more than 5 months of discussion, SlateDB has accepted an RFC for synchronous commits and durability. This will be the foundation for transactions and snapshot serializable isolation. Thanks to
@flaneur2024.bsky.social
for powering through this!
github.com/slatedb/slat...
loading . . .
https://github.com/slatedb/slatedb/blob/main/rfcs/0008-synchronous-commit.md
2
11
1
read sqlancer's paper and found its idea is pretty simple & effective ๐ฒ: it picks a random row as a "pivot row," then generates a BOOLEAN condition with this row's random columns that returns TRUE, and validate whether the pivot row is included in the result or not.
7 months ago
1
1
0
i found the experience of using Cursor with Gemini Flash 2 is pretty nice for it's simply pretty fast response. ๐ฒ let me pause the subscription of Cursor for a while..
7 months ago
0
0
0
reposted by
HYTRADBOI
7 months ago
Talk 24: "Serverless primitives for the shared log architecture" by Stephen Balogh (
@sbalogh.bsky.social
)
1
9
6
Load more
feeds!
log in