@honicky.bsky.social
📤 10
📥 30
📝 11
Tired of creating a new notebook and writing AnnData/pandas/numpy/??? just to peek at a H5AD file? Me too! So I created a DuckDB extension that can read AnnData files.
loading . . .
24 days ago
1
0
0
Working with a large set of AnnData files on S3 is tough if you don't have the metadata indexed, so I created a tool to help.
github.com/honicky/annd...
It uses partial downloads to dramatically speed up extracting the metadata without downloading the whole file. `pip install anndata-metadata`
loading . . .
GitHub - honicky/anndata-metadata: A Python library and CLI tool for extracting metadata from AnnData .h5ad files, both locally and on S3
A Python library and CLI tool for extracting metadata from AnnData .h5ad files, both locally and on S3 - honicky/anndata-metadata
https://github.com/honicky/anndata-metadata
9 months ago
1
1
0
I promised to explain how you can build your very own, custom GenePT embeddings. Here's another Lab Note:
learning-exhaust.hashnode.dev/lab-notes-cu...
Enjoy!
loading . . .
Gene Embeddings, LLMs, and a $6.83 Experiment That Might Matter
Better Gene Embeddings Through Prompt Engineering (Or, at Least, We Tried)
https://learning-exhaust.hashnode.dev/lab-notes-custom-genept-embeddings
11 months ago
0
0
0
Bio data is amazing!!! Don't be intimidated!
learning-exhaust.hashnode.dev/lab-notes-co...
loading . . .
GenePT vs scGPT - what I learned in a few days
In the spirit of “Learning in Public,” and “learning exhaust,” I’m going to start adding “Lab Notes” blog posts that chronicle little discoveries or failures that I have along the way to a larger goal...
https://learning-exhaust.hashnode.dev/lab-notes-comparing-genept-and-scgpt
12 months ago
0
0
0
Data products often scale cost like hardware, iterate like software, and scale performance like... data products.
learning-exhaust.hashnode.dev/data-product...
loading . . .
Data Products are Different
Why you have to manage data products differently than software and hardware.
https://learning-exhaust.hashnode.dev/data-products-are-different
about 1 year ago
1
0
0
Oh man, writing up the Cerebras paper on Weight Streaming was a rabbit hole filled with parallel algorithms.
learning-exhaust.hashnode.dev/one-thing-i-...
@picocreator.bsky.social
@eugeneyan.com
@swyx.io
@latentspacepod.bsky.social
loading . . .
Weight streaming might work well on GPUs!
I wonder why it isn't a thing...
https://learning-exhaust.hashnode.dev/one-thing-i-learned-weight-streaming-might-work-well-on-gpus
about 1 year ago
1
4
2
First post to bsky: I've been trying to post a bit more frequently and in smaller bites, so I'm going to try to pick one interesting thing I've learned from papers I read, and then write a quick post about them. Here's my first one:
learning-exhaust.hashnode.dev/one-thing-i-...
loading . . .
Embeddings are task specific
Modern training techniques for embedding models mean that we should probably include a prompt or fine tune to a specific type of task.
https://learning-exhaust.hashnode.dev/one-thing-i-learned-embeddings-are-task-specific
about 1 year ago
1
1
0
you reached the end!!
feeds!
log in