Zarr
@zarr.dev
๐ค 417
๐ฅ 16
๐ 16
Chunked, Compressed, & Cloud-native N-dimensional arrays See
https://zarr.dev/
for more information.
pinned post!
Your friendly neighborhood
#chunked
,
#compressed
, &
#cloud-ready
#n-dimensional
array format is now verified as
@zarr.dev
on
@bsky.app
! Visit us at
zarr.dev
for more info!
loading . . .
Zarr
Have beautiful data in Zarr? Show us on Bluesky!
https://zarr.dev/
10 months ago
0
13
3
reposted by
Zarr
scverse
about 1 month ago
Anndata now supports
@zarr.dev
v3, great for both performance and interoperability. Check out our guide for more info, especially sharding:
anndata.readthedocs.io/en/stable/tu...
. Donโt worry though, we remain fully v2 backwards compatible - upgrade fearlessly!
1
6
2
reposted by
Zarr
ESIP - Earth Science Information Partners
3 months ago
โป๏ธ 7/8 - Webinar: Reusable Deep Learning with EOPF Zarr Samples - ESA Copernicus Earth Observation Processor Framework Learn how to reuse a deep learning model originally trained with data from the openEO API of the Copernicus Dataspace Ecosystem (CDSE).
#datascience
#deeplearning
loading . . .
Welcome! You are invited to join a webinar: Reusable Deep Learning with EOPF Zarr Samples. After registering, you will receive a confirmation email about joining the webinar.
The aim of this webinar is to demonstrate how to reuse a deep learning model originally trained with data from the openEO API of the Copernicus Dataspace Ecosystem (CDSE) with the data and tools provi...
https://us02web.zoom.us/webinar/register/WN_HypA7ltmRcini3l3-JwVMg?mc_cid=5f91e28e78&mc_eid=b0a36df6df#/registration
1
2
1
reposted by
Zarr
Al Merose (he/him)
3 months ago
I've been waiting a few years for this: Zarr v2 support was just launched in Google Earth Engine! Check out this demo that loads ARCO-ERA5 into GEE ๐ฅ
code.earthengine.google.com/67ac1e48848d...
developers.google.com/earth-engine...
1
16
6
reposted by
Zarr
Lazaro Alonso
3 months ago
If you missed today's presentation at the LPS by
@thejeran.bsky.social
on Browzarr, a browser-based visualization toolkit for exploring and analyzing Zarr data stores, go and check out the version online, and open issues and do feature requests on GitHub.
github.com/EarthyScienc...
loading . . .
0
3
1
reposted by
Zarr
Petra Korleviฤ
3 months ago
doodles of two Tree of Life department speakers
@sangerinstitute.bsky.social
this week, Evgeny Zakharov talking about insanely large scale barcode sequencing and Jerome Kelleher talking about large scale human genomic data analysis. that's a lot of data ๐งฌ๐
0
4
2
reposted by
Zarr
Open Sustainable Technology
4 months ago
We welcome a new project to
OpenSustain.tech
#climate
#sustainability
#opensource
#opensustain
#openscience
github.com/zarr-develop...
loading . . .
GitHub - zarr-developers/zarr-python: An implementation of chunked, compressed, N-dimensional arrays for Python.
An implementation of chunked, compressed, N-dimensional arrays for Python. - zarr-developers/zarr-python
https://github.com/zarr-developers/zarr-python
0
10
2
ICYMI: Check out this alternative storage approach for variation data using
#Zarr
. Rather than grouping all data for a given variant together, its groups all data for a given field, and store as chunked, compressed N-D arrays (tensors). ๐คฉ
doi.org/10.1093/giga...
loading . . .
Analysis-ready VCF at Biobank scale using Zarr
AbstractBackground. Variant Call Format (VCF) is the standard file format for interchanging genetic variation data and associated quality control metrics.
https://doi.org/10.1093/gigascience/giaf049
3 months ago
0
7
0
reposted by
Zarr
Rita Strack
4 months ago
OME-Zarr 4 life
1
9
3
reposted by
Zarr
Kit Macleod
4 months ago
"The commitment from major entities like Copernicus, USGS, and Google Earth Engine signals a paradigm shift in how we will store, access, and analyze geospatial data in the cloud...
#Zarr
is poised to become a cornerstone of the cloud-native geospatial ecosystem, unlocking new frontiers..."
add a skeleton here at some point
0
3
2
reposted by
Zarr
Earthmover
5 months ago
1/ Our team just got back from an inspiring week at the
@cloudnativegeo.org
conference in Snowbird, Utah, and one thing is clear:ย ๐๐๐ซ๐ซ ๐๐๐จ๐ฉ๐ญ๐ข๐จ๐ง ๐ข๐ฌ ๐ฌ๐ฎ๐ซ๐ ๐ข๐ง๐ ๐๐๐ซ๐จ๐ฌ๐ฌ ๐ ๐๐จ๐ฌ๐ฉ๐๐ญ๐ข๐๐ฅ.
1
8
4
reposted by
Zarr
Copernicus ECMWF
6 months ago
Curious about climate patterns? The new ERA Explorer app lets you compare 85 years of climate data worldwide. Analyse climatological averages at any map point or explore month-by-month data.Dive deeper with ERA Explorer!
era-explorer.climate.copernicus.eu?lat=50.86&ln...
#C3S
#beautifuldata
loading . . .
3
85
54
๐ง Interested in extending
#Zarr
? Let us know what you think about ZEP10 "Generic extensions". Current plan is for voting on the proposal mid-June. ๐
github.com/zarr-develop...
loading . . .
ZEP10: Generic extensions (v3.2 spec changes) by joshmoore ยท Pull Request #344 ยท zarr-developers/zarr-specs
This is the implementation of ZEP10 which introduces a generic extensions object. ย ย ย ย โ๐ฝ ๐๐ฝ Rendered ZEP10 page: https://zeps--67.org.readthedocs.build/en/67/draft/ZEP0010.html ๐๏ธ Proposed...
https://github.com/zarr-developers/zarr-specs/pull/344
4 months ago
0
4
2
reposted by
Zarr
Joe Hamman
5 months ago
Just back from
#CNG2025
, I wrote up some thoughts on the traction the
@zarr.dev
format is gaining in the broader geospatial industry (specifically Earth Observation). I also summarize what features and tools the user and developer communities are asking for to fully leverage Zarr. ๐๐๐
add a skeleton here at some point
0
3
2
reposted by
Zarr
Earthmover
5 months ago
1/ ๐กย Our latest blog post in the fundamentals series, written by
@tegnicholas.bsky.social
, demystifies cloud-optimized scientific data formats! Read more:
earthmover.io/blog/fundame...
loading . . .
Fundamentals: What is Cloud-Optimized Scientific Data?
What cloud-optimized data really means, and how Zarr and Icechunk enable fast access to massive scientific datasets in cloud object storage.
https://earthmover.io/blog/fundamentals-what-is-cloud-optimized-scientific-data
2
16
12
reposted by
Zarr
Joe Hamman
5 months ago
๐ Big launch today: introducing Flux โ a game-changing way to serve geospatial data via standard APIs in seconds. Built for scale, speed, and simplicity.
add a skeleton here at some point
1
5
3
reposted by
Zarr
Ryan Abernathey
6 months ago
This one was a lot of fun to work on! Why
@xarray.bsky.social
and
@zarr.dev
are faster than DuckDB and Parquet for gridded data...from first principles.
add a skeleton here at some point
0
8
4
reposted by
Zarr
Copernicus ECMWF
6 months ago
What's behind ERA Explorer's speed? Meet the ZARR archive! ๐ ZARR revolutionises data processing, enabling fast, efficient access to over 80 years of climate data. Test out its speed ๐
era-explorer.climate.copernicus.eu?lat=50.86&ln...
#C3S
@zarr.dev
#beautifuldata
loading . . .
1
19
10
reposted by
Zarr
Jackson Burns
6 months ago
Just started using Zarr - literally unbelievably good.
add a skeleton here at some point
0
2
2
๐คฉ "
#NASA
... leveraged
#Icechunkโs
virtualization capabilities to present the entire collection of over a
#million
files as a single analysis-ready
#Zarr
data cube. "
add a skeleton here at some point
6 months ago
0
6
1
reposted by
Zarr
Joe Hamman
6 months ago
๐จ New blog post ๐จ In it, we show off our recent work deploying
#icechunk
on top of
#NASA's
existing archives of Earth observation data. The results: 100x speed up when extracting time series from existing datasets stored as netCDF.
add a skeleton here at some point
0
12
5
reposted by
Zarr
Josh Moore
6 months ago
ICYMI: A MATLAB implementation of
@zarr.dev
has dropped from
@mathworks.bsky.social
๐
github.com/mathworks/MA...
loading . . .
GitHub - mathworks/MATLAB-support-for-Zarr-files: This contribution provides an interface in MATLAB to read and write Zarr files. Zarr is a file format for scientific data (like HDF5 and netCDF) which...
This contribution provides an interface in MATLAB to read and write Zarr files. Zarr is a file format for scientific data (like HDF5 and netCDF) which is optimized for cloud storage and parallel co...
https://github.com/mathworks/MATLAB-support-for-Zarr-files/tree/main
1
21
7
reposted by
Zarr
Al Merose (he/him)
7 months ago
Really cool new neuroscience benchmark just dropped. Data is in
@zarr.dev
and demos use tensorstore.
zapbench-release.storage.googleapis.com/landing.html
loading . . .
ZAPBench
ZAPBench evaluates how well different models can predict the activity of over 70,000 neurons in a novel larval zebrafish dataset.
https://zapbench-release.storage.googleapis.com/landing.html
0
6
3
reposted by
Zarr
Michael Morehead
8 months ago
Very happy to announce our winter release of syGlass: 2.2.0! Now control syGlass with your voice. We've also upgraded the auto-segmentation, mask handling & proofreading, and the narration system. We've added basic support for
@zarr.dev
. Check out the changelog:
www.syglass.io/blog/announc...
loading . . .
0
14
9
reposted by
Zarr
Chris Holmes
7 months ago
You, yes you, can help the Cloud Native Geospatial Movement! Have you benefitted from Cloud-Optimized GeoTIFFโs? SpatioTemporal Asset Catalogs? Zarr, COPC or GeoParquet? And the ecosystem around them? Learn how to give back in my post:
cholmes.medium.com/you-yes-you-...
loading . . .
You, yes you, can help the Cloud Native Geospatial Movement!
Have you benefitted from COGโs, STAC, Zarr, COPC or GeoParquet? Join us in Utah for the first CNG Conference and help just by attending.
https://cholmes.medium.com/you-yes-you-can-help-the-cloud-native-geospatial-movement-d2b58e40804d
1
7
4
reposted by
Zarr
napari
7 months ago
We're a bit late to the bsky game so we appreciate a boost!
@loicaroyer.bsky.social
@haesleinhuepf.bsky.social
@florianjug.bsky.social
@lhinderling.bsky.social
@kbias.bsky.social
@zarr.dev
@xarray.bsky.social
@scipyconf.bsky.social
@drlachie.bsky.social
@henriqueslab.bsky.social
@globias.bsky.social
0
10
1
reposted by
Zarr
Joe Hamman
8 months ago
If you missed the Zarr-Python 3 release party last week, no worries because today, we released zarr-python 3.0.1 and numcodecs 0.15.0. release notes ๐
zarr.readthedocs.io/en/stable/re...
loading . . .
Release notes โ zarr 3.0.1 documentation
https://zarr.readthedocs.io/en/stable/release-notes.html#jan-17-2025
0
10
2
reposted by
Zarr
Aafke Gros
8 months ago
Microscopy Nodes loads data from a
@zarr.dev
bucket, making it easy to use open data archives that host Zarr data! This is a mitotic cell from Walther et al. 2018, that's hosted publicly on the Image Data Resource ๐ฑ
loading . . .
1
10
2
๐ That's what friends are for!!
#opensource
#scientific
#python
#ftw
๐ช
add a skeleton here at some point
9 months ago
0
4
2
๐ Zarr-Python 3 is here! ๐ - Full support for Zarr v3 spec - Chunk-sharding for more efficient data storage - Major performance boosts with async I/O & parallel compression ๐ป pip install --upgrade zarr ๐ป conda install --channel conda-forge zarr Blog post:
https://buff.ly/3C3OwYw
loading . . .
9 months ago
2
56
24
Hi, Zarr community! ๐ ๐จ Big news: Zarr-Python 3.0 is dropping next week! ๐จ If your project depends on Zarr-Python, make sure to check out the migration guide to get ready for the update. Guide:
https://buff.ly/3W0cLxd
More updates soon! ๐ง๐ปโ๐ป
loading . . .
https://buff.ly/3W0cLxd
9 months ago
0
27
14
reposted by
Zarr
Al Merose (he/him)
9 months ago
And if you really want a challenge, this is probably the biggest singular Zarr dataset in existence (~6PiB):
github.com/google-resea...
loading . . .
GitHub - google-research/arco-era5: Recipes for reproducing Analysis-Ready & Cloud Optimized (ARCO) ERA5 datasets.
Recipes for reproducing Analysis-Ready & Cloud Optimized (ARCO) ERA5 datasets. - google-research/arco-era5
https://github.com/google-research/arco-era5?tab=readme-ov-file#analysis-ready-data
1
3
1
reposted by
Zarr
Kyle Harrington
9 months ago
For the
@napari.fosstodon.org.ap.brid.gy
and large image data out there, I made some updates to the tiled loading zarr demo. It still has limitations (scale isn't respected, etc.), but this is an easy way to run it on your own code.
github.com/napari/napar...
More in ๐งต๐
loading . . .
[wip] Example files for 2D and 3D progressive loading by kephale ยท Pull Request #6043 ยท napari/napari
Fixes/Closes This is the first step at addressing #5561 and https://github.com/orgs/napari/projects/24?pane=issue&itemId=29670723 Description This PR introduces: napari.experimental._progressi...
https://github.com/napari/napari/pull/6043
1
9
1
reposted by
Zarr
TEGNicholas.bsky.social
9 months ago
More of this from all the companies that depend on the same tools the scientists use/develop please
add a skeleton here at some point
0
4
2
reposted by
Zarr
Viren Jain
9 months ago
New batch read support has significantly improved efficiency when reading from sharded formats like zarr v3 and neuroglancer precomputed:
github.com/search?q=rep...
and many other general performance and efficiency improvements (e.g.
github.com/google/tenso...
)
1
1
1
Looking for a job where you get to play with
#Zarr
๐ง๐ง ๐ง๐งโ ๐
add a skeleton here at some point
9 months ago
0
2
0
reposted by
Zarr
Joe Hamman
9 months ago
In the talk, I made a few simple points: - Separation of storage and compute is key to unlocking the scaling potential of cloud - Cloud optimized data formats are key (example:
@zarr.dev
and
#icechunk
) - API services should be stateless/serverless and should be able to scale horizontally [0->N]
1
2
1
Hi Zarrites! ๐๐ป We have our bi-weekly community meeting today. ๐๏ธ These meetings are a great place to share updates, showcase your projects, raise an issue, or converse with the vibrant community. ๐ง๐ปโ๐ป โจJoin here:
https://buff.ly/3SHCzNN
10 months ago
0
2
0
Please help the
#Zarr
community welcome
@normanrz.bsky.social
as the newest member of the Steering Council! ๐๐ผ
zarr.dev/blog/steerin...
loading . . .
Steering council membership update
Please welcome the new steering council member, Norman Rzepka (normanrz)!
https://zarr.dev/blog/steering-council-update-2024/
10 months ago
0
10
4
reposted by
Zarr
TEGNicholas.bsky.social
10 months ago
I'm at
#AGU2024
this week! If you want to talk about anything
@xarray.bsky.social
, Pangeo,
@zarr.dev
, or
@cworthy.bsky.social
, message me! I have a poster on the VirtualiZarr package I wrote on Wednesday morning
agu.confex.com/agu/agu24/me...
but also several other projects I've worked on...
loading . . .
VirtualiZarr: Create Virtual Zarr Stores Using Xarray Syntax
VirtualiZarr is a tool for creating โvirtualโ Zarr stores. Virtual Zarr stores ...
https://agu.confex.com/agu/agu24/meetingapp.cgi/Paper/1725217
1
11
5
reposted by
Zarr
Al Merose (he/him)
10 months ago
I have a few ideas: - create kurchunk indexes on the corpus via
@tomnicholas.bsky.social
โs VirtualiZarr
github.com/zarr-develop...
- Use
@openclimatefix.org
โs hypergrib
github.com/JackKelly/hy...
1/
loading . . .
GitHub - zarr-developers/VirtualiZarr: Create virtual Zarr stores from archival data files using xarray syntax
Create virtual Zarr stores from archival data files using xarray syntax - zarr-developers/VirtualiZarr
https://github.com/zarr-developers/VirtualiZarr
1
3
1
reposted by
Zarr
Development Seed
10 months ago
๐ IN21D: Scaling Data Discovery w Dynamic Spatial Aggregations Using STAC Metadata (@emmapaz.com) ๐ GC23A: Choosing Earth Foundation Models: From Research Grade to Community Maintained (Wei Ji Leong) ๐ IN31C: VirtualiZarr: Create Virtual Zarr Stores Using Xarray Syntax (Aimee, Max & Sean)
#AGU24
1
4
1
reposted by
Zarr
Josh Moore
10 months ago
And there's of course
github.com/grimbough/Rarr
if you're interested,
@jerry-shannon.bsky.social
. Nothing like
#Zarr
in
#R
(to make a mouthful)
loading . . .
GitHub - grimbough/Rarr: A simple native R reader for Zarr Arrays
A simple native R reader for Zarr Arrays. Contribute to grimbough/Rarr development by creating an account on GitHub.
https://github.com/grimbough/Rarr
3
7
3
reposted by
Zarr
Daniel Rothenberg
10 months ago
Also nice coverage in the
@nytimes.com
, although not sure I'd call GenCast an "AI Agent" -
www.nytimes.com/2024/12/04/s...
... and I guess some e-mails are warranted to clarify, but I hope to see forecasts published elsewhere than Earth Engine and BigQuery. Standard Zarr files on GCS, please?
loading . . .
Google Introduces A.I. Agent That Aces 15-Day Weather Forecasts
GenCast, from the companyโs DeepMind division, outperformed the worldโs best predictions of deadly storms as well as everyday weather.
https://www.nytimes.com/2024/12/04/science/google-ai-weather-forecast.html
1
8
1
reposted by
Zarr
Joe Hamman
10 months ago
Also on Thursday afternoon, I'll be giving an invited talk titled "Seamless Arrays: A Full Stack, Cloud-Native Architecture for Fast, Scalable Data Access". It combines all that we've been working on for the past year including
@zarr.dev
v3,
#icechunk
, and Xpublish.
agu.confex.com/agu/agu24/me...
loading . . .
Seamless Arrays: A Full Stack, Cloud-Native Architecture for Fast, Scalable Data Access
Just about everyone agrees on what the ideal Earth system data service would pr...
https://agu.confex.com/agu/agu24/meetingapp.cgi/Paper/1645428?utm_campaign=3602356-Newsletters%20-%202024&utm_source=hs_email&utm_medium=email&_hsenc=p2ANqtz-80zwdlp1ds74QkMSCo99FgYM2T_VHX_yoXDvftzHSCpP1Uxnc1nIZdB2eqwHPrTiPXEht4
1
3
1
reposted by
Zarr
Piyush Garg
10 months ago
We have been experimenting with NVIDIA DALI for deep learning training and inference work with Pytorch and zarr-enabled data loaders. It really has optimized our training and inference pipeline.
github.com/NVIDIA/DALI
loading . . .
GitHub - NVIDIA/DALI: A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications. - NVIDIA/DALI
https://github.com/NVIDIA/DALI
1
6
1
Thanks to
gitub.com/ldeakin
the list of
#zarr
implementations is now shiny and new! โจ
zarr.dev/implementati...
๐ If any are missing or need updating, please let us know.
10 months ago
0
9
1
#beautifuldata
in
#zarr
๐
add a skeleton here at some point
10 months ago
0
2
0
reposted by
Zarr
Aafke Gros
11 months ago
And with OME-Zarr loading, due to the pyramidal format, you can design your video at small-scale and only replace to full-scale at the final render! (data from Granita Lokaj, @
s3.embl.de/microscopyno...
)
1
17
3
#beautifuldata
add a skeleton here at some point
10 months ago
0
2
0
reposted by
Zarr
Hao Yin
about 1 year ago
Build your own online integrated platform for single-cell & spatial transcriptomic data using WebAtlas pipeline Cell2location StabMap Unify image & gene expression files into Zarr format Compatible with SpatialData Based on Vitessce framework
#Naturemethods
2024
www.nature.com/articles/s41...
0
7
3
Load more
feeds!
log in