Choice of Zarr compressor #1661

VeckoTheGecko · 2024-08-20T16:12:00Z

The writing of the zarr file in particlefile.py doesn't appear to set a compressor. Setting a compressor can significantly help in trading off compute for storage or vice versa [1]. The default chosen by zarr is the Blosc compressor, which is a "meta-compressor" (using different algorithms under the hood). Perhaps it's worth investigating other compressors to see if there's one that is best suited for our simulation output.

@erikvansebille do you know if zarr compressors have been a topic of discussion before?

JamiePringle · 2024-08-20T17:04:30Z

All the Zarr files I have made from parcels have been compressed. You need not set a compressor; the default is to compress. For efficiency I have found the correct setting of chunk size to be more important.
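To make the chunk-size point concrete, here is a rough sketch of how a chunk shape translates into the number of chunks and the uncompressed size of each chunk. The array shape, the chunk shapes, the 4-byte item size, and the helper name `chunk_layout` are all hypothetical values chosen for illustration; they are not taken from parcels.

```python
import math


def chunk_layout(shape, chunks, itemsize):
    """Return (number of chunks, uncompressed bytes per full chunk).

    Hypothetical helper: `shape` and `chunks` are tuples of equal length,
    `itemsize` is the size in bytes of one array element.
    """
    # Partial chunks at the array edge still count as one chunk each.
    n_chunks = math.prod(math.ceil(s / c) for s, c in zip(shape, chunks))
    chunk_bytes = math.prod(chunks) * itemsize
    return n_chunks, chunk_bytes


# Assumed layout: (trajectory, obs) with float32 values.
shape = (1_000_000, 365)
for chunks in [(1, 365), (10_000, 1), (10_000, 100)]:
    n_chunks, chunk_bytes = chunk_layout(shape, chunks, itemsize=4)
    print(f"chunks={chunks}: {n_chunks} chunks of {chunk_bytes} bytes each")
```

Very small chunks produce a large number of stored objects, while very large chunks force a reader to decompress more data than a typical access needs, so the best choice depends on the access pattern.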

VeckoTheGecko · 2024-08-20T18:19:06Z

I mean the type of compression that is used (there are various compressors available in zarr-developers/numcodecs). I agree about chunk size, but it would be good to also investigate the compression algorithm used, as I think we just rely on the default, which may not be best given our data.
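As a minimal sketch of what such an investigation could look like, the snippet below compares compression ratio and compression time for a few algorithms on synthetic data. It makes several assumptions: it uses Python's standard-library `zlib`, `bz2`, and `lzma` as stand-ins for the corresponding numcodecs codecs, the data is a made-up float32 random walk rather than real parcels output, and `synthetic_trajectory` and `benchmark` are hypothetical helper names. A real comparison would run the numcodecs codecs (including Blosc) on actual simulation output.

```python
import array
import bz2
import lzma
import random
import time
import zlib


def synthetic_trajectory(n_obs, seed=0):
    """Return float32 bytes of a smooth random walk (stand-in for particle data)."""
    rng = random.Random(seed)
    value = 0.0
    values = array.array("f")
    for _ in range(n_obs):
        value += rng.gauss(0.0, 0.01)
        values.append(value)
    return values.tobytes()


def benchmark(raw, codecs):
    """Return {name: (compression ratio, compression time in seconds)}."""
    results = {}
    for name, (compress, decompress) in codecs.items():
        start = time.perf_counter()
        packed = compress(raw)
        elapsed = time.perf_counter() - start
        # Check that the round trip is lossless before recording the result.
        assert decompress(packed) == raw
        results[name] = (len(raw) / len(packed), elapsed)
    return results


CODECS = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}

raw = synthetic_trajectory(100_000)
results = benchmark(raw, CODECS)
for name, (ratio, seconds) in results.items():
    print(f"{name}: ratio={ratio:.2f}, compress time={seconds * 1e3:.1f} ms")
```

The numbers depend heavily on the data, which is why the comparison should be repeated on representative output before drawing conclusions.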

CKehl · 2024-09-23T11:52:57Z

With all due respect, I get the impression you miss the point of the paper linked above. The high I/O time is attributed in large part to the loading of the fieldsets. Correct me if I am mistaken, but that doesn't have anything to do with the zarray compressor of the particle file, does it? I hope you get the answers you seek through your benchmarking approach.

VeckoTheGecko · 2024-09-23T12:26:48Z

Thanks for clarifying! :)

I haven't had the time yet to fully go through the paper. I've updated the description here to match.

VeckoTheGecko added this to Parcels development Aug 20, 2024

VeckoTheGecko converted this from a draft issue Aug 20, 2024

VeckoTheGecko added the discussion label Aug 21, 2024

VeckoTheGecko mentioned this issue Sep 23, 2024

Benchmarking Suite #1712

Open
