Zarr output too many files #1789
-
QuestionQuestionHi, I am running an Ocean Parcels simulation with ~190,000 particles per release and tracking them for 30 days. However, I am encountering a disk quota limit on the number of files allowed when running the simulation on a supercomputer due to the zarr output format. Is there a way to work around this issue? For instance, could I export the output directly as a single NetCDF file instead of generating heaps of files in the zarr output? Supporting code/error messages# Paste your code within this block import pandas as pd from shapely.geometry import Polygon, Point pathout = '/scratch/pawsey0106/sbensadon/OceanParcels/CoralBay/SensitivityAnalysis/' Define release schedule with 24-hour run-up timedef create_release_times(start_time, durations, intervals):
Loop through the years 1993-2023for year in range(1993, 2008):
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @SeanBensadon, thanks for your question. You can change the size of each parr-chunk using the Note that there was also a bug in Parcels that has impacted your workflow, where you release particles at different times. It has been patched (#1788) and will become part of the next version 3.1.1. But in any case, setting your own chunks would help A minimum for you could be # Define output file path
output_file = pset.ParticleFile(
name=f'/scratch/pawsey0106/sbensadon/OceanParcels/CoralBay/SensitivityAnalysis/pout/ParcelsOut_{year}.zarr',
outputdt=timedelta(hours=1),
chunks=(len(pset), 1),
) If you want, you can also set the Hope this helps! |
Beta Was this translation helpful? Give feedback.
Hi @SeanBensadon, thanks for your question. You can change the size of each parr-chunk using the
chunks
keyword when you create your ParticleFile object, see also the "note on output chunking" at https://docs.oceanparcels.org/en/latest/examples/tutorial_parcels_structure.html#4.-Execution-and-outputNote that there was also a bug in Parcels that has impacted your workflow, where you release particles at different times. It has been patched (#1788) and will become part of the next version 3.1.1. But in any case, setting your own chunks would help
A minimum for you could be