I'm using jax 0.4.34, flax 0.9.0 and orbax 0.7.0. Until recently I was using orbax 0.4.1. Certain checkpoints created with v0.4.1 have the following directory structure:
For at least some of these checkpoints, when I try to restore with a PyTreeCheckpointHandler, I get:
ValueError: NOT_FOUND: Error opening "cast" driver: Error opening "zarr" driver:
Metadata at "<myparam>/kernel/scale/.zarray" in OCDBT database at
gs://<checkpoints>/<model>/<run>/<step>/default/ does not exist
Downgrading orbax back to 0.4.1 results in the same error. Did I corrupt my checkpoint state somehow? How can I rehabilitate these checkpoints?
For reference, my modern checkpoint dirs look like this:
and I can load them just fine.