You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If directly run eval_multigpu.py without any training, after the script downloaded and attempted to loaded the shards raised:
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
Checking the model sizes under for example
all_models/adapters/gate/summary/
gives only 1KB, which suspect the model is not ready to run before a training.
But it could also be my PyTorch version, is there any version requirement for PyTorch used?
Let me know if you need further details.
The text was updated successfully, but these errors were encountered:
If directly run eval_multigpu.py without any training, after the script downloaded and attempted to loaded the shards raised:
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
Checking the model sizes under for example
all_models/adapters/gate/summary/
gives only 1KB, which suspect the model is not ready to run before a training.
But it could also be my PyTorch version, is there any version requirement for PyTorch used?
Let me know if you need further details.
The text was updated successfully, but these errors were encountered: