
Deepseek3.2 W4afp8 convert fail #662

@whybeyoung

Description


When I run:

```shell
# Do per-tensor fp8 calibration
torchrun --nproc-per-node 8 --master_port=12346 ptq2.py \
  --model_path /work/models/v32-mid \
  --config /work/models/TensorRT-Model-Optimizer/modelopt/DeepSeek-V3.2-Exp/inference/config_671B_v3.2.json \
  --quant_cfg FP8_DEFAULT_CFG \
  --output_path ds_v32_fp8_per_tensor_calibration
```

the conversion fails with the error shown in the attached screenshot:

[Image: error screenshot]

Environment: Hopper H20, 96 GB.
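For context on what the calibration step computes: per-tensor FP8 calibration derives a single scale per tensor from its observed absolute maximum. Below is a minimal, self-contained sketch of that idea in plain Python; it is illustrative only and does not reflect the actual implementation in `ptq2.py` or TensorRT-Model-Optimizer.

```python
# Minimal sketch of per-tensor FP8 (E4M3) scale calibration.
# Illustrative only -- not the ModelOpt implementation.
FP8_E4M3_MAX = 448.0  # largest finite value representable in float8 e4m3


def per_tensor_scale(values):
    """One scale for the whole tensor: amax / fp8_max."""
    amax = max(abs(v) for v in values)
    return amax / FP8_E4M3_MAX


def fake_quant(values, scale):
    """Simulated quantize/dequantize round trip using the per-tensor scale."""
    return [round(v / scale) * scale for v in values]


weights = [0.5, -1.25, 3.0, -448.0]
s = per_tensor_scale(weights)
print(s)            # scale chosen so the largest value maps onto the FP8 range
print(fake_quant(weights, s))
```

A single scale per tensor (as opposed to per-channel or per-block) is what distinguishes the `FP8_DEFAULT_CFG` per-tensor path invoked by the command above.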
