Description
Hello,
I don't understand why ConvTranspose3d runs in INT8 with implicit quantization, but falls back to FP16 when I use explicit quantization or QAT with the TensorRT Model Optimizer.
I am using the TensorRT Model Optimizer with the default INT8 configuration.
I think this is a problem specific to the TensorRT Model Optimizer, but I'm not sure.
I am sharing the quantization and ONNX export code below.
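Roughly, the quantization step looks like this (a simplified sketch, not my full code; `model` and `calib_loader` are placeholders for the actual network and calibration data):

```python
# Simplified sketch of the Model Optimizer PTQ step with the default
# INT8 config (model / calib_loader are placeholders).
import torch
import modelopt.torch.quantization as mtq

def forward_loop(model):
    # Feed a few calibration batches so the quantizers can collect
    # activation ranges.
    for batch in calib_loader:
        model(batch)

model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop=forward_loop)
```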
Here is the result after TensorRT quantization in implicit mode.

So here, ConvTranspose3D is in INT8.
Inference time in implicit mode

9 ms inference time, so implicit mode is fine.
Here is the explicit ONNX quantization.

Here is the result after TensorRT quantization of the explicit ONNX.

Inference time in explicit mode.

In explicit quantization: inference time of 11 ms.
Command used to run the TensorRT build:
.\trtexec.exe --onnx=surround_occ_int8.onnx --noDataTransfers --useCudaGraph --useSpinWait --profilingVerbosity=detailed --verbose --fp16 --int8
I tried with --int8 and without --int8, but got the same result.
My ONNX export code:
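The export is roughly the following (simplified; input shape, names, and opset are illustrative, not the exact values from my script):

```python
# Simplified export of the quantized model to ONNX with Q/DQ nodes, so
# TensorRT builds it in explicit-quantization mode. Shape and opset are
# illustrative placeholders.
import torch

dummy_input = torch.randn(1, 3, 32, 128, 128).cuda()
torch.onnx.export(
    model,                      # quantized model from the step above
    dummy_input,
    "surround_occ_int8.onnx",
    opset_version=17,
    input_names=["input"],
    output_names=["output"],
)
```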
I've been looking through the documentation and forum posts for a few weeks now, and I'm out of ideas.
