Theoretically, can the input and output data types of the Add op be FP8?

Hi, @ajrasane 
Residual add is a common structure in CNN models, and it is time-consuming because of the data type conversion from FP8 to FP16. Is it theoretically feasible to keep the data type as FP8 in the future?

In my ONNX model, I tried changing the dwconv output channels to 16 so that the layer can be quantized. But this leads to bad layer fusion: the number of ops grows from 300+ to 1500+, which in turn adds more data type conversion ops. Performance drops dramatically, from 1200+ FPS to 280+ FPS.
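For anyone who wants to compare the two exported graphs, here is a minimal sketch that counts nodes per op type and reports which counts changed. It only looks at the ONNX side (for example, how many `QuantizeLinear`/`DequantizeLinear` nodes were added), so it does not reflect the layer fusion done later by the runtime. The helper names `op_histogram` and `histogram_diff` and the toy node lists are hypothetical and are not taken from the linked models.

```python
from collections import Counter
from types import SimpleNamespace


def op_histogram(nodes):
    """Count nodes per op type; each node only needs an `op_type` attribute."""
    return Counter(node.op_type for node in nodes)


def histogram_diff(before, after):
    """Return {op_type: after - before} for op types whose count changed."""
    keys = set(before) | set(after)
    # Counter returns 0 for missing keys, so new or removed op types are handled.
    return {k: after[k] - before[k] for k in sorted(keys) if after[k] != before[k]}


# Toy stand-ins for graph nodes (hypothetical, not from the linked models).
baseline = [SimpleNamespace(op_type=t) for t in ["Conv", "Add", "Relu"]]
padded = [
    SimpleNamespace(op_type=t)
    for t in ["QuantizeLinear", "DequantizeLinear", "Conv", "Add", "Relu"]
]

diff = histogram_diff(op_histogram(baseline), op_histogram(padded))
print(diff)
```

With the `onnx` package installed, the same functions should work on real models by passing `onnx.load(path).graph.node`, since each node there exposes `op_type`.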

The ONNX models are here: [https://github.com/PonyPinkPie/export/tree/main/ckpt](https://github.com/PonyPinkPie/export/tree/main/ckpt)

<img width="420" height="990" alt="Image" src="https://github.com/user-attachments/assets/7e10d8ce-4258-458c-b19e-9dd417f63fa0" />

<img width="549" height="1256" alt="Image" src="https://github.com/user-attachments/assets/9393e707-a664-4bcf-9845-d8b8fc81a51f" />
