Enables Matmul and Gemm for float16 on CPU #25913

xadupre · 2025-09-01T15:44:55Z

Description

Completes the implementation of Matmul and Gemm for float16 on CPU.

Motivation and Context

See issue #25824, a benchmark should validate that because float32 is usually faster than float16 on CPU.

xadupre added 2 commits September 1, 2025 17:43

Enables Matmul and Gemm for float16 on CPU

65581d5

remove unnecessary includeé

eee1858

xadupre mentioned this pull request Sep 1, 2025

[Performance] ONNX FP16 model is having performance bottle neck when compared to FP32 variant #25824

Open

xadupre added 2 commits September 1, 2025 18:25

add tempalte<>

c159d6c

fix documentation

34acf01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enables Matmul and Gemm for float16 on CPU #25913

Enables Matmul and Gemm for float16 on CPU #25913

xadupre commented Sep 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

Enables Matmul and Gemm for float16 on CPU #25913

Are you sure you want to change the base?

Enables Matmul and Gemm for float16 on CPU #25913

Conversation

xadupre commented Sep 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Uh oh!

xadupre commented Sep 1, 2025 •

edited

Loading