[webgpu] support arbitrary input_channel size for im2col #27038

wenqinI · 2026-01-16T08:24:34Z

Description

This PR supports vec1 for arbitrary input_channel in im2col kernel, which could bring performance gain to more models.

Like for yolov8n_pose model, there is about ~7% gain for whole model, and ~50% for those conv2d op which input_size are not multiple of 4.

Motivation and Context

wenqinI · 2026-01-16T08:24:56Z

@guschmue @fs-eire @qjia7 PTAL, thanks!

qjia7 · 2026-01-16T08:57:57Z

onnxruntime/core/providers/webgpu/nn/im2col_matmul.cc

  // If the status of this condition is uncertain, the feature must be disabled.
  const bool use_subgroup = false;
-  Im2ColMatMulProgram im2col_mm_program{has_bias, tile_m, tile_n, use_subgroup};
+  const uint32_t vec_size = channel_input % 4 == 0 ? 4 : 1;


How about extending it into 1, 2 or 4? const uint32_t vec_size = GetMaxComponents(channel_input);

support vec1 for im2col

10d490c

wenqinI marked this pull request as draft January 16, 2026 08:46

qjia7 reviewed Jan 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[webgpu] support arbitrary input_channel size for im2col #27038

[webgpu] support arbitrary input_channel size for im2col #27038

wenqinI commented Jan 16, 2026

Uh oh!

wenqinI commented Jan 16, 2026

Uh oh!

qjia7 Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[webgpu] support arbitrary input_channel size for im2col #27038

Are you sure you want to change the base?

[webgpu] support arbitrary input_channel size for im2col #27038

Conversation

wenqinI commented Jan 16, 2026

Description

Motivation and Context

Uh oh!

wenqinI commented Jan 16, 2026

Uh oh!

qjia7 Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants