Adding the support of dense models distilled from moe models with the same architecture by vjanfaza · Pull Request #728 · quic/efficient-transformers

vjanfaza · 2026-01-16T01:38:02Z

In this PR, we are adding the support of meta-llama/Llama-Guard-4-12B which is a dense model distilled form llama4 scout moe model. The changes in pytorch_transforms.py file can be applied to any dense model distilled from a moe model with supported architecture in QEfficient.

quic-rishinr · 2026-02-12T05:12:42Z

@quic-amitraj please review the PR

quic-amitraj

LGTM

Signed-off-by: Vahid Janfaza <vjanfaza@qti.qualcomm.com>

vjanfaza requested review from ochougul, quic-amitraj, quic-hemagnih and quic-rishinr as code owners January 16, 2026 01:38

quic-amitraj approved these changes Feb 17, 2026

View reviewed changes

Support models that are dense distilled from moe

0143530

Signed-off-by: Vahid Janfaza <vjanfaza@qti.qualcomm.com>

quic-rishinr force-pushed the main-llama4-guard branch from 65c6a36 to 0143530 Compare February 19, 2026 06:32

quic-rishinr merged commit 4bd2239 into quic:main Feb 20, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Adding the support of dense models distilled from moe models with the same architecture#728

Adding the support of dense models distilled from moe models with the same architecture#728
quic-rishinr merged 1 commit intoquic:mainfrom
vjanfaza:main-llama4-guard

vjanfaza commented Jan 16, 2026

Uh oh!

quic-rishinr commented Feb 12, 2026

Uh oh!

quic-amitraj left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

vjanfaza commented Jan 16, 2026

Uh oh!

quic-rishinr commented Feb 12, 2026

Uh oh!

quic-amitraj left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants