Gemma3-4b QNN example fixes by qti-kromero · Pull Request #2106 · microsoft/Olive

qti-kromero · 2025-08-18T20:44:31Z

Describe your changes

Updates various handlers to handle Gemma3
Adds custom module/custom class fields to HF model
Olive recipe for Gemma3-4b

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

examples/gemma3/qnn/user_script.py

examples/gemma3/qnn/gemma3-4b-vision-qnn-config.json

examples/gemma3/qnn/user_script.py

olive/common/hf/wrapper.py

examples/gemma3/qnn/user_script.py

xiaoyu-work · 2025-08-20T04:20:16Z

Did you test the inference of the output model? I tested https://huggingface.co/google/gemma-3-270m but it failed with onnxruntime-genai when inferencing (this will be fixed soon), but I'm not sure if you got an error as well for Gemma3-4b with qnn.

qti-kromero · 2025-08-20T15:47:41Z

Did you test the inference of the output model? I tested https://huggingface.co/google/gemma-3-270m but it failed with onnxruntime-genai when inferencing (this will be fixed soon), but I'm not sure if you got an error as well for Gemma3-4b with qnn.

We have intelligible output from the text portion of the model within our KPIs but we have not successfully prepared the vision portion of the model or combined the two models together in a pipeline of OGA. Do you have details on what specific portion of the model you had issues with?

xiaoyu-work · 2025-08-22T00:30:00Z

Did you test the inference of the output model? I tested https://huggingface.co/google/gemma-3-270m but it failed with onnxruntime-genai when inferencing (this will be fixed soon), but I'm not sure if you got an error as well for Gemma3-4b with qnn.

We have intelligible output from the text portion of the model within our KPIs but we have not successfully prepared the vision portion of the model or combined the two models together in a pipeline of OGA. Do you have details on what specific portion of the model you had issues with?

I encountered a bug in model builder when testing gemma-3-270m, but it has now been fixed. Interestingly the 4B model works without issues, maybe there are some difference between their configs. If it is tested, I'm fine. Thanks.

examples/gemma3/qnn/user_script.py

examples/gemma3/qnn/gemma3-4b-text-qnn-config.json

examples/gemma3/qnn/app.py

examples/gemma3/qnn/custom_gemma3_4b_datasets.py

examples/gemma3/qnn/app.py

examples/gemma3/qnn/custom_gemma3_4b_embedding.py

examples/gemma3/qnn/gemma3-4b-embedding-qnn-config.json

examples/gemma3/qnn/gemma3-4b-text-qnn-config.json

examples/gemma3/qnn/genai_config.json

examples/gemma3/qnn/app.py

examples/gemma3/qnn/olive_req.txt

review-notebook-app · 2025-09-04T20:46:40Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

This fixes the issue of text model repeating words in the output.

examples/gemma3/qnn/genai/app.py

olive/common/hf/utils.py

qti-kromero

apparently I have to make a comment

examples/gemma3/qnn/gemma-3-4b.ipynb

examples/gemma3/qnn/env_setup.sh

examples/gemma3/qnn/qnn_req.txt

examples/gemma3/qnn/README.md

examples/gemma3/qnn/gemma3-4b-text-qnn-config.json

olive/common/hf/wrapper.py

olive/model/handler/hf.py

olive/common/hf/utils.py

qti-kromero · 2025-09-23T20:39:18Z

@jambayk @xiaoyu-work wondering if this could get reviewed when you have a chance

xiaoyu-work · 2025-10-20T22:33:14Z

Hi @qti-kromero , is this changed required for recipe microsoft/olive-recipes#125?

qti-kromero · 2025-10-20T22:40:29Z

Hi @qti-kromero , is this changed required for recipe microsoft/olive-recipes#125?

correct, although we may need to retest these changes since we haven't revisited them for awhile due to higher priority work

xiaoyu-work · 2025-10-20T22:42:11Z

Hi @qti-kromero , is this changed required for recipe microsoft/olive-recipes#125?

correct, although we may need to retest these changes since we haven't revisited them for awhile due to higher priority work

Thanks for confirming this! Let me know if they are ready for review and I will review both PRs together.

jambayk · 2026-02-12T17:25:52Z

Closing for now since the PR has become stale

qti-kromero added 7 commits August 13, 2025 14:29

Initial commit

d494c82

Add README and start config

ddf3ea8

QuaRot passing, working on GptqQuantizer

1f54074

Work on dataset integration

6cae95f

Data processing works

2d0872e

Fix lint issues and cleanup

6a6f67d

Adding vision resources

cd24ddf

github-advanced-security bot found potential problems Aug 18, 2025

View reviewed changes

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Aug 18, 2025

View reviewed changes

jambayk reviewed Aug 18, 2025

View reviewed changes

olive/common/hf/wrapper.py Show resolved Hide resolved

qti-kromero added 2 commits August 18, 2025 18:22

Add Gemma3 vision configurations

636e982

Fix linting error

b4ea7a3

github-advanced-security bot found potential problems Aug 19, 2025

View reviewed changes

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Aug 19, 2025

View reviewed changes

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

Vision model onnx conversion working

1f69af3

github-advanced-security bot found potential problems Aug 19, 2025

View reviewed changes

examples/gemma3/qnn/user_script.py Fixed Show fixed Hide fixed

Enable quant on text model

aed20ec

qti-kromero added 2 commits August 25, 2025 20:11

Improve README

ba0633c

Merge remote-tracking branch 'origin/main' into dev/qti-kromero/gemma3

5ad910d

xiaoyu-work reviewed Aug 28, 2025

View reviewed changes

examples/gemma3/qnn/user_script.py Outdated Show resolved Hide resolved

examples/gemma3/qnn/gemma3-4b-text-qnn-config.json Outdated Show resolved Hide resolved

examples/gemma3/qnn/gemma3-4b-text-qnn-config.json Outdated Show resolved Hide resolved

Add files from Prudvhi

acbdfdc

github-advanced-security bot found potential problems Aug 28, 2025

View reviewed changes

examples/gemma3/qnn/app.py Fixed Show fixed Hide fixed

examples/gemma3/qnn/custom_gemma3_4b_datasets.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Aug 28, 2025

View reviewed changes

qti-kromero added 2 commits September 2, 2025 10:42

Updates

f7178ae

Updates

bd70ff4

github-advanced-security bot found potential problems Sep 3, 2025

View reviewed changes

Add olive requirements file

c962cee

update

360d9c2

github-advanced-security bot found potential problems Sep 4, 2025

View reviewed changes

examples/gemma3/qnn/olive_req.txt Fixed Show fixed Hide fixed

Update Olive scripts for gemma3

5fcda5c

prudhvi-qti and others added 3 commits September 5, 2025 02:53

Update few python packages

14018ee

Use the same llava dataset for text model as well

1f89241

This fixes the issue of text model repeating words in the output.

Minor cleanup

7d4ced8

github-advanced-security bot found potential problems Sep 9, 2025

View reviewed changes

Add system requirements

a0bd703

qti-kromero commented Sep 16, 2025

View reviewed changes

qti-kromero added 2 commits September 18, 2025 10:38

Merge remote-tracking branch 'origin/main' into dev/qti-kromero/gemma3

f712bdc

Remove examples

f685073

qti-kromero commented Sep 18, 2025

View reviewed changes

olive/model/handler/hf.py Outdated Show resolved Hide resolved

olive/common/hf/utils.py Outdated Show resolved Hide resolved

Fix review comments

5dff155

qti-kromero changed the title ~~Add Gemma3-4b QNN example~~ Gemma3-4b QNN example fixes Sep 18, 2025

qti-kromero marked this pull request as ready for review September 18, 2025 18:15

qti-kromero requested review from jambayk and xiaoyu-work September 18, 2025 18:15

jambayk closed this Feb 12, 2026

Conversation

qti-kromero commented Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Checklist before requesting a review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

xiaoyu-work commented Aug 20, 2025

Uh oh!

qti-kromero commented Aug 20, 2025

Uh oh!

xiaoyu-work commented Aug 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

review-notebook-app bot commented Sep 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qti-kromero left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qti-kromero commented Sep 23, 2025

Uh oh!

xiaoyu-work commented Oct 20, 2025

qti-kromero commented Aug 18, 2025 •

edited

Loading