Add a llama 3.1 toy test with cross entropy test #50
Conversation
If we pick the maximum token at each decode step, we should get a somewhat consistent cross-entropy loss. This uses a prebaked llama model. The `irpa` file should stay static, but the `mlir` model should update as features are enabled.
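The greedy-decode-plus-cross-entropy idea above can be sketched as follows. This is a minimal illustration, not the PR's actual test code: the function names and the toy logits are hypothetical, and a real test would take logits from the compiled llama model rather than hand-built arrays.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over a 1-D logit vector.
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

def greedy_decode_ce(step_logits):
    """Pick the argmax (maximum) token at each decode step and
    accumulate the cross-entropy loss of the chosen tokens.

    step_logits: list of 1-D logit arrays, one per decode step
                 (hypothetical stand-in for real model outputs).
    Returns (chosen tokens, mean cross-entropy).
    """
    tokens, losses = [], []
    for logits in step_logits:
        tok = int(np.argmax(logits))        # greedy: take the max token
        probs = softmax(logits)
        losses.append(-np.log(probs[tok]))  # CE of the chosen token
        tokens.append(tok)
    return tokens, float(np.mean(losses))

# Toy example: two decode steps over a 3-token vocabulary.
steps = [np.array([0.1, 2.0, 0.3]), np.array([1.5, 0.2, 0.1])]
tokens, loss = greedy_decode_ce(steps)
```

Because argmax decoding is deterministic, the resulting loss should be stable run to run, which is what makes it usable as a regression check even as the `mlir` model evolves.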
Looks good! The top-level `sharktank_models` path SGTM. We can shuffle things around later as more tests are added, too.
A few mostly structural and style comments.
Signed-off-by: Rob Suderman <[email protected]>
Very nice! Thanks for the changes!