mllama patch modifies nn.LayerNorm globally #315

Open

tyler-romero opened this issue Oct 19, 2024 · 3 comments

@tyler-romero (Collaborator)

🐛 Describe the bug

Instead of only patching the transformers mllama module (`transformers.models.mllama.modeling_mllama`), `apply_liger_kernel_to_mllama` modifies `torch.nn.LayerNorm` globally.

The issue is here.
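
For context, a minimal sketch of why the current patch leaks globally: `modeling_mllama.nn` is the very same module object as `torch.nn`, so assigning through the alias rebinds the attribute for every importer (`DummyLayerNorm` below is a hypothetical stand-in for the Liger class):

```python
import torch.nn
from transformers.models.mllama import modeling_mllama

# `from torch import nn` in modeling_mllama binds the name `nn` to the
# torch.nn module object itself, so the two names are aliases:
assert modeling_mllama.nn is torch.nn

class DummyLayerNorm(torch.nn.LayerNorm):  # hypothetical stand-in for LigerLayerNorm
    pass

# Assigning through the alias mutates torch.nn for the whole process:
modeling_mllama.nn.LayerNorm = DummyLayerNorm
print(torch.nn.LayerNorm)  # DummyLayerNorm, not the original torch.nn.LayerNorm
```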

The fix would be to:
(1) not patch LayerNorm in Liger by assigning to `modeling_mllama.nn.LayerNorm` (because `modeling_mllama.nn` is the `torch.nn` module itself, this assignment is global),
(2) change `transformers.models.mllama.modeling_mllama` to not use `from torch import nn` and to instead import LayerNorm directly, as in `from torch.nn import LayerNorm`, and
(3) patch LayerNorm in Liger by assigning to `modeling_mllama.LayerNorm` instead (see the sketch below).
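
A minimal sketch of step (3), assuming transformers has been changed as in step (2) so that `LayerNorm` is a module-level name in `modeling_mllama`:

```python
from liger_kernel.transformers.layer_norm import LigerLayerNorm
from transformers.models.mllama import modeling_mllama

# Rebinds only the `LayerNorm` name inside the mllama module's own
# namespace; torch.nn.LayerNorm is left untouched for everyone else.
modeling_mllama.LayerNorm = LigerLayerNorm
```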

Reproduce

```
pip install transformers==4.45 liger-kernel-nightly
```

```python
from liger_kernel.transformers import apply_liger_kernel_to_mllama
from torch import nn

apply_liger_kernel_to_mllama()
print(nn.LayerNorm)
# <class 'liger_kernel.transformers.layer_norm.LigerLayerNorm'>
```

Versions

Environment Report:

Operating System: Linux-6.1.85+-x86_64-with-glibc2.35
Python version: 3.10.12
PyTorch version: 2.4.1+cu121
CUDA version: Not available
Triton version: 3.1.0
Transformers version: 4.45.0

@ByronHsu (Collaborator)

Curious, does it require changing the transformers source code? I think we can maybe raise a request.

@tyler-romero (Collaborator, Author)

Yeah, the proposed fix would unfortunately require a change to transformers. The way mllama was implemented differs very slightly from the conventions in other transformers modeling files.

@ByronHsu (Collaborator)

Sounds good. Let's try to send a PR there.
