Add LayerNorm support for Vivado #1110
Conversation
First in line on my PR review TODO list; I expect to have time for this early next week.
Thank you very much!
pre-commit.ci autofix
@vloncar I hope this goes in the direction of what you had in mind for the performance validation. I ran synthesis in Vitis 2023.1, 2024.1, and 2025.1 for different input sizes to the LayerNorm and plotted FFs, LUTs, DSPs, BRAMs, latency, and II as a function of that input size. 2024.1 and 2025.1 are basically identical, whereas 2023.1 uses somewhat fewer resources but has worse latency. This is with the default ap_fixed<16,6> and the default target part.
Thanks, looks good. Do all reports say the timing is met? (No scheduling warnings, clock uncertainty met, etc.)
    static const unsigned dim = CONFIG_T::n_in / CONFIG_T::seq_len;
    data_T in_val[dim];
    res_T outval[dim];
    // Use a function_instantiate in case it helps to explicitly optimize unchanging weights/biases
It does not. I think this can be removed from new code.
test/pytest/test_layernorm.py (outdated)
    hls_model = hls4ml.converters.convert_from_keras_model(
        custom_epsilon_model, backend=backend, hls_config=custom_config, io_type='io_parallel', output_dir=output_dir
    )
    hls_model.compile()
This test would be faster if we used hls_model.write(), or skipped the writing/linking step entirely; we don't use the compiled model here. The later accuracy test already checks that the produced code is compilable.
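A minimal sketch of the suggested change, assuming the test keeps the conversion call quoted above (hls4ml models expose both write() and compile(); write() only emits the project files, while compile() also builds the C simulation library, which is the slow step this comment targets):

```python
# Sketch: use write() instead of compile() when the compiled library is unused
hls_model = hls4ml.converters.convert_from_keras_model(
    custom_epsilon_model,
    backend=backend,
    hls_config=custom_config,
    io_type='io_parallel',
    output_dir=output_dir,
)
hls_model.write()  # emits the HLS project without the slow shared-library build
```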
test/pytest/test_layernorm.py (outdated)
    # Predict
    y_keras = model.predict(data).flatten()
    y_hls = hls_model.predict(data).flatten()
    np.testing.assert_allclose(y_keras, y_hls, rtol=0, atol=atol, verbose=True)
Why is atol a global variable?
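One way to address this, sketched below; the tolerance value and the stand-in predictions are placeholders, not the actual test contents:

```python
import numpy as np
import pytest


@pytest.mark.parametrize('atol', [5e-2])  # placeholder tolerance, passed per-test
def test_layernorm_accuracy(atol):
    # Stand-ins for model.predict(...) and hls_model.predict(...) from the real test
    y_keras = np.zeros(8)
    y_hls = np.zeros(8)
    np.testing.assert_allclose(y_keras, y_hls, rtol=0, atol=atol, verbose=True)
```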
hls4ml/converters/pytorch/core.py (outdated)
|
|
    if not ((len(input_shapes[0])) == 3):
        raise Exception(
            'input size is not currently supported by hls4ml; '
Would be good to say "Input shape <some shape> is not supported, only ...".
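A sketch of what that message could look like (the exact wording and the description of the supported shape are assumptions):

```python
# Hypothetical rewording following the suggestion above
if len(input_shapes[0]) != 3:
    raise Exception(
        f'Input shape {input_shapes[0]} is not supported by hls4ml; '
        'only 3-dimensional (batch, seq_len, features) inputs are supported.'
    )
```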
    node.get_output_variable().dim_names = dim_names
    elif (
        isinstance(node, LayerNormalization)
        and not model.config.config['HLSConfig']['Model']['ChannelsLastConversion'] == "off"
With the proposed change in #1352 we'll never get to the "off" check here.
Commit history:

* paser_mht
* change parser and modify keras_to_hls
* IR_mutihead_attention
* IR done
* create mha file in template
* mha .h file dummy algo
* config of mha
* update mha config
* dummy mha
* add transpose into mha
* projection_of_qkv_in_mha
* mha_first_draft
* able to predict model correct
* delete some unnassary comments
* delete comments
* resource strategy of transformer
* change sm lagacy
* update MHA, optimized
* support resource
* update
* dense_muti_dim_support
* parallel execute dense
* updates
* add_layerNorm_support
* MHA updated
* LayerNorm_bug_fix
* update bit precision
* config update
* add some comment
* run pre-commit
* Added support on QMultiHeadAttention, QLayerNormalization, and quantized_softmax
* updated on hls4ml transformer
* trying to clean the diff
* trying to clean the diff
* trying to clean the diff
* trying to clean the diff
* trying to clean the diff
* undo vhdl -> verilog change
* halfway working layernorm + test
* layernorm is now pretty functional
* layernorm on pytorch also
* minor cleanup
* more cleanup, pre-commit
* test for mha which kinda works maybe if you squint
* multihead attention working on keras and pytorch
* fiddly precision / accuracy changes for layernorm
* fix lookup table and label loops
* remove dense_seq
* undo qkeras changes
* fix merge conflict residue
* remove non-layernorm changes
* change to uniform LUT and fix precision
* [pre-commit.ci] auto fixes from pre-commit hooks
* fix encodings issue with dos2unix
* add Vitis as another tested backend
* Address PR feedback
* [pre-commit.ci] auto fixes from pre-commit hooks
* fix too-long lines
* fix merge issue
* trigger pre-commit
* re-add missing math import
* [pre-commit.ci] auto fixes from pre-commit hooks
* addressing Vladimir's latest comments
* change also pytorch test for layernorm and revert change to build command
* sideport changes to channels-last converter from 1352

Co-authored-by: Ethan <[email protected]>
Co-authored-by: Jan-Frederik Schulte <[email protected]>
Co-authored-by: LostEcho365 <[email protected]>
Co-authored-by: Rian Flynn <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Description
This PR adds support for Layer Normalization using either Keras or PyTorch with the Vivado backend in io_parallel mode. The implementation uses a lookup table for the inverse square root; the inputs to the lookup table are spaced logarithmically for better accuracy.
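To illustrate the idea, here is a minimal NumPy sketch of a logarithmically spaced inverse-square-root table; the table depth, input range, and indexing scheme are assumptions for illustration and do not mirror the actual HLS code:

```python
import numpy as np

N_TABLE = 1024                    # table depth (assumption)
X_MIN, X_MAX = 2.0**-16, 2.0**8   # covered input range (assumption)

# 1/sqrt(x) changes fastest for small x, so logarithmic spacing puts
# more table entries where the function needs them most.
points = np.geomspace(X_MIN, X_MAX, N_TABLE)
inv_sqrt_table = 1.0 / np.sqrt(points)


def inv_sqrt_lut(x: float) -> float:
    # Map x to its position in log space and read the nearest entry
    frac = (np.log2(x) - np.log2(X_MIN)) / (np.log2(X_MAX) - np.log2(X_MIN))
    idx = int(np.clip(round(frac * (N_TABLE - 1)), 0, N_TABLE - 1))
    return inv_sqrt_table[idx]


# LayerNorm-style use: normalize with the LUT instead of an exact 1/sqrt
v = np.array([0.5, -1.0, 2.0, 0.25])
eps = 1e-5
print((v - v.mean()) * inv_sqrt_lut(v.var() + eps))
```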
Tests have been added for both Keras and PyTorch parsing.
Credit is due to @Ethan0Jiang and @LostEcho365 (Zhixing Jiang and Dennis Yin) for their Vivado implementation and Keras parsing support; my contributions were changing the inverse square root lookup table implementation, implementing PyTorch support, and adding unit tests. (Here's a link to their pre-print.) The original code authors have given permission for their code to be merged into hls4ml.
Linked issue: #1109
Type of change
Tests
Two unit tests added:
test/pytest/test_layernorm.py and test/pytest/test_layernorm_pytorch.py

Checklist
I have run pre-commit on the files I edited or added.