[FEA] Feature to extract attention values from transformer heads #745

vivpra89 · 2023-09-20T18:28:55Z

🚀 Feature request

Ability to extract attention weights from various heads of transformer

Motivation

Plotting attention provides insights into the inner-workings and user behaviors that business teams can relate with. This is easily available with pytorch / Tensorflow.

Is there a way to convert the trained model to PT / TF models to capture the attention values?

vivpra89 added the status/needs-triage label Sep 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Feature to extract attention values from transformer heads #745

[FEA] Feature to extract attention values from transformer heads #745

vivpra89 commented Sep 20, 2023

[FEA] Feature to extract attention values from transformer heads #745

[FEA] Feature to extract attention values from transformer heads #745

Comments

vivpra89 commented Sep 20, 2023

🚀 Feature request

Motivation