Backend impacted
The MLX implementation
Operating system
Linux
Hardware
Metal with MLX
Description
There is an issue with the current MLX transformer implementation: the attention mask built for the transformer turns out never to be set, so attention runs without any masking.
This does not affect the current inference setup, as only the big transformer and the depformer use this implementation (the codec does not), and both process data one step at a time, so no mask is needed.
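Why the missing mask is harmless for step-by-step decoding can be illustrated with a minimal sketch. This is not the project's actual code; it uses NumPy in place of MLX, and the `attend` helper and shapes are assumptions for illustration. It compares a full-sequence pass with a causal mask against one-token-at-a-time decoding over cached keys/values, where each step can only see past and current positions and therefore needs no mask:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attend(q, k, v, mask=None):
    # Scaled dot-product attention; `mask` is additive (-inf above the diagonal).
    scores = q @ k.T / np.sqrt(q.shape[-1])
    if mask is not None:
        scores = scores + mask
    return softmax(scores) @ v

rng = np.random.default_rng(0)
T, D = 4, 8
x = rng.standard_normal((T, D))

# Full-sequence pass: a causal mask is required to hide future positions.
causal = np.triu(np.full((T, T), -np.inf), k=1)
full = attend(x, x, x, mask=causal)

# Step-by-step decoding with a KV cache: a single-token query at each step
# only ever attends over keys up to the current step, so mask=None is fine.
steps = []
for t in range(T):
    q = x[t:t + 1]       # single-token query
    k = v = x[:t + 1]    # cached keys/values up to step t
    steps.append(attend(q, k, v))  # no mask on purpose
stepwise = np.concatenate(steps, axis=0)

assert np.allclose(full, stepwise)
```

The two paths match exactly, which is why the unset mask only becomes a bug if this transformer implementation is ever fed more than one new token per step.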
Extra information
.
Environment
.