SLA and SageSLA (SLA based on SageAttention) Code Update

1. We’ve just updated the Triton implementation of **SLA**. Training is now **more stable**, **faster**, and typically achieves **better training results**.

2. We’ve released the code for **SageSLA**, a very fast **SLA (Sparse-Linear Attention)** forward pass based on [SageAttention](https://github.com/thu-ml/SageAttention). It uses some code from [SpargeAttn](https://github.com/thu-ml/SpargeAttn). Please refer to the `SageSLA/` directory for how to use SageSLA.

Feel free to try it out!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SLA and SageSLA (SLA based on SageAttention) Code Update #9

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

SLA and SageSLA (SLA based on SageAttention) Code Update #9

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions