SLA: a combination of full attention and linear attention used to sparsify attention in diffusion transformers. https://www.arxiv.org/pdf/2509.24006
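
A minimal PyTorch sketch of the general idea, not the paper's actual algorithm: score key/query blocks, run exact softmax attention only on the highest-scoring block pairs, and cover the rest with a cheap linear-attention term. The function name, block-selection heuristic, feature map, and the fixed sum of the two branches are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def sparse_linear_attention(q, k, v, block=64, keep_ratio=0.25):
    """q, k, v: (batch, heads, seq, dim); seq must be a multiple of `block`.
    Illustrative sketch only -- not the method from the paper."""
    B, H, N, D = q.shape
    nb = N // block
    scale = D ** -0.5

    # Pool queries/keys per block to estimate which block pairs matter.
    qb = q.view(B, H, nb, block, D).mean(dim=3)            # (B, H, nb, D)
    kb = k.view(B, H, nb, block, D).mean(dim=3)
    block_scores = torch.einsum('bhid,bhjd->bhij', qb, kb) * scale

    # Keep the top fraction of key blocks per query block for exact attention.
    k_keep = max(1, int(keep_ratio * nb))
    topk = block_scores.topk(k_keep, dim=-1).indices        # (B, H, nb, k_keep)
    keep_mask = torch.zeros_like(block_scores, dtype=torch.bool)
    keep_mask.scatter_(-1, topk, True)

    # Expand the block mask to token resolution: True = "critical" pair.
    token_mask = keep_mask.repeat_interleave(block, dim=2) \
                          .repeat_interleave(block, dim=3)  # (B, H, N, N)

    # Exact softmax attention restricted to the critical blocks.
    logits = torch.einsum('bhid,bhjd->bhij', q, k) * scale
    logits = logits.masked_fill(~token_mask, float('-inf'))
    out_full = torch.softmax(logits, dim=-1) @ v

    # Cheap linear attention (ELU+1 feature map) as a global low-cost term
    # standing in for the non-critical blocks.
    qf, kf = F.elu(q) + 1, F.elu(k) + 1
    kv = torch.einsum('bhnd,bhne->bhde', kf, v)              # (B, H, D, D)
    z = 1.0 / (torch.einsum('bhnd,bhd->bhn', qf, kf.sum(dim=2)) + 1e-6)
    out_lin = torch.einsum('bhnd,bhde,bhn->bhne', qf, kv, z)

    # Combine the two branches; how the real method classifies and weights
    # attention entries is more involved than this fixed sum.
    return out_full + out_lin

# Usage example with toy shapes.
q = torch.randn(1, 4, 256, 64)
k = torch.randn(1, 4, 256, 64)
v = torch.randn(1, 4, 256, 64)
out = sparse_linear_attention(q, k, v)   # (1, 4, 256, 64)
```

The point of the combination is cost: the exact softmax branch only touches a small fraction of the N x N block pairs, while the linear branch is O(N) in sequence length, so together they approximate full attention far cheaper than computing it densely.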