FlashAttention-T: Towards Tensorized Attention

(dl.acm.org)

44 points | by matt_d 2 hours ago

3 comments