HN
New
Show
Ask
Jobs
Built with Qwik
Π-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
(arxiv.org)
1 points | by
PaulHoule
5 hours ago ago
No comments yet.
No comments yet.