7 points | by skilled 2 days ago ago
1 comments
The PDF is not linked from the HF repo so it might be easy to miss:
DeepSeek-V3.2-Exp: Boosting Long-Context Efficiency with DeepSeek Sparse Attention
https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/D...
The PDF is not linked from the HF repo so it might be easy to miss:
DeepSeek-V3.2-Exp: Boosting Long-Context Efficiency with DeepSeek Sparse Attention
https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/D...