HN
New
Show
Ask
Jobs
Built with Qwik
Cassandra: Enabling Reasoning LLMs at Edge via Self-Speculative Decoding
(arxiv.org)
4 points | by
chrsw
8 hours ago ago
No comments yet.
No comments yet.