Atlas: An LLM inference engine written from scratch in Rust and CUDA

(atlasinference.io)

4 points | by emrehan 10 hours ago ago

No comments yet.