Muon: An optimizer for hidden layers in neural networks

(kellerjordan.github.io)

4 points | by tosh 6 hours ago ago

No comments yet.