7 points | by zdw 12 hours ago
2 comments
Pretty sweet hack as it's orthogonal to quantisation. And while it uses more compute, it doesn't require more VRAM.
Maybe in the future circuits will become modular and composable like models are today?
[dead]