Ask HN: Is there any local LLM I can run without a GPU for local agentic AI workflows?

5 points | by limondas 6 hours ago

2 comments

Try the qwen3-coder or qwen3-coder-next models, whichever fits your configuration. These are mixture-of-experts models, so the runtime may load only the active experts into the GPU.
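To illustrate the "only the active experts" point, here is a minimal sketch of top-k expert routing. It is a toy in plain Python, not how qwen3-coder or any particular runtime implements it; the names `route_top_k` and `moe_forward` and the scaling "experts" are hypothetical. Note that the chosen experts typically change from token to token, so the full set of weights still has to be stored somewhere (for example in system RAM).

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of router scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalize over the chosen experts
    return [(i, probs[i] / norm) for i in top]

def moe_forward(x, experts, router_logits, k=2):
    """Combine outputs of only the selected experts; the rest are never called."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))

# Hypothetical toy experts: each one just scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
router_logits = [0.1, 2.0, -1.0, 1.5]  # made-up router scores for one token

selected = route_top_k(router_logits, k=2)
print([i for i, _ in selected])  # indices of the experts used for this token
```

With 4 experts and k=2, only half of the expert weights take part in computing this token, which is why such models can be cheaper to run than their total size suggests.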

limondas 4 hours ago

Thanks for your reply. But it's too big for my PC. On my PC, models of around 1.5GB get about 20 tokens/s, which is too slow for an agentic workflow.
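For a rough sense of why 20 tokens/s feels slow here, a back-of-the-envelope sketch: the step count and tokens per step below are assumptions chosen for illustration, not measurements, and the estimate ignores prompt processing time.

```python
def seconds_per_step(output_tokens, tokens_per_second):
    # Time to generate one response at a fixed decode speed.
    return output_tokens / tokens_per_second

def workflow_seconds(steps, output_tokens_per_step, tokens_per_second):
    # Total generation time for a multi-step agent run.
    return steps * seconds_per_step(output_tokens_per_step, tokens_per_second)

# Assumed workload: 10 agent steps of 300 generated tokens each, at 20 tokens/s.
total = workflow_seconds(steps=10, output_tokens_per_step=300, tokens_per_second=20)
print(total)  # 150.0 seconds of pure generation
```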