
Running models locally is starting to get interesting now. The 30B-A3B version in particular seems like a promising direction, though it is still out of reach on 16 GB of VRAM (the most accessible tier). Hoping for new Nvidia RTX cards with 24/32 GB of VRAM. It seems we might get to GPT-4-ish levels locally within a few years, which would be useful for a bunch of tasks.
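
A rough back-of-envelope sketch of why 16 GB is tight for a 30B model (the 4-bit quantization and ~2 GB KV cache/overhead figures are assumptions, not measurements):

    # Rough VRAM estimate for a quantized model, assuming the weights
    # dominate and ignoring activation memory.
    def estimate_vram_gb(params_billion, bits_per_weight, kv_cache_gb=2.0):
        weights_gb = params_billion * 1e9 * bits_per_weight / 8 / 1e9
        return weights_gb + kv_cache_gb

    # 30B parameters at 4-bit: ~15 GB of weights alone, plus KV cache and
    # runtime overhead -- over budget on a 16 GB card, comfortable on 24/32 GB.
    print(estimate_vram_gb(30, 4))  # ~17.0
    print(estimate_vram_gb(30, 3))  # ~13.25, closer to fitting in 16 GB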





I think we are just a tiny bit away from being able to really "code" with AI locally. Even if it isn't at Gemini 2.5 level, since it's free you can let it self-prompt a bit more and eventually solve the problem. If I could run a 200B model, or if the 30B were as good, that would be enough.
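
A minimal sketch of what that self-prompting loop could look like against a local OpenAI-compatible server (the localhost URL, "local-30b" model name, and check() callback are all placeholder assumptions, not any specific tool's API):

    from openai import OpenAI

    # Point the client at a locally hosted OpenAI-compatible endpoint
    # (e.g. llama.cpp server or similar); the key is ignored locally.
    client = OpenAI(base_url="http://localhost:8080/v1", api_key="none")

    def solve(task, check, max_rounds=5):
        feedback = ""
        for _ in range(max_rounds):
            reply = client.chat.completions.create(
                model="local-30b",
                messages=[{"role": "user", "content": task + feedback}],
            ).choices[0].message.content
            ok, notes = check(reply)  # e.g. run tests on the generated code
            if ok:
                return reply
            # Feed the failure back in and let the model try again.
            feedback = "\n\nPrevious attempt failed: " + notes
        return None

Because inference is free once the model is on your own GPU, the only cost of looping like this is time, which is the point the comment is making.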



