I would recommend trying oMLX, which is much more performant and efficient than ... | Hacker News

Hacker Timesnew | past | comments | ask | show | jobs | submit

anon373839 71 days ago | parent | context | favorite | on: Qwen3.6-27B: Flagship-Level Coding in a 27B Dense ...

I would recommend trying oMLX, which is much more performant and efficient than LM Studio. It has block-level KV context caching that makes long chats and agentic/tool calling scenarios MUCH faster.

felikz 59 days ago [–]

and it horribly kernel panics when it is running for too long due to Apple does not give a sh over mlx, see list of issues: https://github.com/Harperbot/metal-guard#landed-here-searchi...

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact