Hacker Timesnew | past | comments | ask | show | jobs | submitlogin

ONNX doesn't support the same level of quantization as GGML.

So basically GGML will run on hardware with less memory.



Or alternatively, bigger models with the same memory (just quantised harder).




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: