a) CUDA won in a free market because NVidia showed they cared about it
b) Llama has support for OpenCL (via CLBlast) and Apple Metal
The OpenCL support already has a custom kernel for token generation.
a) CUDA won in a free market because NVidia showed they cared about it
b) Llama has support for OpenCL (via CLBlast) and Apple Metal
The OpenCL support already has a custom kernel for token generation.