Eh, I've been building native code for decades and hit quite a few roadblocks trying to get llama.cpp building with CUDA support on my Ubuntu box. Library version issues and such. Ended up down a rabbit hole related to codenames for the various NVIDIA architectures... It's a project on hold for now.
Weirdly, the Python bindings built without issue with pip.