coding · free
llama.cpp
— / 5 · Free
About
LLM inference in C/C++: runs models locally on CPU and GPU across platforms with minimal dependencies.