Tag · llama.cpp
# llama.cpp
All posts tagged "llama.cpp".
Lucebox on Olares One — Episode 4: The llama-server submodule serves it up to you 1h later
`test_dflash` compiles, great. But to serve over HTTP I need `llama-server`, which is built from the submodule. And the submodule has its own cmake invocation, where I had forgotten to add `-rpath-link`. And boom, an hour later, here we go again.
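For context, a missing `-rpath-link` usually has to be threaded through as a linker flag when configuring the submodule's build. A minimal sketch, assuming a standard out-of-tree CMake build and an illustrative library directory (the actual path and flags from the post are not shown here):

```shell
# Hypothetical example: the library directory is an assumption, not from the post.
EXTRA_LIB_DIR=/usr/local/cuda/lib64

# Pass -rpath-link to the linker via CMake so the submodule's link step
# can resolve transitive shared-library dependencies at build time.
cmake -B build \
  -DCMAKE_EXE_LINKER_FLAGS="-Wl,-rpath-link,${EXTRA_LIB_DIR}" \
  .
cmake --build build --target llama-server
```

Note that `-rpath-link` only affects link-time resolution; unlike `-rpath`, it does not embed a runtime search path in the resulting binary.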
Read →
Lucebox on Olares One — Episode 1: 134 t/s on RTX 3090, what about my rig?
You're scrolling r/LocalLLaMA, you see a post claiming 134 t/s on Qwen3.6-27B with an RTX 3090 thanks to Lucebox. Of course you want to try it on your Olares One. Spoiler: it'll take 12 hours of compile time and 6 Docker builds. Episode 1.
Read →