Back to feed
Reddit r/LocalLLaMA·

I can fit 28% more context after building llama.cpp with OpenBLAS. Huh?

Signal
35
Hype
15
In three linesUser reports llama.cpp built with Vulkan + OpenBLAS fits 28% more context (112,896 tokens vs 87,808) on Qwen 3.6 27B. Cause unclear: expected behavior, bug, or measurement artifact.
Read source
Your take?
LlamaOpen sourceInfrastructure

Summary generated by Claude — human-verified