Back to feed
Reddit r/LocalLLaMA·

Qwen 3.6-35B-A3B with 977 tk/s prompt processing and 262k context window on Intel Arc B70 Pro

Signal
72
Hype
25
In three linesQwen 3.6-35B-A3B achieves 977 tokens/s prompt processing and 262k context window on Intel Arc B70 Pro via llama.cpp with SYCL backend. User reports stable, usable local inference for complex tasks including game generation.
Read source
Your take?
QwenCode generationOpen sourceInfrastructure

Summary generated by Claude — human-verified