Reddit r/LocalLLaMA

Some tests with qwen3.6 27b + 35b a3b about MTP vs ngram-mod

Signal
35
Hype
15
In three lines
A user benchmarks Qwen 3.6 27B and 35B with two optimization techniques, MTP and ngram-mod. Finding: MTP degrades performance on a React code generation task, while ngram-mod preserves quality. Setup: Qwen 27B at Q6_K and Qwen 35B at Q8 on a dual-GPU machine (16 GB + 12 GB).
Qwen · Code generation · Benchmarks

Summary generated by Claude — human-verified