Back to feed
Reddit r/LocalLLaMA·

MTP hyperparameter search

Signal
45
Hype
15
In three linesHyperparameter search on MTP and speculative decoding with llama-server on Qwen 3.6 27B. 6% improvement (13.24 tokens/sec) via Optuna. Python script provided.
Read source
Your take?
QwenOpen sourceInfrastructure

Summary generated by Claude — human-verified