Any idea why pruning can improve perplexity?
Signal: 35 · Hype: 25
In three lines

An r/LocalLLaMA user reports an experiment combining WANDA pruning with data-free quantization (HQQ). Pruning before quantization improves perplexity in this specific setup. The author seeks explanations and feedback on this preliminary research result.
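For readers unfamiliar with the pipeline, here is a minimal sketch of the prune-then-quantize order described above. It assumes the Wanda scoring rule (weight magnitude times the L2 norm of the matching input feature, compared within each output row). The function names `wanda_prune` and `quantize_rtn` are hypothetical, and the quantizer is plain symmetric round-to-nearest used as a stand-in; it does not reproduce HQQ or the poster's actual setup.

```python
import math


def wanda_prune(weights, activations, sparsity=0.5):
    """Zero the lowest-scoring weights in each output row.

    weights:     rows of shape (out_features, in_features)
    activations: calibration samples of shape (n_samples, in_features)
    Score:       |W[i][j]| * L2 norm of input feature j (Wanda-style).
    """
    n_in = len(weights[0])
    # L2 norm of each input feature across the calibration samples
    norms = [math.sqrt(sum(x[j] ** 2 for x in activations)) for j in range(n_in)]
    n_prune = int(n_in * sparsity)
    pruned = []
    for row in weights:
        scores = [abs(w) * norms[j] for j, w in enumerate(row)]
        # indices of the n_prune smallest scores in this row
        drop = set(sorted(range(n_in), key=lambda j: scores[j])[:n_prune])
        pruned.append([0.0 if j in drop else w for j, w in enumerate(row)])
    return pruned


def quantize_rtn(row, bits=4):
    """Symmetric round-to-nearest quantization of one row (stand-in, NOT HQQ)."""
    qmax = 2 ** (bits - 1) - 1
    scale = max(abs(w) for w in row) / qmax or 1.0  # avoid a zero scale
    return [round(w / scale) * scale for w in row]


# Toy data (made up for illustration): one output row, one calibration sample.
weights = [[0.1, -2.0, 0.5, 0.05]]
activations = [[10.0, 0.1, 1.0, 1.0]]

pruned = wanda_prune(weights, activations, sparsity=0.5)
quantized = [quantize_rtn(row, bits=4) for row in pruned]
```

In this toy case the largest-magnitude weight (-2.0) is pruned because its input feature has a small activation norm, which is the main way this scoring differs from plain magnitude pruning. The sketch only shows the order of operations and makes no claim about why perplexity improved in the reported experiment.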
Summary generated by Claude — human-verified