Back to feed
Reddit r/LocalLLaMA·

Any idea why prunning can improve perplexity?

Signal
35
Hype
25
In three linesA r/LocalLLaMA user reports an experiment combining WANDA pruning with data-free quantization (HQQ). Pruning before quantization improves perplexity in this specific setup. The author seeks explanations and feedback on this preliminary research result.
Read source
Your take?
Open sourceBenchmarks

Summary generated by Claude — human-verified