Reddit r/MachineLearning·1 June 2026

Why our #1 LightGBM feature by importance made predictions worse [D]

In three lines: A LightGBM quantile regression model for watch price forecasting ranked a Bayesian target-encoded feature #1 by importance, yet a 4-seed × 3-variant ablation showed that including the feature worsened hold-out MAPE by 0.28pp. What the feature had learned was irreducible label noise (variation driven by unobserved factors), so it did not generalize.
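To make the terms concrete, here is a minimal sketch of the two ingredients the summary mentions: a Bayesian-style (smoothed) target encoding and a per-run ablation delta in MAPE percentage points. It is not the poster's pipeline. The shrinkage formula, the `prior_weight` parameter, and all function names are assumptions chosen for illustration, and the numbers in the usage lines are toy values, not results from the post.

```python
from collections import defaultdict
from statistics import mean


def smoothed_target_encoding(categories, targets, prior_weight=10.0):
    """Shrink each category's mean target toward the global mean.

    encoding = (category_sum + m * global_mean) / (n + m)
    where n is the category count and m is prior_weight (an assumed knob).
    """
    global_mean = mean(targets)
    sums = defaultdict(float)
    counts = defaultdict(int)
    for cat, y in zip(categories, targets):
        sums[cat] += y
        counts[cat] += 1
    return {
        cat: (sums[cat] + prior_weight * global_mean) / (counts[cat] + prior_weight)
        for cat in counts
    }


def mape(y_true, y_pred):
    """Mean absolute percentage error, expressed in percent."""
    return 100.0 * mean(abs(t - p) / abs(t) for t, p in zip(y_true, y_pred))


def ablation_delta_pp(mape_with, mape_without):
    """Average hold-out MAPE change, in percentage points, from adding a feature.

    Both arguments map (seed, variant) -> hold-out MAPE in percent.
    A positive result means the feature made hold-out error worse.
    """
    return mean(mape_with[key] - mape_without[key] for key in mape_with)


# Toy usage with made-up values (hypothetical, for illustration only).
encoding = smoothed_target_encoding(["a", "a", "b"], [10.0, 20.0, 30.0], prior_weight=1.0)
delta = ablation_delta_pp({(0, "v1"): 10.5, (1, "v1"): 11.5},
                          {(0, "v1"): 10.0, (1, "v1"): 11.0})
```

The point of the comparison is that split- or gain-based importance is computed on training data, so a feature that memorizes noise in the labels can rank first while still raising hold-out error. Averaging the with/without difference over several seeds and variants is one way to check whether the change is larger than run-to-run variation.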



Summary generated by Claude — human-verified
