Back to feed
arXiv cs.AI·

Supervised sparse auto-encoders for interpretable and compositional representations

Signal
72
Hype
25
In three linesSupervised sparse auto-encoders improve model interpretability by aligning learned features with human semantics. Tested on Stable Diffusion 3.5, they enable compositional generalization and image editing through feature-level intervention.
Read source
Your take?
Image generationPapers

Summary generated by Claude — human-verified