Back to feed
Reddit r/MachineLearning·

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

Signal
45
Hype
25
In three linesMechanistic interpretability experiment on Qwen3.5-35B-A3B: a routed expert (E114, layer 14) correlates with first-person self-examination register during generation. Author documents results before git release, using W/S/Q decomposition of MoE routing.
Read source
Your take?
QwenOpen source

Summary generated by Claude — human-verified