Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]
Signal: 45
Hype: 25
In three lines
Mechanistic interpretability experiment on Qwen3.5-35B-A3B: a routed expert (E114, layer 14) correlates with first-person self-examination register during generation. Author documents results before git release, using W/S/Q decomposition of MoE routing.
Summary generated by Claude — human-verified