Reddit r/LocalLLaMA

Mutating Gemma 4 31B Dense into a native Gemma 4 additive-MoE model

Signal: 35
Hype: 45
In three lines: An r/LocalLLaMA user developed a training script to convert Gemma 4 31B Dense into a native additive-MoE model, inspired by JDONE-Research/AIOne-Agent-52B-A36B-it. The project aims to add a router and experts to the existing dense model in 24 hours on B300 GPU hardware.
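
The summary does not say how the router and experts are wired in, so the following is only a rough sketch of one plausible reading of "additive MoE": the original dense feed-forward block is kept, and the gated outputs of the top-k routed experts are added on top of it. Every name here (`additive_moe`, `ffn`, `rand_matrix`, `zeros`), the ReLU activation, the top-k softmax gating, and the zero-initialized expert output projections are assumptions for illustration, not details taken from the post or from the author's training script. It is written in plain Python on toy-sized lists so it runs without any framework.

```python
import math
import random


def matvec(weights, x):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def ffn(x, w_in, w_out):
    """Two-layer feed-forward block with a ReLU in between (assumed activation)."""
    hidden = [max(0.0, v) for v in matvec(w_in, x)]
    return matvec(w_out, hidden)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def additive_moe(x, dense, experts, router, top_k=2):
    """Hypothetical additive MoE layer: dense output plus gated expert outputs."""
    out = ffn(x, *dense)                      # the original dense path is kept as-is
    logits = matvec(router, x)                # one router logit per expert
    chosen = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    gates = softmax([logits[i] for i in chosen])
    for gate, idx in zip(gates, chosen):
        expert_out = ffn(x, *experts[idx])
        out = [o + gate * e for o, e in zip(out, expert_out)]
    return out


def rand_matrix(rows, cols, rng):
    """Random matrix with entries in [-1, 1], for toy experiments only."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def zeros(rows, cols):
    """All-zero matrix, used here to zero-initialize expert output projections."""
    return [[0.0] * cols for _ in range(rows)]


if __name__ == "__main__":
    rng = random.Random(0)
    d_model, d_hidden, n_experts = 4, 8, 3
    dense = (rand_matrix(d_hidden, d_model, rng), rand_matrix(d_model, d_hidden, rng))
    # Assumption: expert output projections start at zero, so the new layer
    # initially reproduces the dense model before any training happens.
    experts = [(rand_matrix(d_hidden, d_model, rng), zeros(d_model, d_hidden))
               for _ in range(n_experts)]
    router = rand_matrix(n_experts, d_model, rng)
    x = [rng.uniform(-1.0, 1.0) for _ in range(d_model)]
    print(additive_moe(x, dense, experts, router) == ffn(x, *dense))
```

Under this reading, zero-initializing the expert output projections means the converted layer starts out behaving exactly like the dense layer, and training only has to learn what the experts contribute on top. Whether the project in the post does this is not stated in the summary.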
Gemini, Fine-tuning, Open source

Summary generated by Claude — human-verified