arXiv cs.LG·5 June 2026

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs

Signal

Hype

In three linesA study reveals that in zeroth-order (ZO) optimization for LLM fine-tuning, a single decoding layer dominates adaptation. Fine-tuning this dominant layer alone matches or exceeds full-model ZO fine-tuning on LLaMA2-7B and Qwen3-8B, with speedup up to 4.52×. The dominant layer is identifiable before training via activation-outlier analysis.

Read source

Your take?

Fine-tuning Reasoning Benchmarks Llama

Summary generated by Claude — human-verified

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs

Other angles on this story