arXiv cs.LG · 9 June 2026

Training-Inference Kernel Contracts: Bounding Divergence in Post-Training and Deployment

In three lines

A theoretical paper proposes kernel contracts that bound the divergence between training and inference kernels in post-training. The framework specifies the acceptable gap under finite precision through numerical, statistical, and routing clauses. It derives bounds that carry logit drift through to total-variation distance and applies them to policy-gradient bias in RL.
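To make the step from logit drift to total-variation distance concrete, here is a minimal sketch. It does not reproduce the paper's bounds, which this summary does not state. It assumes a worst-case per-logit drift `eps` and uses the elementary fact that, when every logit moves by at most `eps`, each softmax probability ratio lies in [e^(-2·eps), e^(2·eps)], so TV ≤ (e^(2·eps) − 1)/2. The function names, vocabulary size, and noise model are all hypothetical choices made for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def total_variation(p, q):
    """Total-variation distance between two discrete distributions."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


def tv_bound_from_logit_drift(eps):
    """Elementary bound (an assumption of this sketch, not the paper's result).

    If max_i |z_i - z'_i| <= eps, every probability ratio q_i / p_i lies in
    [exp(-2 eps), exp(2 eps)], hence TV <= (exp(2 eps) - 1) / 2, capped at 1.
    """
    return min(1.0, 0.5 * math.expm1(2.0 * eps))


def worst_observed_tv(eps, vocab=32, trials=200, seed=0):
    """Largest TV seen when 'inference' logits drift by at most eps per entry."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        train_logits = [rng.gauss(0.0, 3.0) for _ in range(vocab)]
        # Hypothetical drift model: independent uniform noise in [-eps, eps].
        infer_logits = [z + rng.uniform(-eps, eps) for z in train_logits]
        tv = total_variation(softmax(train_logits), softmax(infer_logits))
        worst = max(worst, tv)
    return worst


if __name__ == "__main__":
    for eps in (1e-3, 1e-2, 1e-1):
        print(eps, worst_observed_tv(eps), tv_bound_from_logit_drift(eps))
```

Under these assumptions the observed distance never exceeds the bound, which is loose for random drift because it is a worst-case statement over all drifts of size `eps`.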

Reinforcement learning Papers Alignment

Summary generated by Claude — human-verified
