Back to feed
arXiv cs.AI·

NGM: A Plug-and-Play Training-Free Memory Module for LLMs

Signal
72
Hype
25
In three linesNGM is a training-free memory module for LLMs using a Causal N-Gram Encoder and Cosine-Gated Memory Injector. Tested on Qwen3 (0.6B-14B), it improves average performance by 0.5-1.2 points, with notable gains on code generation (+3.0 LiveCodeBench) and knowledge-intensive tasks (+3.03 GPQA).
Read source
Your take?
QwenCode generationReasoningPapers

Summary generated by Claude — human-verified