arXiv cs.CL

Investigating Cross-Modal Skill Injection: Scenarios, Methods, and Hyperparameters

Signal: 72
Hype: 18
In three lines: A study of cross-modal skill injection, which transfers domain-expert LLM capabilities to VLMs via model merging and so avoids expensive SFT. It systematically analyzes three aspects: scenarios, methods, and hyperparameters. Injection is strong for instruction-following and cross-lingual skills but weak for mathematical reasoning, and TA and DARE outperform the alternative merging methods.
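To make the merging idea concrete, here is a minimal sketch of the two named methods in their commonly cited form, reading TA as task arithmetic (an assumption, since the summary does not expand the acronym). It is not the paper's code: the function names `task_vector`, `dare`, and `inject`, the toy weight dictionaries, and the values for `weight` and `drop_rate` are all hypothetical, and real checkpoints would be tensors rather than Python lists.

```python
import random

# Toy "checkpoints": parameter name -> flat list of floats.
# All names and values here are illustrative assumptions.

def task_vector(expert, base):
    """Per-parameter difference between a fine-tuned expert LLM and its base LLM."""
    return {name: [e - b for e, b in zip(expert[name], base[name])]
            for name in base}

def dare(delta, drop_rate, rng):
    """DARE-style sparsification: drop each delta entry with probability
    drop_rate and rescale the survivors by 1 / (1 - drop_rate)."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    return {name: [d * scale if rng.random() >= drop_rate else 0.0
                   for d in values]
            for name, values in delta.items()}

def inject(vlm_backbone, delta, weight):
    """Task-arithmetic-style merge: add the scaled delta to the VLM's
    language backbone, which is assumed to share the same base LLM."""
    return {name: [v + weight * d
                   for v, d in zip(vlm_backbone[name], delta[name])]
            for name in vlm_backbone}

if __name__ == "__main__":
    base = {"w": [1.0, 2.0, 3.0]}
    expert = {"w": [1.5, 2.0, 2.0]}   # hypothetical domain-expert LLM
    vlm = {"w": [1.0, 2.5, 3.0]}      # hypothetical VLM language backbone

    delta = task_vector(expert, base)
    sparse_delta = dare(delta, drop_rate=0.5, rng=random.Random(0))
    print(inject(vlm, delta, weight=0.5))
    print(inject(vlm, sparse_delta, weight=0.5))
```

The merge weight and the drop rate are the kind of hyperparameters the study examines; the values used here are placeholders, not recommendations from the paper.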
Fine-tuning · Vision · Reasoning · Papers

Summary generated by Claude — human-verified