arXiv cs.CL

Evaluation Drift in LLM Personality Induction: Are We Moving the Goalpost?

Signal: 72
Hype: 15
In three lines: A study of personality induction in LLMs via fine-tuning (SFT, DPO, ORPO) on long-form essays linked to Big Five profiles. Fine-tuning reduces the variance of responses to the IPIP-NEO questionnaire, but accuracy on full personality profiles remains near chance. Unguided essays lack sufficient cues for faithful personality expression.
Fine-tuning · Evals · Alignment

Summary generated by Claude — human-verified