Finetune Stable Diffusion Models with DDPO via TRL
Hugging Face has released a guide to finetuning Stable Diffusion models with DDPO (Denoising Diffusion Policy Optimization), which is integrated into TRL through the DDPOTrainer class. The method optimizes an image generation model against a custom reward function, so it needs only prompts and a way to score generated images rather than an additional training dataset.
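
To make the idea of a custom reward function concrete, here is a minimal sketch in plain Python. The callable shapes (a prompt function returning a prompt plus metadata, and a reward function taking images, prompts, and metadata and returning per-image scores) are an assumption modeled on TRL's DDPO example script and may differ between versions. The names `PROMPTS`, `prompt_fn`, and `brightness_reward` are hypothetical, and images are represented as nested lists of floats purely for illustration.

```python
import random

# Hypothetical prompt pool; any list of text prompts would do.
PROMPTS = ["a photo of a cat", "a photo of a dog", "a photo of a fox"]


def prompt_fn(rng=random):
    """Return one randomly chosen prompt plus an empty metadata dict."""
    return rng.choice(PROMPTS), {}


def brightness_reward(images, prompts, metadata):
    """Toy reward: the mean pixel value of each image.

    Assumption: `images` is a list of 2D grids of floats in [0, 1].
    A real setup would score image tensors with a learned model
    (for example an aesthetic predictor) instead.
    """
    rewards = []
    for image in images:
        pixels = [p for row in image for p in row]
        rewards.append(sum(pixels) / len(pixels))
    # Second return value is extra metadata; unused in this sketch.
    return rewards, {}
```

In a real run, callables of this kind would be handed to the trainer alongside a Stable Diffusion pipeline, and the reward would typically reflect something more meaningful than brightness, such as aesthetic quality or prompt alignment.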