arXiv cs.LG

Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence

Signal: 72
Hype: 15
In three lines
A decision-support framework for digital therapeutics that models both recommendation and adherence effects with linear dynamical systems.
The proposed UCB-BOLD algorithm selects treatments online with sublinear regret guarantees.
Evaluated on micro-randomized trial data, it shows 2-3x lower conditional value-at-risk (CVaR) regret than benchmarks.
Reinforcement learning, Reasoning, Evals, Papers

Summary generated by Claude — human-verified