William Overman
Manuscript
Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism
Oct 4, 2024