Reinforcement Learning

Preference-Based Reinforcement Learning