Guiding Offline Reinforcement Learning Using a Safety Expert
Offline reinforcement learning is used to train policies in situations where it is expensive or infeasible to access the environment during training. An agent trained under such a scenario does not get corrective …
