Returaj Burnwal, Nirav Pravinbhai Bhatt, Balaraman Ravindran, SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Preprint link: https://arxiv.org/abs/2511.08136