Hi @sheelabhadra thank you for sharing your code and research.
based on your paper, for the Lane following task, the baseline policies were obtained using supervised learning on image-action pairs that were collected from an expert human demonstrator.
can you provide the complete steps to run this code, especially for the MLP policy?
Thank you