Complete steps to run this code

Hi @sheelabhadra, thank you for sharing your code and research.

According to your paper, the baseline policies for the lane-following task were obtained using supervised learning on image-action pairs collected from an expert human demonstrator.
Could you provide the complete steps to run this code, especially for the MLP policy?
Thank you.