As the title say. Our simulator already support online learning by default. Adding Q-learning should not be that hard