Given one video sequence(not in VOT or OTB) and the bounding box of target in first frame, how to write code for tracking task? Thanks a lot.