I downloaded the ptm file you uploaded to OneDrive, and it seems to be the checkpoint at 700 iterations. However, the code defaults to 8000 iterations.
parser.add_argument('--iter', default='8000', type=int,
help='iter: iteration of the checkpoint to load. Default: 8000')
Is 700 the best model? Thanks