how to set max_new_tokens? I just could find 'maxlen' var in 'main.py', and it would cost too muck time for testing.