Hi authors, thanks for releasing the code! How does your method compare with SyncTalk on long template video inputs (> 60 seconds)?