Can you tell me if this model generates results without sound, and if it will subsequently generate videos with sound?