GitHub - bruno-c/suno-wav-investigation: what happens when you click "download WAV" on suno? one slightly bored man avoids doing what he's supposed to be doing right now, and tries to find out once and for all.

oh suno, why you gotta go and make things so complicated. just tell us how the stuff works pls. more info about the model, how it was trained, which text encoder and how, etc. this will make your (advanced) users so much happier and more productive.

my hypothesis

ps: complete conjecture, no connection to suno and absolutely not an expert on this matter, just a guy who spends too much time on generative models of all kinds (with quite a bit of programming and audio experience)

suno has a fast but lower quality decoder that is uses to convert the latent into an mp3 stream. it's less GPU intensive and enables the extraordinarily fast streaming feature, even before the song is done generating
when the song is fully generated, suno saves the full latent information of the generated song to disk. this, in theory, is a tiny amount of data.
suno has a higher quality, GPU intensive decoder that can't be used for streaming. when you trigger the "download WAV" command, it uses the previously saved latent and decodes it and creates the file

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
1_source_mp3s		1_source_mp3s
2_self_decoded_mp3_wavs		2_self_decoded_mp3_wavs
3_suno_generated_wavs		3_suno_generated_wavs
.gitignore		.gitignore
README.md		README.md
complex.gif		complex.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages