When no render pass is already active, it is more performant to emulate vertex-stage stream-out in a compute pass.
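
The decision described above can be sketched as a small selection function. This is only an illustration: `PassKind`, `StreamOutPath`, and `chooseStreamOutPath` are hypothetical names, not part of any real graphics API. The sketch also assumes that, when a render pass is already open, the emulation stays in that pass.

```cpp
// Hypothetical types for illustration only; not taken from any real API.
enum class PassKind { None, Render, Compute };

enum class StreamOutPath { VertexInRenderPass, ComputePass };

// Pick how to emulate vertex-stage stream-out.
// Assumption: if a render pass is already active we stay in it;
// in every other case the compute pass is the preferred path.
constexpr StreamOutPath chooseStreamOutPath(PassKind active) {
    return active == PassKind::Render ? StreamOutPath::VertexInRenderPass
                                      : StreamOutPath::ComputePass;
}
```

A caller would check the currently active pass once, before encoding the stream-out work, and branch on the returned value.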