The same issue (ValueError: NaN in actions) pops up when I run elevation training with Wandb visualization disabled. See parent issue for what the error looks like.