Allow the user to pass a torch device into `Super`. If the device is not `None`, the model and tensors are moved to it during training.
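
A minimal sketch of the behaviour described above. This is not the actual `Super` implementation: the constructor signature, the `train` method, the `_to_device` helper, and the SGD/MSE training loop are all assumptions made for illustration. Only the optional `device` argument and the `None` check come from the description.

```python
import torch
from torch import nn


class Super:
    """Hypothetical stand-in for the real `Super`; only device handling is shown."""

    def __init__(self, model, device=None):
        self.model = model
        # Accept a string such as "cpu" or a torch.device.
        # None means "leave tensors and model where they already are".
        self.device = torch.device(device) if device is not None else None

    def _to_device(self, tensor):
        # Hypothetical helper: a no-op when no device was requested.
        return tensor if self.device is None else tensor.to(self.device)

    def train(self, inputs, targets, epochs=1, lr=0.01):
        # Assumed training loop (SGD + MSE), used only to show where the moves happen.
        if self.device is not None:
            self.model.to(self.device)  # nn.Module.to moves parameters in place
        inputs, targets = self._to_device(inputs), self._to_device(targets)
        optimizer = torch.optim.SGD(self.model.parameters(), lr=lr)
        loss_fn = nn.MSELoss()
        last_loss = None
        for _ in range(epochs):
            optimizer.zero_grad()
            loss = loss_fn(self.model(inputs), targets)
            loss.backward()
            optimizer.step()
            last_loss = loss.item()
        return last_loss
```

With this shape, a caller could write `Super(model, device="cuda")` to train on a GPU, or omit `device` to keep the existing placement unchanged.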