same random seed for train_env and test_env

Hi there,

I have two questions regarding the `test_env`:

1. Why did you only have `test_env` for sac, not for ppo and trpo?
2. In `safe_rl.sac.sac.py line 273` you set the seeds of env and test_env using the same seeds, then test_env would be the same as the training envs, right? Is the purpose of test_env only testing the deterministic actions, not at all the generalization of the policy?

```
# Setting seeds
    tf.set_random_seed(seed)
    np.random.seed(seed)
    env.seed(seed)
    test_env.seed(seed)
```

Thank you very much in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

same random seed for train_env and test_env #3

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

same random seed for train_env and test_env #3

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions