Logging losses and reward with Tensorboard

Hi,

I tried to log policy and critic losses as well as reward using Tensorboard. I run training using default setting with sz50.

I noticed that **critic losses** keep **increasing**. Does this even make sense?

![tensorboard](https://user-images.githubusercontent.com/26178460/209073120-4f58b5a6-9f5c-4fee-bd60-28c031aacf22.png)

I wonder is there any issue with the code regarding critic losses, could you please have a check/comment on this.

Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Logging losses and reward with Tensorboard #23

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Logging losses and reward with Tensorboard #23

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions