@s8phsaue
Fixed a small bug that caused the environment model to learn from incorrect rewards.
