Skip to content

Conversation

@lukasmolnar
Copy link
Contributor

Issue ticket number and link

Fixes # (issue)

Describe your changes

Please include a summary of the change, including why you did this, and the desired effect.

Instructions for reviewers

Indicate anything in particular that you would like a code-reviewer to pay particular attention to.
Indicate steps to actually test code, including CLI instructions if different than usual.
Point out the desired behavior, and not just the "check that this appears" (otherwise the code reviewer will be lazy and just verify what you've already verified).

Checklist before requesting a review

  • This is expected to break regression tests.
  • I have assigned a reviewer
  • I have added the PR to the project, and tagged with with priority
  • If it is a core feature, I have added tests.
  • I have set up pre-commit hooks with ruff, or run ruff format . manually

sheim added 30 commits March 25, 2024 15:54
Comment: last_values doesn't seem to have much of an impact on performance at all. However, for off-policy algorithms, having access to the next state (obs) explicitly starts to be a lot handier. Alternative is to do some data juggling to pull it out, which is a bit tedious and probably needs to be stored in a tensor anyway to allow the batch generation
- data was not being added correctly
- runner was indexing along num_envs instead of time
- use data getter to handle overfill
Base automatically changed from SAC to dev July 11, 2024 00:16
@lukasmolnar lukasmolnar changed the base branch from dev to lm/PROJECT_dev July 17, 2024 19:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants