Smooth Exploration #6

lukasmolnar · 2024-02-28T18:33:54Z

Issue ticket number and link

Fixes # (issue)

Describe your changes

Please include a summary of the change, including why you did this, and the desired effect.

Instructions for reviewers

Indicate anything in particular that you would like a code-reviewer to pay particular attention to.
Indicate steps to actually test code, including CLI instructions if different than usual.
Point out the desired behavior, and not just the "check that this appears" (otherwise the code reviewer will be lazy and just verify what you've already verified).

Checklist before requesting a review

This is expected to break regression tests.
I have assigned a reviewer
I have added the PR to the project, and tagged it with priority
If it is a core feature, I have added tests.
I have set up pre-commit hooks with ruff, or run ruff format . manually

sheim

Mostly just code style comments for now : )

learning/modules/actor.py

learning/modules/actor_critic.py

learning/modules/utils/gSDE.py

lukasmolnar · 2024-03-15T18:46:18Z

learning/modules/smooth_actor.py

-        return (
-            torch.ones(self.latent_dim, self.num_actions).to(self.log_std.device) * std
-        )
+        return torch.ones(self.latent_dim, 1).to(self.log_std.device) * std


I need to double-check this. It could be a bug in stable-baselines: the way they had it, the number of parameters does not seem to be reduced (by default it has shape (latent_dim, num_actions)).
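To make the shape question concrete, here is a minimal sketch, not the code from this PR. It assumes the learnable log_std is stored with shape (latent_dim, 1) and that the per-action noise variance follows the usual gSDE form, latent**2 @ std**2. The sizes and the names std_expanded, std_column, variance_expanded and variance_column are hypothetical and chosen only for illustration.

```python
import torch

# Illustrative sizes only; these are assumptions, not values from the PR.
latent_dim, num_actions = 8, 3

# Assumed reduced parameterization: one learnable log-std per latent feature.
log_std = torch.nn.Parameter(torch.zeros(latent_dim, 1))
std = torch.exp(log_std)

# Old return value: broadcast the single column to one column per action.
std_expanded = torch.ones(latent_dim, num_actions).to(log_std.device) * std
# New return value: keep the single column.
std_column = torch.ones(latent_dim, 1).to(log_std.device) * std

# gSDE-style action-noise variance for a batch of latent features.
torch.manual_seed(0)
latent = torch.randn(5, latent_dim)
variance_expanded = (latent**2) @ (std_expanded**2)  # shape (5, num_actions)
variance_column = (latent**2) @ (std_column**2)      # shape (5, 1)

# The parameter count is latent_dim in both cases; only the returned shape differs.
print(log_std.numel())
print(std_expanded.shape, std_column.shape)
# Every action shares the same variance, so the column form carries the same information.
print(torch.allclose(variance_expanded, variance_column.expand(-1, num_actions)))
```

Under these assumptions, the expansion to (latent_dim, num_actions) changes only the shape of the returned tensor, not the number of learnable parameters. Whether the (latent_dim, 1) return value is safe depends on how downstream code consumes the result, which this sketch does not cover.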

smooth noise sampling and started gSDE

bb85463

sheim reviewed Feb 28, 2024

lukasmolnar added 8 commits March 1, 2024 10:41

adress comments

243cf0f

start moving things to SmoothActor

d838dc6

moved everything to SmoothActor (it runs)

aea5d88

possibly resample in PPO update

3603205

learn_features=True and correct sample dim

bb0a273

adjust sample freq and update plotting

40025c4

Merge branch 'dev' into lm/smooth-exploration

41e9dea

update log_std_init=0.0 and refactor

c301746

lukasmolnar commented Mar 15, 2024

lukasmolnar and others added 13 commits April 3, 2024 14:01

log joint data for training and play

a372a52

update logging and plotting

599b2ca

plot FT script

4994550

added sweep config

0205d99

Merge remote-tracking branch 'origin/dev' into lm/smooth-exploration

a68d7ce

update on policy and old policy runners, get nans for log_probs

64b9b14

run 200 iterations before starting training, to burn in normalization

91656e9

Merge pull request #15 from mit-biomimetics/sh/smooth-exploration

c0a6d61

update dummy input in export_network

f74e7fe

good choice of params: sample 16, rollout 32, LR x1.1, des_kl 0.02

18045e9

export latent network and update configs

adfb078

export latent net with norm and log get_std

8ff82ae

export actor std to txt file

007ea93

lukasmolnar changed the base branch from dev to SAC July 2, 2024 19:00

Base automatically changed from SAC to dev July 11, 2024 00:16
