94 tbr env with polar stateaction by schmidt1139 · Pull Request #96 · schmidt1139/astro_compass

schmidt1139 · 2025-12-03T00:20:10Z

Finished with obs/script changes

- generate_buffer.py takes in ephems and saves a pkl replay buffer
- pre_train_agent.py takes a replay buffer and pre-trains SAC
- train_agent.py RL-trains an untrained or pre-trained agent
- evaluate_agent.py takes a trained agent and runs through tests + evals
- README provides motivation for changes

includes moving target and linear interpolation

Pulls constructing observation from common function in rl utils
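
For reviewers, here is a minimal sketch of the hand-off described above: an ephemeris is linearly interpolated, the result is packed into a transition, and the buffer is written to and read back from a pkl file. It is illustrative only. The helper names (`interp_state`, `save_buffer`, `load_buffer`), the transition dictionary layout, and the two-row ephemeris are assumptions and are not taken from the code in this PR.

```python
import pickle
import tempfile
from bisect import bisect_right
from pathlib import Path


def interp_state(times, states, t):
    """Linearly interpolate a state vector at time t from tabulated ephemeris rows."""
    if not times[0] <= t <= times[-1]:
        raise ValueError("t is outside the ephemeris span")
    # Index of the segment [times[i], times[i + 1]] that contains t.
    i = min(bisect_right(times, t) - 1, len(times) - 2)
    w = (t - times[i]) / (times[i + 1] - times[i])
    return [(1.0 - w) * a + w * b for a, b in zip(states[i], states[i + 1])]


def save_buffer(transitions, path):
    """Write the list of transitions to a pickle file."""
    with open(path, "wb") as f:
        pickle.dump(transitions, f)


def load_buffer(path):
    """Read a list of transitions back from a pickle file."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Hypothetical two-row ephemeris: time [s] and a 2-D position.
times = [0.0, 10.0]
states = [[0.0, 100.0], [20.0, 200.0]]
midpoint = interp_state(times, states, 5.0)

# Hypothetical transition layout; the real buffer format is not shown here.
transitions = [{"obs": midpoint, "action": [0.0], "reward": 0.0, "done": False}]

buffer_path = Path(tempfile.mkdtemp()) / "replay_buffer.pkl"
save_buffer(transitions, buffer_path)
restored = load_buffer(buffer_path)
```

Keeping the buffer as a plain pickled list is only one option; the point of the sketch is that the pre-training script can consume whatever the generation script wrote without re-reading the ephems.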

schmidt1139 and others added 30 commits November 25, 2025 00:54

mass commit

b9bff6f

reg test data

9c73d83

11/26 work day commit

773ab32

Update .gitignore to include pre-training-data + output files

2cb37e2

Format with ruff

b6f117b

Add requirements for tensorboard and toml files

87fae88

Add toml config files for easier handling

1e37252

Only checkpoint in pre-training if model is improvement

8f62e61

Remove replay buffer saving from import fcn

9ba6b32

Refactor plotting into more modular fcns

65b0a19

Update "_r_component" to "_reward"

7cb5cae

Ensure the reward components get the same color

7bea438

Update plotting routine for evaluate agent

fae78fd

Delay learning by 50,000 steps to get some good exploration

6c4f268

Aggressively extend observation space to maximize probability of learning

fc01a34

Dramatically simplify reward function + fix truncation error

8caba80

Squash

c495463

Updated params for best model yet

a649149

test data updates, removing big files 2

63ec3a6

ephem v3 class

9465075

includes moving target and linear interpolation

handling new ephem type in read ephems function

805bf94

slimming down test data

9e35aa9

Reducing size of replay buffers

e522e13

Adding fast replay buffer test

318572c

fast replay buffer seeding

e77e5d4

Correctly setting device for model loaded from file

1576d76

Script converts from ephem v2 to v3

8fd36e5

Adding binary file compare

046b3b3

Adding fast functions for seeding replay buffer

28ba7ea

schmidt1139 added 29 commits November 27, 2025 20:18

Correcting polar env test

9e358e2

Polar env obs updates

2e6d2cc

Pulls constructing observation from common function in rl utils

Env updates to use common obs function + bug fixes

536daeb

truth file update

8401e1f

Updating rollout function to account for new obs

bd86239

Plotting updates/corrections

5f67fbf

Updates for fast/scalable buffer generation

fdb07c4

Updating compute_reward_fast with John's updates

641a589

Separating configs (all identical at the moment)

15f25f9

updating test path

ef8d50b

add data path from config

4ebcb6e

Saving configs to script output

8eeb503

Bug fix: handling new obs size for pre-training

7577b1e

adding pre-train agent config

a9aaa5a

Updates to eval agent script

dd1eb7d

removing live updates from rollout function

a9d1c08

Adding position and velocity residuals to rollout data

3e28358

Adding mc eval with histogram plotting

ab490b7

update to eval config

425f1a9

added position rollout plot

a5b07f5

limiting internal threads

9fea710

Adding ttg back into reward func

c71b728

Correcting reward components not getting plotted

5660bb6

adding replay buffer checkpointing

292304d

Plotting replay buffer script

f169ae2

Ecc bug fix, and adding back terminal condition

a5a77a0

pre-train update

b62ef16

buffer gen updates

516310c

Bug fix with setting device

4d1c997

schmidt1139 requested a review from MartinAstro December 3, 2025 00:20

schmidt1139 wants to merge 65 commits into 92-tbr-sac-script from 94-tbr-env-with-polar-stateaction
