Basic Q learning training script #33

SumayT9 · 2024-02-11T18:45:44Z

Reward function needs more thought.

mIXs222

Looks good overall. Please post here if you get a promising result on actual notebooks; then we can merge this in.

When we train on actual notebooks, it may be worth parallelizing across both episodes and learners to cut training time.
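A minimal sketch of what fanning out several learners could look like, assuming each learner is independent and identified by a seed. The toy chain environment, train_learner, train_parallel, and all hyperparameters are hypothetical stand-ins, not code from this PR. Threads are used to keep the sketch simple; for CPU-bound training in CPython, concurrent.futures.ProcessPoolExecutor exposes the same map interface and is the more likely choice.

```python
import random
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Tuple

N_STATES = 5      # hypothetical toy chain: states 0..4, reward on reaching state 4
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def train_learner(seed: int, alpha: float, episodes: int = 200) -> Tuple[int, float]:
    """Train one tabular Q-learner; return (seed, mean reward per episode)."""
    rng = random.Random(seed)  # private RNG so learners do not share state
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    total = 0.0
    for _ in range(episodes):
        state = 0
        for _ in range(20):  # cap on steps per episode
            if rng.random() < 0.2:  # epsilon-greedy exploration
                action = rng.choice(ACTIONS)
            else:  # greedy action, ties broken at random
                action = max(ACTIONS, key=lambda a: (q[state][a], rng.random()))
            if action == 1:
                nxt = min(N_STATES - 1, state + 1)
            else:
                nxt = max(0, state - 1)
            reward = 1.0 if nxt == N_STATES - 1 else 0.0
            # Standard Q-learning update with discount factor 0.9
            q[state][action] += alpha * (reward + 0.9 * max(q[nxt]) - q[state][action])
            total += reward
            state = nxt
            if reward:
                break  # episode ends at the goal state
    return seed, total / episodes


def train_parallel(configs: List[Tuple[int, float]], max_workers: int = 4) -> Dict[int, float]:
    """Run one learner per (seed, alpha) config concurrently; map seed to its score."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(lambda cfg: train_learner(*cfg), configs))
```

Calling train_parallel([(0, 0.1), (1, 0.5), (2, 0.9)]) would train three learners with different learning rates and return their scores keyed by seed. Parallelizing over episodes within a single learner is harder, since the learners would share one Q-table, and is not covered by this sketch.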

mIXs222 · 2024-02-11T20:05:41Z

experiments/pod.Dockerfile

+# Copying over the simple notebook for basic training tests
+COPY ./notebooks/simple.ipynb /pod/notebooks/simple.ipynb


No need for this. Docker Compose mounts the pod directory, so notebooks/simple.ipynb will already be there.


mIXs222 · 2024-02-11T20:26:23Z

pod/model.py

+        self.history = []
+
+    def plot_rewards(self):
+        # Can't plt.show when running on docker apparently, so printing them out to plot on other machine


If you really want to visualize, you can either 1) plot and save the figure to a file, or 2) dump the rewards to CSV/JSON and plot them later.
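Both options can be sketched roughly as follows, assuming the reward history is a flat list of per-episode floats (the PR only shows self.history = [], so that shape is an assumption). The helper names dump_rewards and save_reward_plot and the output file names are hypothetical. The plotting helper selects matplotlib's non-interactive Agg backend, which renders to a file without needing a display, and returns False if matplotlib is not installed.

```python
import csv
import json
from pathlib import Path
from typing import List


def dump_rewards(history: List[float], out_dir: str) -> None:
    """Option 2: write per-episode rewards to CSV and JSON for plotting elsewhere."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with open(out / "rewards.csv", "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["episode", "reward"])
        writer.writerows(enumerate(history))  # rows of (episode index, reward)
    with open(out / "rewards.json", "w") as f:
        json.dump({"rewards": history}, f)


def save_reward_plot(history: List[float], path: str) -> bool:
    """Option 1: save a reward curve to an image file instead of calling plt.show()."""
    try:
        import matplotlib
        matplotlib.use("Agg")  # headless backend, works without a display
        import matplotlib.pyplot as plt
    except ImportError:
        return False  # matplotlib not available; fall back to dump_rewards
    fig, ax = plt.subplots()
    ax.plot(range(len(history)), history)
    ax.set_xlabel("episode")
    ax.set_ylabel("reward")
    fig.savefig(path)
    plt.close(fig)  # release the figure's memory
    return True
```

Since the pod directory is mounted into the container, files written under it should be readable from the host afterwards.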


mIXs222 · 2024-02-11T20:27:21Z

pod/train.py

+from pod.bench import Notebooks, NotebookExecutor, BenchArgs
+from pod.pickling import StaticPodPickling
+from pod.storage import DictPodStorage
+from model import QLearningPoddingModel
+from pod.stats import ExpStat
+from pod.feature import __FEATURE__
+from typing import List
+import time
+from pod.common import PodId
+from loguru import logger
+import gc
+import random
+import numpy as np


Please run make fmt and make lint.


Basic Q learning training script

498ee8c

SumayT9 requested a review from mIXs222 February 11, 2024 18:45

mIXs222 reviewed Feb 11, 2024


Sumay Thakurdesai added 6 commits February 18, 2024 21:26

Intermediate commit, pre train parallel

9285ce6

Intermediate commit, to make files viewable

87a7371

Code with inductive bias/benchmarking

4199a8e

pre alternating training

bf7031a

Alternating training start

350d4b7

train params current

c8f78d1

3 participants
