Conversation

Collaborator

@SumayT9 SumayT9 commented Feb 11, 2024

Reward function needs more thought.

@SumayT9 SumayT9 requested a review from mIXs222 February 11, 2024 18:45
Contributor

@mIXs222 mIXs222 left a comment

Looks good overall. Please post if you have a promising result on actual notebooks; then we can merge this in.

When we train on actual notebooks, it may be worth parallelizing training across both episodes and learners to reduce training time.
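
As a rough illustration of the "parallel learners" idea, here is a minimal sketch that trains several independent learners concurrently and collects their results. Everything in it is an assumption for illustration: `train_learner` is a hypothetical toy epsilon-greedy value update, not the PR's `QLearningPoddingModel`, and `train_in_parallel` is a hypothetical helper. Threads are used only to keep the sketch simple; for CPU-bound training, `ProcessPoolExecutor` from the same module would likely be the better fit.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def train_learner(seed, episodes=200):
    """Hypothetical stand-in for one learner: epsilon-greedy updates on a 2-armed toy task."""
    rng = random.Random(seed)  # per-learner RNG so runs are independent and reproducible
    q = [0.0, 0.0]             # value estimate for each action
    alpha, epsilon = 0.1, 0.1  # learning rate and exploration rate (arbitrary choices)
    for _ in range(episodes):
        if rng.random() < epsilon:
            action = rng.randrange(2)         # explore
        else:
            action = 0 if q[0] >= q[1] else 1  # exploit the current estimate
        reward = 1.0 if action == 1 else 0.0   # toy reward: only action 1 pays off
        q[action] += alpha * (reward - q[action])
    return seed, q


def train_in_parallel(seeds, max_workers=4):
    """Run one learner per seed concurrently; results come back in seed order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(train_learner, seeds))


if __name__ == "__main__":
    for seed, q in train_in_parallel([0, 1, 2, 3]):
        print(seed, q)
```

Parallelizing across episodes within a single learner is harder, since the episodes share one value table; that part would need its own design and is not covered by this sketch.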

Comment on lines 39 to 40
# Copying over the simple notebook for basic training tests
COPY ./notebooks/simple.ipynb /pod/notebooks/simple.ipynb
Contributor

No need for this. Docker Compose mounts the pod directory, so notebooks/simple.ipynb will already be there.

pod/model.py Outdated
        self.history = []

    def plot_rewards(self):
        # Can't plt.show when running on docker apparently, so printing them out to plot on other machine
Contributor

If you really want to visualize, you can 1) plot and save the figure to a file, or 2) dump the rewards to CSV/JSON and plot later.
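
A minimal sketch of both options, assuming the rewards are a flat list of per-episode numbers (the exact shape of `self.history` is not shown in this PR). `dump_rewards` and `save_reward_plot` are hypothetical helper names, not existing pod functions. The plotting helper uses matplotlib's non-interactive `Agg` backend, which renders to a file without needing a display, so it should work inside a container.

```python
import csv
import json
from pathlib import Path


def dump_rewards(history, out_dir):
    """Option 2: write per-episode rewards to rewards.json and rewards.csv in out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    json_path = out / "rewards.json"
    csv_path = out / "rewards.csv"
    json_path.write_text(json.dumps(list(history)))
    with csv_path.open("w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["episode", "reward"])
        writer.writerows(enumerate(history))  # rows of (episode index, reward)
    return json_path, csv_path


def save_reward_plot(history, png_path):
    """Option 1: render off-screen and save a PNG; returns False if matplotlib is missing."""
    try:
        import matplotlib
        matplotlib.use("Agg")  # non-interactive backend, no display required
        import matplotlib.pyplot as plt
    except ImportError:
        return False
    fig, ax = plt.subplots()
    ax.plot(list(history))
    ax.set_xlabel("episode")
    ax.set_ylabel("reward")
    fig.savefig(png_path)
    plt.close(fig)  # release the figure so repeated calls do not accumulate memory
    return True
```

Writing into the mounted pod directory would make the files visible on the host, assuming the Compose mount mentioned earlier in this review is writable.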

pod/train.py Outdated
Comment on lines 1 to 13
from pod.bench import Notebooks, NotebookExecutor, BenchArgs
from pod.pickling import StaticPodPickling
from pod.storage import DictPodStorage
from model import QLearningPoddingModel
from pod.stats import ExpStat
from pod.feature import __FEATURE__
from typing import List
import time
from pod.common import PodId
from loguru import logger
import gc
import random
import numpy as np
Contributor

make fmt
make lint


3 participants