What hyperparameters are used for the Rainbow ablations? #11

Very nice work!

Would it be possible to share the exact hyperparameters used for the ablations on the Rainbow environment? I am trying to reproduce these results using the smaller, default Transformer size (3 layers, 128 dim, 8 heads). However, I find that most problems are solved at exactly 64 nodes expanded. (Interestingly, there also seems to be a jump at 64 nodes expanded in Figure 4 of the paper.)

Here is what my current results look like:

![image](https://github.com/user-attachments/assets/903235c1-0019-435c-a94f-4bd964b2b01a)

Thanks so much!

