Skip to content
This repository was archived by the owner on Nov 2, 2018. It is now read-only.
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions levers/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,10 +10,10 @@ For more details about the task, see our [paper](https://arxiv.org/abs/1605.0773
## Usage
The code uses Torch and run on a GPU for speed-up. To start training, run

th lever.lua --reward_only --comm
th levers.lua --reward_only --comm

Here, option `--comm` enables a communication between the agents, and `--reward_only` picks a reinforcement learning instead of supervised learning.

The following script can be used to replicated the experiments in the paper.

th lever.lua --batchsize 512 --lr 10 --clip .01 --hdim 64 --apg 5 --nlevers 5 --reward_only --maxiter 100000 --comm
th levers.lua --batchsize 512 --lr 10 --clip .01 --hdim 64 --apg 5 --nlevers 5 --reward_only --maxiter 100000 --comm