RL4design

This git repo contains the code used for Reinforcement learning for freeform robot design.

Generation process	Result

Bibtex

@article{li2023reinforcement,
  title={Reinforcement learning for freeform robot design},
  author={Muhan Li and David Matthews and Sam Kriegman},
  journal={arXiv preprint arXiv:2310.05670},
  year={2023}
}

Installation

First make sure that you have installed voxcraft-viz, this program is required if you want to visualize simulated robots in voxcraft environment using the navigator function, otherwise you may skip this step since it doesn't affect training or other visualization functions.

And make sure that you have installed conda or miniconda, then update CONDA_PATH in install.sh

Finally, just run install.sh, or copy commands to your terminal and execute them.

Start training

We have tested the program on NVIDIA 10 series (1080Ti), 30 series (RTX 3060) and 40 series (RTX4080, H100) GPUs. GPUs earlier than 10 series are not supported. To customize configurations for training, go to config.py under each experiment directory, then start training with following commands:

export PYTHONPATH=`pwd`

# To optimize robots for moving further in the voxcraft environment 
python main.py -r experiments/vec_patch_voxcraft/optimize_rl.py

# To optimize robots for achieving various shape requirements (eg: bigger volume)
python main.py -r experiments/vec_patch_shape/optimize_rl.py

Before running the experiment, you will be prompted to add a comment to the experiment using your preferred editor, which will appear later in the navigator:

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Options:
0) Draw metrics for multiple trials
1) Draw metrics for single trial
Choice:1
Trials
0) /home/mlw0504/RL4design_results/CustomPPO_2023-10-12_18-22-02/CustomPPO_VoxcraftSingleRewardVectorizedPatchEnvironment_26445_00000_0_2023-10-12_18-22-02
    comment: <The comment added by you>
    reward: 6.665

A snapshot of the code directory will be saved along with experiment data. eg: /home/<user name>/RL4design_results/CustomPPO_2023-10-12_18-22-02/CustomPPO_VoxcraftSingleRewardVectorizedPatchEnvironment_26445_00000_0_2023-10-12_18-22-02/code

Visualization

To run visualizations for robot metrics, rewards, draw the process of robot generation, etc, you may use the following commands. Navigator is a versatile built-in program for such functionalities.

export PYTHONPATH=`pwd`

python main.py -n <you may optionally specify path to saved results>

Example command line interface:

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Options:
0) Draw metrics for multiple trials
1) Draw metrics for single trial
Choice:0
Options:
0) Draw aggregated reward curve for multiple trials
1) Draw separate reward curves for each trial
2) Draw aggregated robot metric curves for multiple trials
3) Draw volume task result (For reproducing figures in paper)
4) Draw voxcraft task result (For reproducing figures in paper)
Choice:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
experiments		experiments
images		images
launch		launch
navigator		navigator
renesis		renesis
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RL4design

Bibtex

Installation

Start training

Visualization

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

iffiX/RL4design

Folders and files

Latest commit

History

Repository files navigation

RL4design

Bibtex

Installation

Start training

Visualization

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages