This is the official PyTorch implementation of the paper "Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization". Our approach, CP3ER, significantly enhances the stability and performance of visual reinforcement learning models.
To install the required packages for DeepMind Control Suite and Metaworld, please run the following commands:
conda env create -f cp3er.yaml # for dmc
conda env create -f cp3ermw.yaml # for metaworld
Then, install the Metaworld package:
conda activate cp3ermw
cd Metaworld
pip install -e .
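To verify the installation, you can run a quick sanity check inside the cp3ermw environment. This is a minimal sketch of our own, assuming the standard Metaworld MT1 API; it is not a script shipped with this repo:
import metaworld

mt1 = metaworld.MT1('assembly-v2')        # construct the single-task benchmark
env = mt1.train_classes['assembly-v2']()  # instantiate the environment
env.set_task(mt1.train_tasks[0])          # a task must be set before reset
obs = env.reset()                         # newer gym versions return (obs, info)
print('Metaworld installed correctly')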
To train an agent on a DeepMind Control Suite (DMC) task, run:
python train.py task=acrobot_swingup
You can decide whether to log your experiments with wandb by setting the 'use_wb' parameter, and choose the training device with the 'device' parameter. For the full list of options, see the cfgs/config.yaml file. For example:
python train.py task=cheetah_run device=cuda:1 use_wb=True seed=1
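To launch several seeds of the same task in sequence, a small helper like the following can be used. This launcher is our own illustration built only on the train.py parameters shown above, not part of the repo:
import subprocess

for seed in range(1, 4):
    # same overrides as the example command above, varying only the seed
    subprocess.run(
        ['python', 'train.py', 'task=cheetah_run',
         'device=cuda:1', 'use_wb=True', f'seed={seed}'],
        check=True,  # abort the sweep if any run fails
    )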
Similarly, you can run the following script to train in Metaworld:
python train_mw.py task=assembly-v2
If you find our research helpful and would like to reference it in your work, please consider citing the paper as follows:
# arXiv version
@misc{li2024generalizingconsistencypolicyvisual,
  title={Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization},
  author={Haoran Li and Zhennan Jiang and Yuhui Chen and Dongbin Zhao},
  year={2024},
  eprint={2410.00051},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2410.00051},
}
# NeurIPS version
@inproceedings{li2024generalizing,
  title={Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization},
  author={Haoran Li and Zhennan Jiang and Yuhui Chen and Dongbin Zhao},
  booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
  year={2024},
  url={https://openreview.net/forum?id=MOFwt8OeXr}
}
CP3ER is licensed under the MIT license. MuJoCo and the DeepMind Control Suite are licensed under the Apache 2.0 license. We thank the DrQ-v2 authors for open-sourcing their codebase; our implementation builds on top of their repository.