Skip to content

Difficulties when Training RL Policies #5

@PeideHuang

Description

@PeideHuang

I tried to run the RL training scripts for multiple tasks such as Stabilize, Reach and Grasp, and Insert by
python3 main/rl/train.py task=<task_name> sim_device=cuda:<gpu_id> rl_device=cuda:<gpu_id> graphics_device_id=<gpu_id>
However, none of the RL agents successfully learn to complete the tasks even after long time (an example for ReachAndGraspSingle shown below). I used the num_envs in the default task config file. Are there any hyperparameters I need to tune?
Screenshot from 2024-07-22 13-12-59

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions