Skip to content

Debug/pipeline#25

Open
muzi2018 wants to merge 35 commits into Ericonaldo:main from
muzi2018:debug/pipeline
Open

Debug/pipeline#25
muzi2018 wants to merge 35 commits into Ericonaldo:main from
muzi2018:debug/pipeline

Conversation

@muzi2018
Copy link

@muzi2018 muzi2018 commented Sep 5, 2025

No description provided.

Ericonaldo and others added 30 commits August 13, 2025 11:37
  maxEpisodeLength: 200 #200
    approaching: 5
reward:
  base_height_target: 0.55
  only_positive_rewards: False
  scales:
    approaching: 5.0
    lifting: 1.0
    pick_up: 5.0 # 0.5
    acc_penalty: -0.001
    command_penalty: -1.0
    command_reward: 0.25
    standpick: 0.25 # not found in reward_vec_task.py
    action_rate: -0.001
    ee_orn: 0.05 # 0.01
    base_dir: 0.25
    rad_penalty: 0.0
    base_ang_pen: 0.0
    base_approaching: 0.01 # 0.05
    grasp_base_height: 0.5 # not found in reward_vec_task.py
    gripper_rate: -0.1 # -0.1
policy 4
reward:
  base_height_target: 0.55
  only_positive_rewards: False
  scales:
    approaching: 5.0
    lifting: 1.0
    pick_up: 5.0 # 0.5
    acc_penalty: -0.001
    command_penalty: -1.0
    command_reward: 0.25
    standpick: 0.25 # not found in reward_vec_task.py
    action_rate: -0.001
    ee_orn: 0.05 # 0.01
    base_dir: 0.25
    rad_penalty: 0.0
    base_ang_pen: 0.0
    base_approaching: 0.01 # 0.05
    grasp_base_height: 0.5 # not found in reward_vec_task.py
    gripper_rate: -0.1 # -0.1
policy 4
reward:
  base_height_target: 0.55
  only_positive_rewards: False
  scales:
    approaching: 10.0
    lifting: 1.0
    pick_up: 5.0 # 0.5
    acc_penalty: -0.001
    command_penalty: -1.0
    command_reward: 0.25
    standpick: 0.25 # not found in reward_vec_task.py
    action_rate: -0.001
    ee_orn: 0.05 # 0.01
    base_dir: 0.25
    rad_penalty: 0.0
    base_ang_pen: 0.0
    base_approaching: 0.01 # 0.05
    grasp_base_height: 0.5 # not found in reward_vec_task.py
    gripper_rate: -0.1 # -0.1
reward:
  base_height_target: 0.55
  only_positive_rewards: False
  scales:
    approaching: 5.0
    lifting: 1.0
    pick_up: 5.0 # 0.5
    acc_penalty: -0.001
    command_penalty: -1.0
    command_reward: 0.25
    standpick: 0.25 # not found in reward_vec_task.py #
    action_rate: -0.001
    ee_orn: 0.1 # 0.01
    base_dir: 0.25
    rad_penalty: 0.0
    base_ang_pen: 0.0
    base_approaching: 0.01 # 0.05
    grasp_base_height: 0.5 # not found in reward_vec_task.py
    gripper_rate: -0.1 # -0.1
reward:
  base_height_target: 0.55
  only_positive_rewards: False
  scales:
    approaching: 5.0
    lifting: 1.0
    pick_up: 5.0 # 0.5
    acc_penalty: -0.001
    command_penalty: -1.0
    command_reward: 0.25
    standpick: 0.25 # not found in reward_vec_task.py #
    action_rate: -0.001
    ee_orn: 0.1 # 0.01
    base_dir: 0.25
    rad_penalty: 0.0
    base_ang_pen: 0.0
    base_approaching: 0.01 # 0.05
    grasp_base_height: 0.5 # not found in reward_vec_task.py
    gripper_rate: -0.1 # -0.1
    def _reward_base_dir(self, obj_pos):
        """Reward the cosine alignment between a base-frame reference axis
        (rotated into the world by the base yaw quaternion) and the
        direction from the robot base to the object.

        Args:
            obj_pos: world-frame object positions, one row per env
                (presumably shape (num_envs, 3) — TODO confirm).

        Returns:
            Tuple ``(rew, rew)``: the per-env cosine-similarity reward,
            duplicated (zero wherever the object is closer than 1 cm).
        """
        # NOTE(review): the reference axis here is [0, 0, 1] (z). A yaw-only
        # quaternion leaves the z axis unchanged, and the x/y components of
        # the object direction are zeroed below, so this reduces to the sign
        # of the vertical offset. If horizontal heading alignment was
        # intended, the axis should probably be [1, 0, 0] with the z
        # component zeroed instead — confirm with the author.
        ref_axis = torch.tensor([0., 0., 1.], device=self.device).repeat(self.num_envs, 1)
        heading_world = quat_apply(self.base_yaw_quat, ref_axis)

        # Vector from base to object with horizontal components suppressed.
        to_obj = obj_pos - self._robot_root_states[:, :3]
        to_obj[:, :2] = 0.
        dist = torch.norm(to_obj, dim=-1)

        # Normalize only where the distance is numerically safe (>= 1 cm).
        valid = dist >= 0.01
        unit_to_obj = to_obj[valid] / dist[valid].unsqueeze(-1)

        reward = torch.zeros(self.num_envs, device=self.device, dtype=torch.float)
        reward[valid] = F.cosine_similarity(heading_world[valid], unit_to_obj)

        return reward, reward
    def _reward_base_dir(self, obj_pos):
        """Reward the cosine alignment between a base-frame reference axis
        (rotated into the world frame by the base yaw quaternion) and the
        unit direction from the robot base to the object.

        Args:
            obj_pos: world-frame object positions, one row per env
                (presumably shape (num_envs, 3) — TODO confirm).

        Returns:
            Tuple ``(rew, rew)``: the per-env reward, duplicated; zero for
            envs where the object is closer than 1 cm to the base.
        """
        # NOTE(review): the reference axis is [0, 0, 1] (z). A yaw-only
        # quaternion leaves the z axis unchanged, and obj_dir's x/y
        # components are zeroed below, so this reduces to the sign of the
        # vertical offset. If horizontal heading alignment was intended, the
        # axis should probably be [1, 0, 0] with z zeroed — confirm.
        base_x_dir = torch.tensor([0., 0., 1.], device=self.device).repeat(self.num_envs, 1)
        base_x_dir_world = quat_apply(self.base_yaw_quat, base_x_dir)
        # Base-to-object vector with horizontal components suppressed.
        obj_dir = obj_pos - self._robot_root_states[:, :3]
        obj_dir[:,:2] = 0.
        obj_dist = torch.norm(obj_dir, dim=-1)

        # Only normalize where the distance is numerically safe (>= 1 cm).
        safe_dis = obj_dist >= 0.01
        obj_dir_unit = obj_dir[safe_dis] / obj_dist[safe_dis].unsqueeze(-1)
        rew = torch.zeros(self.num_envs, device=self.device, dtype=torch.float)
        # rew[safe_dis] = torch.abs(torch.abs(torch.sum(base_x_dir_world[safe_dis] * obj_dir_unit, dim=-1)) - 1)
        rew[safe_dis] = F.cosine_similarity(base_x_dir_world[safe_dis], obj_dir_unit)

        return rew, rew
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants