This project uses pixi for dependency management.
```
# avoid LeRobot install errors
export GIT_LFS_SKIP_SMUDGE=1
export CPPFLAGS="-I/usr/include"
export CFLAGS="-DHAVE_LINUX_INPUT_H"
pixi install
pixi run install-deps
pixi run setup-pre-commit
pixi shell
```

Download the MANO models and place them in `mano/`.
Alternatively, use Docker (see below).
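As a quick sanity check that the files landed where expected, a minimal sketch (the file names assume the official MANO release; adjust if your download differs):

```
# Hedged check: confirm the MANO pickles are present under mano/.
# File names assume the official MANO download.
from pathlib import Path

for f in ("MANO_LEFT.pkl", "MANO_RIGHT.pkl"):
    assert (Path("mano") / f).exists(), f"missing mano/{f}"
```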
- Calibration:
  - The default path to the MuJoCo robot model is `~/.cache/robot_descriptions/mujoco_menagerie/{YOUR ROBOT}`, e.g. `~/.cache/robot_descriptions/mujoco_menagerie/aloha`.
  - Copy the robot model: copy the robot directory from the cache into your local `robot_descriptions` folder.
  - Modify the robot description XML (e.g., `aloha.xml`): update the robot description file to align the real-world robot's geometry, joint limits, and transforms with the virtual robot used by MuJoCo.
    - e.g. adjust the positions of `left/base_link` and `right/base_link` to match your real setup (see the sketch below).
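To see what you're aligning against, a minimal sketch that prints the current base link poses (assumes the `mujoco` Python bindings and the stock `aloha.xml` body names; not part of this repo's scripts):

```
# Hedged sketch: print current base_link poses from the cached menagerie model
# so they can be compared against the measured real-world setup.
import os
import mujoco

xml = os.path.expanduser(
    "~/.cache/robot_descriptions/mujoco_menagerie/aloha/aloha.xml")
model = mujoco.MjModel.from_xml_path(xml)
for name in ("left/base_link", "right/base_link"):
    # Edit the matching <body pos="..."> entries in aloha.xml to match reality.
    print(name, model.body(name).pos)
```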
- GT Points (any one of the following):
  - Tracks from SpatialTracker: Generated using `hoi4d_inference.py` from sriramsk1999/SpaTracker.
  - Tracks from General Flow: Generate event-wise tracks from `label_gen_event.py` in sriramsk1999/general-flow. When training with these tracks, the projections of the tracks won't align perfectly, due to errors in the dataset's object pose annotations (see the projection sketch below).
  - Tracks from MANO hand pose: Load the hand pose tracks provided by the HOI4D dataset. When training with these tracks, the projections of the tracks won't align perfectly, due to errors in the dataset's hand pose annotations.
- RGB/text features: Generated with:

  ```
  python rgb_text_feature_gen.py --dataset hoi4d --input_dir </path/to/hoi4d/data>
  ```
Test split generated using sriramsk1999/general-flow
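On the misalignment noted above: overlaying tracks on RGB is a standard pinhole projection, so pose annotation errors show up directly as pixel offsets. A minimal sketch, assuming camera-frame points and a 3x3 intrinsics matrix `K` (names are illustrative, not this repo's API):

```
# Hedged sketch: pinhole projection of camera-frame 3D track points to pixels.
import numpy as np

def project_tracks(pts: np.ndarray, K: np.ndarray) -> np.ndarray:
    """pts: (N, 3) points in the camera frame; K: (3, 3) intrinsics."""
    uvw = (K @ pts.T).T               # homogeneous image coordinates, (N, 3)
    return uvw[:, :2] / uvw[:, 2:3]   # (N, 2) pixel coordinates
```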
- GT Gripper Trajectory: Rendered using MuJoCo in `src/lfd3d/datasets/droid/render_robotiq.py`.
- Subgoal generation: Using Gemini in `src/lfd3d/datasets/droid/chunk_droid_with_gemini.py`.
- RGB/text features: Generated with:

  ```
  python rgb_text_feature_gen.py --dataset droid --input_dir </path/to/droid/data>
  ```

- Disparity: Using sriramsk1999/FoundationStereo (converted to metric depth as sketched below).
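The disparity maps become metric depth via the usual stereo relation. A minimal sketch, assuming a rectified pair with focal length `fx` (pixels) and baseline (meters); names are illustrative, not FoundationStereo's API:

```
# Hedged sketch: depth = fx * baseline / disparity for a rectified stereo pair.
import numpy as np

def disparity_to_depth(disp: np.ndarray, fx: float, baseline_m: float) -> np.ndarray:
    depth = np.zeros_like(disp, dtype=np.float32)
    valid = disp > 0                        # zero/negative disparity is invalid
    depth[valid] = fx * baseline_m / disp[valid]
    return depth
```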
- Download and process recordings from Foxglove using https://github.com/r-pad/lfd3d-system/.
- Generate GT for human demonstrations with a modified version of https://github.com/sriramsk1999/wilor/.
- Generate GT for robot demonstrations with `src/lfd3d/datasets/rpad_foxglove/render_aloha.py`.
- Annotate events with `src/lfd3d/datasets/rpad_foxglove/annotate_events.py`.
- With your collected LeRobot dataset, run `upgrade_dataset.py` from the lerobot repo to generate a `[repo_id]_goal` repo.
- Run a training job like this:

  ```
  python scripts/train.py model=articubot dataset=rpadLerobot dataset.repo_id=beisner/aloha_plate_placement_goal dataset.data_sources="[aloha]" resources.num_workers=16 dataset.cache_dir=/data/lfd3d_dataloading_cache
  ```

- For LIBERO:

  ```
  python scripts/train.py model=articubot dataset=liberoLerobot dataset.repo_id=sriramsk/libero_lerobot_singleTask_heatmapGoal dataset.cache_dir=libero_cache model.use_rgb=True model.in_channels=7 training.batch_size=4
  ```

- Some more examples:

  ```
  nohup python scripts/train.py model=dino_3dgp dataset=rpadLerobot dataset.repo_id="[sriramsk/fold_onesie_20250831_subsampled_heatmapGoal, sriramsk/fold_shirt_20250918_subsampled_heatmapGoal, sriramsk/fold_towel_20250919_subsampled_heatmapGoal, sriramsk/fold_bottoms_20250919_human_heatmapGoal]" resources.num_workers=32 training.batch_size=128 dataset.cache_dir=/home/sriram/Desktop/lfd3d/dino_3dgp_multifold_cache training.epochs=500 training.check_val_every_n_epochs=5 > dino_3dgp_multifold.out &
  ```

NOTE: This section needs to be updated: RGB features should come from DINOv2, and gripper-centric predictions need to be set up.
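For the DINOv2 switch flagged in the note, a minimal sketch of pulling patch features via torch.hub (the public facebookresearch/dinov2 release; the normalization and resolution this repo ends up using may differ):

```
# Hedged sketch: DINOv2 ViT-S/14 patch features via torch.hub.
import torch

model = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14").eval()
img = torch.randn(1, 3, 224, 224)  # placeholder for a normalized RGB frame
with torch.no_grad():
    feats = model.forward_features(img)["x_norm_patchtokens"]
print(feats.shape)  # (1, 256, 384): 16x16 patches, 384-dim features
```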
- Tracks: Generated using `rt1_inference.py` from sriramsk1999/CoTracker.
- Depth: Generated using `rt1_inference.py` from sriramsk1999/RollingDepth.
After generating tracks and depth:
- Preprocess captions with `src/lfd3d/datasets/rt1/process_captions.py`.
- RGB/text features: Generated using `src/lfd3d/datasets/rt1/rgb_text_feature_gen.py`.
- Chunking and filtering: Generated using `src/lfd3d/datasets/rt1/save_event_rgb.py`.
Test split taken from 3D-VLA
To train a model:
```
python scripts/train.py model=df_cross dataset=hoi4d dataset.data_dir=<path/to/dataset/>
```
Optionally, also set `dataset.cache_dir` to save processed data for faster training.
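For example (paths are placeholders):

```
python scripts/train.py model=df_cross dataset=hoi4d \
    dataset.data_dir=<path/to/dataset/> dataset.cache_dir=<path/to/cache/>
```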
Training with multiple datasets:
```
python scripts/train.py model=df_cross dataset=multi \
    dataset.datasets.rt1.data_dir=/data/sriram/rt1 \
    dataset.datasets.rt1.cache_dir=/home/sriram/Desktop/lfd3d/rt1_cache \
    dataset.datasets.hoi4d.data_dir=/data/sriram/hoi4d/hoi4d_data \
    dataset.datasets.hoi4d.cache_dir=/home/sriram/Desktop/lfd3d/hoi4d_cache
```
The best checkpoint is saved to WandB at the end of training. If training is interrupted, or if you want to upload an intermediate checkpoint stored in `logs/`:
```
cd scripts/
python upload_wandb.py --run_id <wandb-run-id> --checkpoint_path <path/to/checkpoint>
```
Evaluation:
```
python scripts/eval.py checkpoint.run_id=<wandb-run-id> dataset.data_dir=<path/to/dataset/>
```
If the model was trained on the cluster, `dataset.cache_dir` needs to be overridden and set to `null`.
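e.g.:

```
python scripts/eval.py checkpoint.run_id=<wandb-run-id> \
    dataset.data_dir=<path/to/dataset/> dataset.cache_dir=null
```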
Evaluation on LeRobot datasets, independently for each episode:
```
python scripts/eval_lerobot_episode.py checkpoint.run_id=gcrbgmsf dataset=rpadLerobot dataset.repo_id="[sriramsk/fold_onesie_20250831_subsampled_heatmapGoal, sriramsk/fold_shirt_20250918_subsampled_heatmapGoal, sriramsk/fold_towel_20250919_subsampled_heatmapGoal, sriramsk/fold_bottoms_20250919_human_heatmapGoal]" model=dino_heatmap checkpoint.type=pix_dist
```
Build the docker image:
```
docker build -t $DOCKERHUB_USERNAME/lfd3d .
```

To run the training script:
```
docker run \
    -v </path/to/dataset>:/opt/rpad/data \
    -v $(pwd)/logs:/opt/rpad/logs \
    -v $(pwd)/mano:/opt/rpad/code/mano \
    --gpus all \
    --shm-size=8G \
    -e WANDB_API_KEY=$WANDB_API_KEY \
    -e WANDB_DOCKER_IMAGE=lfd3d \
    $DOCKERHUB_USERNAME/lfd3d pixi run python scripts/train.py \
    model=df_cross \
    dataset=<dataset-name> \
    dataset.data_dir=/opt/rpad/data
```

This codebase was adapted from TAX3D and python-ml-project-template.