Aloha Sim

Aloha Sim is a python library that defines the sim environment for the Aloha robot. It includes a collection of tasks for robot learning and evaluation.

Installation

Install with pip:

# create a virtual environment and pip install
pip install -e .

OR run directly with uv:

pip install uv
uv run <script>.py

Tell mujoco which backend to use, otherwise the simulation will be very slow

export MUJOCO_GL='egl'

Viewer

Interact with the scene without a policy:

python aloha_sim/viewer.py --policy=no_policy --task_name=HandOverBanana

Tests

# individual tests
python aloha_sim/tasks/test/aloha2_task_test.py
python aloha_sim/tasks/test/hand_over_test.py
...

# all tests
python -m unittest discover aloha_sim/tasks/test '*_test.py'

Inference

⚠️ For Gemini Robotics Trusted Testers Only

Inference with Gemini Robotics models is intended for Trusted Testers. If you are not a Trusted Tester, sign up here.

Follow our SDK documentation to serve the model. The same model used for real-world evaluations can be directly applied in simulation.

Checkout the walkthrough video:

Install SDK dependency

pip install aloha_sim[inference]

Interactive Rollouts

Start the viewer with a chosen task:

# defaut task: "put the banana in the bowl"
python aloha_sim/viewer.py

# "remove the cap from the marker"
python aloha_sim/viewer.py --task_name=MarkerRemoveLid

# "place the can opener in the left compartment of the caddy"
python aloha_sim/viewer.py --task_name=ToolsPlaceCanOpenerInLeftCompartment
...

Checkout task_suite.py for the list of all tasks available.

You can use the viewer to pause/resume the environment, interact with the objects, and enter new instructions for the robot

Instructions for using the viewer:

- shift + 'i' = enter new instruction
- space bar = pause/resume.
- backspace = reset environment.
- mouse right moves the camera
- mouse left rotates the camera
- double-click to select an object

When the environment is not running:

- ctrl + mouse left rotates a selected object
- ctrl + mouse right moves a selected object

When the environment is running:

- ctrl + mouse left applies torque to an object
- ctrl + mouse right applies force to an object

Eval

python aloha_sim/run_eval.py

Runs N evaluation episodes for all tasks and save videos in /tmp/.

Benchmark

Success Rates (with 95% Confidence Interval) from 100 episodes per task x 3 runs.

Basic Tasks

Task	Gemini Robotics On Device Success Rate (95% CI)
BowlOnRack	99.3 (0.93)
DrawerOpen	87.0 (3.83)
HandOverBanana	99.0 (1.13)
HandOverPen	93.7 (2.77)
LaptopClose	78.0 (4.71)

Instruction-Following Tasks

Task	Gemini Robotics On Device Success Rate (95% CI)
DiningPlaceBananaInBowl	93.3 (2.84)
DiningPlaceMugOnPlate	32.0 (5.31)
DiningPlacePenInContainer	19.7 (4.52)
ToolsPlaceCanOpenerInLeftCompartment	82.0 (4.37)
ToolsPlaceCanOpenerInRightCompartment	73.3 (5.03)
ToolsPlaceMagnifierInRightCompartment	91.3 (3.20)
ToolsPlaceMagnifierInLeftCompartment	81.7 (4.40)
ToolsPlaceScissorsInLeftCompartment	73.3 (5.03)
ToolsPlaceScissorsInRightCompartment	79.0 (4.64)
ToolsPlaceScrewdriverInLeftCompartment	81.0 (4.46)
ToolsPlaceScrewdriverInRightCompartment	78.3 (4.69)
BlocksSpelling	4.7 (2.40)

Dexterous Tasks

Task	Gemini Robotics On Device Success Rate (95% CI)
MarkerRemoveLid	73.7 (5.01)
DesktopWrapHeadphone	8.7 (3.21)
TowelFoldInHalf	6.7 (3.18)

To reproduce the results, use random_seed = 42 + episode_index.

Tips

If the environment stepping is very slow, check that you are using the right backend, e.g. MUJOCO_GL='egl'
Tasks with deformable objects like DesktopWrapHeadphone and TowelFoldInHalf are slow to simulate and interact directly with viewer.py.

Note

This is not an officially supported Google product.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
aloha_sim		aloha_sim
media		media
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Aloha Sim

Installation

Viewer

Tests

Inference

Install SDK dependency

Interactive Rollouts

Eval

Benchmark

Basic Tasks

Instruction-Following Tasks

Dexterous Tasks

Tips

Note

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

google-deepmind/aloha_sim

Folders and files

Latest commit

History

Repository files navigation

Aloha Sim

Installation

Viewer

Tests

Inference

Install SDK dependency

Interactive Rollouts

Eval

Benchmark

Basic Tasks

Instruction-Following Tasks

Dexterous Tasks

Tips

Note

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages