Thank you for your great work. How do you evaluate model performance on the RoboSpatial-Home dataset?