Hello, I would like to use the GTAV dataset for fully supervised training based on MaskCLIP. Where should I add the function to read the GTAV dataset?