Open
Description
Currently the nwp DAG copies the most recent zarr file to latest.zarr, but this can take 10 minutes as there could be 10,000 files. Some options to speed this up:
- Change the chunking, but this doesn't work so well if we are writing to the store in parallel, and we don't want to make too many differences between archive and consume. This could be done with an env var when collecting live data
- Use s3 batch jobs
- Rechunk after pulling the data in the nwp-consumer
- Use aws sync?
- Save as zarr and as a zarr.zip, then the DAG could copy and unzip
- Try zarr3 and larger chunk sizes
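
Two of the options above (larger chunks set via an env var, and the zarr.zip copy) can be sketched roughly as follows. This is only an illustration using the standard library: the `NWP_CHUNKS` variable name, the function names, and the array shape in the usage note are all hypothetical, and it assumes the store is a plain directory of chunk files.

```python
import math
import os
import zipfile
from pathlib import Path


def count_chunk_files(shape, chunks):
    """Chunk objects written for one array: product of ceil(dim / chunk) per axis."""
    return math.prod(math.ceil(s / c) for s, c in zip(shape, chunks))


def chunks_from_env(default=(1, 100, 100)):
    """Read chunk sizes from NWP_CHUNKS (hypothetical env var), e.g. "1,500,500"."""
    raw = os.environ.get("NWP_CHUNKS")
    return tuple(int(x) for x in raw.split(",")) if raw else default


def zip_store(store_dir, zip_path):
    """Pack a directory-based zarr store into one zip file, so the copy is one object."""
    store_dir = Path(store_dir)
    # ZIP_STORED adds no compression, assuming the chunks are already compressed.
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_STORED) as zf:
        for path in sorted(store_dir.rglob("*")):
            if path.is_file():
                # Store paths relative to the store root so unzipping recreates it.
                zf.write(path, path.relative_to(store_dir).as_posix())
    return zip_path


def unzip_store(zip_path, target_dir):
    """Unpack the zip back into a directory store (the step the DAG would run)."""
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(target_dir)
    return target_dir
```

For a made-up array of shape `(48, 1000, 1000)`, `count_chunk_files` gives 4800 files with chunks of `(1, 100, 100)` and 192 with `(1, 500, 500)`, which is the kind of reduction the larger-chunk options are after. The zip route keeps the chunking unchanged and trades many small copies for one large one plus an unzip.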