-
Notifications
You must be signed in to change notification settings - Fork 18
Make example data sets public #9
Copy link
Copy link
Open
Labels
enhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is neededquestionFurther information is requestedFurther information is requested
Description
Here are the ones that we are familiar with and trust the most:
$ gsutil ls gs://musc-codex/datasets/ \
| grep "20180706\|20180614" \
| xargs -I@ -P1 bash -c "gsutil du -sh @"
6.65 GiB gs://musc-codex/datasets/20180614_D22_RepA_Tcell_CD4-CD8-DAPI_5by5
9.88 GiB gs://musc-codex/datasets/20180614_D22_RepB_Tcell_CD4-CD8-DAPI_5by5
9.4 GiB gs://musc-codex/datasets/20180614_D23_RepA_Tcell_CD4-CD8-DAPI_5by5
8.81 GiB gs://musc-codex/datasets/20180614_D23_RepB_Tcell_CD4-CD8-DAPI_5by5
5.55 GiB gs://musc-codex/datasets/20180706-Donor22-R2-Tcell-CODEX_CD3CD4CD85BY5
5.38 GiB gs://musc-codex/datasets/20180706-Donor23-R2-Tcell-CODEX_CD3CD4CD85BY5These would also make testing the framework easier for everybody. The only thing we have to make sure is that making these data sets won't be that costly. We can go with a service like figshare but not sure how their downloading bandwidth scales if we need to download it over and over again.
@hammer: any suggestions?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is neededquestionFurther information is requestedFurther information is requested