Following up on google/array_record/issues/120
The following code fails when it shouldn't
import tensorflow_datasets as tfds
ds = tfds.data_source("ref_coco", data_dir="gs://ref_coco", try_gcs=True)
next(iter(ds['train']))
Colab link
The path gets handed over to a Riegeli FdReader (source) and the array_record maintainers have pointed me upstream. I can't find GCS support mentioned in the docs, but given TFDS provides a try_gcs argument, it seems like GCS buckets should be supported somewhere along the line.