GCS support for FdReader used in (downstream) TFDS #31

@kentslaney

Following up on google/array_record/issues/120

The following code fails, although I would expect it to work:

```python
import tensorflow_datasets as tfds

ds = tfds.data_source("ref_coco", data_dir="gs://ref_coco", try_gcs=True)
next(iter(ds['train']))
```

Colab link

The path is passed to a Riegeli FdReader (source), and the array_record maintainers have pointed me upstream. I can't find any mention of GCS support in the docs, but since TFDS provides a try_gcs argument, it seems GCS buckets should be supported somewhere along the line.
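To illustrate the suspected failure mode, here is a minimal stdlib-only sketch. It assumes that a file-descriptor-based reader opens the path with a plain OS-level open call, so a `gs://` URL is treated as an ordinary (nonexistent) local path. The helper names and the shard file name are hypothetical and are not part of Riegeli, array_record, or TFDS.

```python
import os
from urllib.parse import urlparse


def has_remote_scheme(path: str) -> bool:
    """Return True if the path carries a URL scheme such as gs://."""
    return "://" in path and urlparse(path).scheme not in ("", "file")


def can_open_as_fd(path: str) -> bool:
    """Try to open the path the way a plain file-descriptor reader would."""
    try:
        fd = os.open(path, os.O_RDONLY)
    except OSError:
        # The OS has no notion of gs://, so the open call fails here.
        return False
    os.close(fd)
    return True


# Hypothetical shard name, used only for illustration.
shard = "gs://ref_coco/example.array_record"
print(has_remote_scheme(shard))  # True
print(can_open_as_fd(shard))     # expected to be False on a typical machine
```

If this is indeed what happens, GCS paths would need to be handled before the path reaches the reader, for example by a filesystem layer that understands the `gs://` scheme.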
