Skip to content

Metadata handling with mutliple input files #31

@DamienIrving

Description

@DamienIrving

At the moment, every time a command line program is run the relevant metadata (vistrail file name with git hash, script name with git hash, and complete history of command line entries all the way back to initial generation of the data) is stored in the global attributes of the output netCDF file (the build_metadata function within utils/utils.py does this).

This works fine if there is only one input file, but for processes with more than one input file the history of only one of the files is retained. In my own work I get around this problem by looping over all the input files, extracting global history attribute, and then appending them all to the global history attribute of the output file (see the write_metadata function within general_io.py for details). I think we need a similar approach here.

As well as the global history attribute, it would also be nice to append to the global "version" attribute for each of the input files, as in CMIP5 files this specifies which version of the data was used.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions