Skip to content

Version control of data #8

@nheeren

Description

@nheeren

In a larger project, we have the issue that we would like to create a database on github. However, GitHub is meant to keep track of changes in text files and we are using binary files (xlsx) for now. That means uploading new versions of the data files will eventually cause very large overhead over time and no meaningful version control is possible. I could see that the final data will be converted to csv at some point, but so far, this database is a moving target and we would like to use Excel files for now.

Can we add guidelines or recommendations in the wiki on how to do version control of IE datasets and databases? Any suggestions are very much welcome.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions