Skip to content

Communicating data set quality #27

@cookeac

Description

@cookeac

When interchanging a map layer, it can be helpful for the recipient to understand the:

  • Provenance (generated by whom, and when)
  • Scale
  • Processing applied
  • Applicability (intended purposes, or what it might be suitable for).
  1. It would make sense to attach this information to a feature catalog item (either a OCG feature catalog entry, or more specifically to a holding-level FeatureCatalogItemResource in this schema).
  2. This information could also be attached to a feature collection or even individual features when these are served, but it makes more sense to use a feature catalog because that way decisions about applicability can be made before the feature collection is fetched.
  3. Where possible, use should be made of the Geographic Information Data Quality Metadata Standard in ISO 19157 (licenced document, hence not linked). There is however a precis of this standard available and a more general document here.

Key data quality attributes are:

  • Completeness (metrics of omission and commission errors from a data evaluation)
  • Logical Consistency (measures of format, topology, domain, and conceptual consistency)
  • Thematic Accuracy (metrics of attribute correctness, particularly in classification of attributes)
  • Temporal Quality (accuracy of time measurement and temporal consistency and validity)
  • Positional Accuracy (resolution, accuracy of represented positions vs real world)
  • Meta Quality (confidence, homogeneity, and representativity)
  • Conformance (conformance with a standard or agreed profile)

Many of these require a formal QA review (using ISO 19158) to populate, which I suspect won't happen with most farm-level data, except perhaps for data sets compiled formally at national or regional level.

It seems that the most relevant to farm scale data might be:

  • Data Quality - Conformance - to an agreed specification, naming the specification, and with a boolean True/False for conformance (would require agreed specifications)
  • Spatial Resolution - Vector Spatial Representation - topology level code, scale denominator or distance equivalent
  • Provenance - Derivation - derivation method
  • Provenance - Acquisition - acquisition platform or instrument (though these are really designed for satellite data etc)

Feedback needed: Could those who are interested in data quality metrics please comment with their needs/thoughts?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions