Skip to content

Minimal or duplicative metadata #2

@smit1678

Description

@smit1678

From #1, pulling out the main question regarding having duplicative metadata. Question at hand seems to be focused on two options:

  1. Have minimal information in a metadata file. Generate additional information on the fly when indexed by a catalog -- like UUID, organization or dataset metadata, or other information that can be read by GDAL.
  2. Have duplicative information exist in the metadata file -- URI of the image is captured in the metadata file, provider and contact information, as well as bounding box and footprint.

There are pros and cons to each option. In terms of goals, it seems that the main goals for metadata should be to:

  • Ensure that an image and its information exist in a valid format for indexing
  • Provide easy method for catalogs and services to process the json
  • Be agnostic to use cases, simple, and human readable
  • Easily maintainable

To help be easily maintainable, one approach we've taken during initial testing and development has been to work on scripts to help automate the metadata file generation: https://github.com/openimagerynetwork/oin-meta-generator, to be later packaged into a command line tool. Using this utility, it would be easy for a provider to create or update thousands of metadata json files.

@kamicut @scisco and I recommend going with Option 2 as the first version of OIN. Some of the conversation was captured in Gitter with @lossyrob.

@lossyrob @warmerdam @cholmes @wonderchook @cgiovando Want to open it up to the group to make sure we're thinking through all the options and get additional input.

cc @scisco @kamicut

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions