-
Notifications
You must be signed in to change notification settings - Fork 5
Description
From #1, pulling out the main question regarding having duplicative metadata. Question at hand seems to be focused on two options:
- Have minimal information in a metadata file. Generate additional information on the fly when indexed by a catalog -- like UUID, organization or dataset metadata, or other information that can be read by GDAL.
- Have duplicative information exist in the metadata file -- URI of the image is captured in the metadata file, provider and contact information, as well as bounding box and footprint.
There are pros and cons to each option. In terms of goals, it seems that the main goals for metadata should be to:
- Ensure that an image and its information exist in a valid format for indexing
- Provide easy method for catalogs and services to process the json
- Be agnostic to use cases, simple, and human readable
- Easily maintainable
To help be easily maintainable, one approach we've taken during initial testing and development has been to work on scripts to help automate the metadata file generation: https://github.com/openimagerynetwork/oin-meta-generator, to be later packaged into a command line tool. Using this utility, it would be easy for a provider to create or update thousands of metadata json files.
@kamicut @scisco and I recommend going with Option 2 as the first version of OIN. Some of the conversation was captured in Gitter with @lossyrob.
@lossyrob @warmerdam @cholmes @wonderchook @cgiovando Want to open it up to the group to make sure we're thinking through all the options and get additional input.