Task #7461

MNDeployment #7082: USGS Science Data Catalog (SDC)

Determine how to handle metadata which is well-formed but does not validate

Added by Bruce Wilson about 8 years ago. Updated over 5 years ago.

Target version:
Start date:
Due date:
% Done:


Story Points:


SDC is an example of cases where there are metadata records that are declared as FGDC, but contain extensions (such as the ESRI extensions) so that the file does not actually validate according to the declared schema. There objective of this task is to determine which approach to take. One extreme is the current approach, where only metadata files that pass validation are accepted. Another is to allow any well-formed XML (and potentially set the sysmeta to declare the metadata type as text/xml). From Bruce's perspective, if we accept metadata that's not schema valid, it should be straightforward to determine from sysmeta (or something similar) which metadata is schema valid and which is not. There are intermediate proposals along the idea of looking to see if schema invalid metadata matches one of a list of known alternatives and only accepting the metadata where it matches something from a known list of variants to the declared schema.

The output from this task is a plan with tasks that will at least allow the ESRI-extended FGDC metadata from SDC to be accepted into DataONE.

Likely requirements:
* The metadata must at least be well-formed XML
* The metadata mush be at least interpretable for discovery.

* Tools need to be able to determine if the metadata is valid for those tools that want to do deeper interpretation (like the Matlab or R tools)
* It is useful to be able to provide feedback to MNs for the metadata that's interpretable but not schema valid.

Related issues

Related to Member Nodes - MNDeployment #3238: Idaho Northwest Knowledge Network member node Operational 2013-01-30
Blocks Member Nodes - Task #7083: USGS SDC metadata issues with CSDGM Closed 2015-05-07


#1 Updated by Bruce Wilson about 8 years ago

  • Blocks Task #7083: USGS SDC metadata issues with CSDGM added

#2 Updated by Laura Moyers over 7 years ago

#3 Updated by Amy Forrester over 5 years ago

  • % Done changed from 30 to 100
  • Status changed from In Progress to Closed

per Dave - 4/27/2018

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 14.8 MB)