Project

General

Profile

Task #5942

MNDeployment #3708: Minnesota Population Center

Task #5921: MPC: Testing

Task #5922: MPC: Registration in environment

Task #5933: MPC: Content Review

Task #5941: MPC: Verify Resource Maps

MPC: Verify Resource Map content

Added by Laura Moyers over 8 years ago. Updated almost 8 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Target version:
Start date:
2014-07-18
Due date:
% Done:

100%

Estimated time:
0.00 h
Story Points:
Sprint:

Description

Verify that Resource Maps are complete, correctly formatted and represent the intended associations.

ipumsi_6-3_at_2001-without-data.rdf.xml Magnifier - Resource Map example without data (4.46 KB) Chris Jones, 2014-09-06 00:36

ipumsi_6-3_at_2001-with-data.rdf.xml Magnifier - Resource Map example with data (4.95 KB) Chris Jones, 2014-09-06 00:46


Related issues

Blocks Member Nodes - Task #5944: MPC: Verify that Resource Maps are correctly processed by CNs. Closed 2014-07-18
Blocks Member Nodes - Task #5932: MPC: Set up synchronization of the MN Rejected 2014-07-18

History

#1 Updated by Laura Moyers over 8 years ago

  • Target version changed from Deploy by end of Y5Q4 to Deploy by end of Y1Q1

#2 Updated by Chris Jones over 8 years ago

  • Assignee set to Chris Jones
  • Status changed from New to In Progress

I'm seeing a few issues with the MPC resource maps at https://dataone-test.pop.umn.edu/mn/v1/object?formatId=http://www.openarchives.org/ore/terms :

1) Some system metadata for resource maps have a formatId of 'application/octet-stream'. This needs to be changed to 'http://www.openarchives.org/ore/terms'. See https://dataone-test.pop.umn.edu/mn/v1/object/ipumsi_6-3_cr_1984.rm.xml as an example.

2) The serialized resource maps include an aggregation statement where the resource map itself is aggregated, which I don't think is correct. The aggregation should only include the science data and science metadata triple statements (or other types of metadata, like provenance, etc.).

3) The triple statements in the resource maps don't point to CN-resolvable URIs, which is a requirement for DataONE data packages. See http://mule1.dataone.org/ArchitectureDocs-current/design/DataPackage.html#generating-resource-maps . An example is in https://dataone-test.pop.umn.edu/mn/v1/object/ipumsi_6-3_cr_1984.rm.xml, where both the subjects and objects in the triple statements point to URIs like http://international.ipums.org/ ...

#3 Updated by Bruce Wilson over 8 years ago

I communicated the formatId issue to Fabio and Wend y today (2014-08-28 2:00 PM EDT).

I think that there's a fourth issue to address:

3b) Identifiers appear to be using mixed case and underscores, but the MPC identifiers are all lower case and dots. For example, in https://dataone-test.pop.umn.edu/mn/v1/object/ipumsi_6-3_cr_1984.rm.xml one of the items in the aggregation is @@. But the object that this likely refers to has an identifier ipumsi_6-3_cr_1984.dc.xml

#5 Updated by Chris Jones about 8 years ago

I've attached two example resource maps to help clarify the content of the MPC resource maps.

The first describes an aggregation that has no science data in it, but rather three metadata files (Dublin Core file, DDIC XML file, and DDIC HTML file). Because only the Dublin Core metadata file is formatType METADATA in our object format list, it's fields will get parsed into the search index. The DDIC XML file, for now, can stay with a formatId of application/octet-stream, and it's fields won't be parsed. The DDIC HTML transform should have a formatId of text/html, and it too won't have it's fields parsed. However, all three of these files will be available for download by scientists since they are part of the aggregation (Data Package).

The second example describes an aggregation that contains one science metadata file (Dublin Core), and two science data files (CSVs). This resource map shows how the one science metadata file 'cito:documents' the two science data files, and all three are members of the aggregation (Data Package).

#6 Updated by Chris Jones about 8 years ago

  • translation missing: en.field_remaining_hours set to 0.0
  • Status changed from In Progress to Closed

Skye and I have worked with Wendy to correct resource map issues, and now the resource maps are being parsed correctly. Closing this ticket.

#7 Updated by Laura Moyers about 8 years ago

  • Target version changed from Deploy by end of Y1Q1 to Deploy by end of NCTE

#8 Updated by Laura Moyers almost 8 years ago

  • Target version changed from Deploy by end of NCTE to Operational

#9 Updated by Laura Moyers almost 8 years ago

  • % Done changed from 0 to 100
  • Estimated time set to 0.00

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 14.8 MB)