Project

General

Profile

Story #6848

Expand ObjectFormat.formatType options

Added by Ben Leinfelder almost 10 years ago. Updated over 9 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Ben Leinfelder
Category:
d1_indexer
Target version:
Start date:
2015-02-11
Due date:
% Done:

100%

Story Points:
Sprint:

Description

I'm trying to support a semantic annotation object format - in this case it is serialized as JSON - and am unsure how to type it. In most ways it should be treated like a RESOURCE formatType in that:

it should be harvested and stored on the CN

it should be indexed by the CN

indexing these objects affects index values for other objects that are described by the annotation model

Unfortunately, we require (or assume?) that a RESOURCE formatType is an ORE model serialized as RDF/XML.

We should relax this requirement using one or more of these options:

allow other ObjectFormats to be be classified as RESOURCEs and synchronize them without an extra "validation" step

consider indexing DATA objects and type the annotation as that

consider allowing non-XML METADATA objects to be stored and indexed and type the annotation as this

consider adding a new formatType that is handled similarly to our current RESOURCE formatType but is more lenient and will allow for JSON serializations (and others)


Related issues

Related to Infrastructure - Task #3972: Validate DataONE Resource Map Objects Closed 2013-09-11

History

#1 Updated by Ben Leinfelder almost 10 years ago

  • Related to Task #3972: Validate DataONE Resource Map Objects added

#2 Updated by Dave Vieglais over 9 years ago

  • Target version changed from CCI-2.0.0 to CCI-2.0-TRUNK

#3 Updated by Ben Leinfelder over 9 years ago

  • Assignee set to Ben Leinfelder
  • % Done changed from 0 to 100
  • Category set to d1_indexer
  • Milestone changed from None to CCI-2.0
  • Status changed from New to Closed

I've since refactored the indexing process such that we can handle RESOURCE format types as XML or as any other format - provided there is an appropriate subprocessor that can handle the format. In my case, the subprocessor parses the JSON format and looks for annotation tags. The existing ORE processo and the generic RDF/XML subprocessor continue to parse their formats as they wish.

#4 Updated by Dave Vieglais over 9 years ago

  • Target version changed from CCI-2.0-TRUNK to CCI-2.0.0

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 14.8 MB)