Bug #3492

Invalid PIDs in production (whitespace)

Added by Dave Vieglais over 7 years ago. Updated about 3 years ago.

Status: In Progress
Priority: Urgent
Assignee:
Category: Support Operations
Target version:
Start date: 2013-01-18
Due date:
% Done: 30%
Milestone: None
Product Version: *
Story Points:
Sprint:

Description

Recording this for future reference.

There are nine PIDs in the production environment that contain whitespace. This appears to have no functional effect: sysmeta and objects can still be retrieved, so no action is required other than to ensure no more sneak in.

The PIDs in question are:

guid

doi:10.5063/AA/Virginia Perez.2.1
judith kruger.3.1
judith kruger.4.1
judith botha.1.1
judith kruger.1.1
judith kruger.2.1
judith kruger.5.1
judith botha.2.1
resourceMap_Lin Cheng-Jung.1.1
resourceMap_Lin Cheng-Jung.1.2
resourceMap_Lin Cheng-Jung.1.3
Lin Cheng-Jung.1.1
Lin Cheng-Jung.1.2
Lin Cheng-Jung.1.3
doi:10.6085/AA/ SHLX00_XXXITV2XLSR03_20111128.40.1


Subtasks

Task #3498: Obsolete these invalid PIDs on SANParks (Closed, Ben Leinfelder)

Task #3499: Investigate Virginia Perez.2.1 object (Closed, Ben Leinfelder)

Task #3131: Whitespace in pisco PID (Closed, Ben Leinfelder)

Bug #4674: Ask Judith, Mike and Virginia Perez.2.1 to obsolete those pids which contain the white spaces. (New, Jing Tao)

Task #4675: Administratively delete those pids with the white spaces (New)

History

#1 Updated by Dave Vieglais over 7 years ago

There are nine PIDs in the production environment that contain whitespace.

Actually eight, one dupe in the cut'n paste:

doi:10.5063/AA/Virginia Perez.2.1
judith botha.1.1
judith botha.2.1
judith kruger.1.1
judith kruger.2.1
judith kruger.3.1
judith kruger.4.1
judith kruger.5.1

#2 Updated by Ben Leinfelder over 7 years ago

  • Description updated (diff)

#3 Updated by Dave Vieglais about 7 years ago

Some new invalid PIDs appeared in production, mostly from the TFRI node:

Lin Cheng-Jung.1.2
Lin Cheng-Jung.1.3
Lin Cheng-Jung.1.1
resourceMap_Lin Cheng-Jung.1.1
resourceMap_Lin Cheng-Jung.1.3
resourceMap_Lin Cheng-Jung.1.2
doi:10.6085/AA/ SHLX00_XXXITV2XLSR03_20111128.40.1

#4 Updated by Dave Vieglais almost 7 years ago

  • Assignee changed from Dave Vieglais to Robert Waltz

#5 Updated by Robert Waltz over 6 years ago

  • Product Version set to 1.3.0
  • Milestone changed from None to CCI-1.3

#6 Updated by Robert Waltz over 6 years ago

  • Milestone changed from CCI-1.3 to CCI-1.2
  • Target version set to 2013.44-Block.6.1
  • Due date set to 2013-11-09
  • Product Version changed from 1.3.0 to *
  • Start date set to 2013-10-27

#7 Updated by Chris Jones over 6 years ago

  • Assignee changed from Robert Waltz to Ben Leinfelder

#8 Updated by Chris Jones over 6 years ago

  • Due date changed from 2013-11-09 to 2014-02-01
  • Target version changed from 2013.44-Block.6.1 to 2014.4-Block.1.2

#9 Updated by Chris Jones over 6 years ago

  • Assignee changed from Ben Leinfelder to Jing Tao

#10 Updated by Jing Tao over 6 years ago

  • Status changed from New to Closed

We added code to Metacat to enforce that the docid contains no whitespace.
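A minimal sketch of this kind of check follows. It is not the Metacat code (Metacat is written in Java). The function name `has_whitespace` is hypothetical, and treating any Unicode whitespace character as invalid is an assumption about the rule.

```python
def has_whitespace(pid: str) -> bool:
    """Return True if the identifier contains any whitespace character."""
    return any(ch.isspace() for ch in pid)


# Three PIDs from this ticket plus one made-up valid identifier for contrast.
pids = [
    "doi:10.5063/AA/Virginia Perez.2.1",
    "judith kruger.3.1",
    "doi:10.6085/AA/ SHLX00_XXXITV2XLSR03_20111128.40.1",
    "doi:10.5063/AA/example.1.1",  # hypothetical, not a real PID
]

# Collect the identifiers that would be rejected by the check.
invalid = [p for p in pids if has_whitespace(p)]
```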

#11 Updated by Jing Tao over 6 years ago

  • Status changed from Closed to In Progress

Reopening the bug: we still need to contact the administrator and ask them to change the docids.

#12 Updated by Chris Jones over 6 years ago

  • Target version changed from 2014.4-Block.1.2 to 2014.12-Block.2.2
  • Due date changed from 2014-02-01 to 2014-03-29

#13 Updated by Chris Jones over 6 years ago

  • Description updated (diff)

#14 Updated by Robert Waltz almost 6 years ago

  • Due date changed from 2014-03-29 to 2014-09-02
  • Milestone changed from CCI-1.2 to None
  • Target version changed from 2014.12-Block.2.2 to Release Backlog

#15 Updated by Dave Vieglais almost 6 years ago

  • Due date changed from 2014-09-02 to 2014-09-24
  • Target version changed from Release Backlog to Maintenance Backlog

#16 Updated by Rob Nahf over 3 years ago

  • % Done changed from 20 to 30

The recent upgrade to 2.3 changes the retrieval behavior somewhat, most likely because JAXB schema validation replaced our custom JiBX validation.

The main difference is that a /meta call returns a ServiceFailure. A /cn/object call will still return the object (if it's on the CN) if you percent-encode the space in the URL as %20.
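For reference, a sketch of building such a URL with the space escaped. The helper name `object_url` is hypothetical, and the base URL and `/object/{pid}` path layout are assumptions taken from the listObjects URL in this comment.

```python
from urllib.parse import quote


def object_url(base_url: str, pid: str) -> str:
    """Build an object URL with the PID percent-encoded as one path segment."""
    # safe="" also encodes "/" and ":", so the whole PID stays one segment.
    return f"{base_url}/object/{quote(pid, safe='')}"


url = object_url("https://cn.dataone.org/cn/v2", "Lin Cheng-Jung.1.3")
print(url)
```

Encoding with `safe=""` matters for the doi-style PIDs in this ticket, since they contain `/` and `:` as well as a space.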

Also, listObjects will omit the object from the list but return other pids.

"Lin Cheng-Jung.1.3" is a pid with no access policy, but via psql, I found the modified date for it, and did a time-bound slice of listObjects:

https://cn.dataone.org/cn/v2/object?fromDate=2013-06-17T03:31:00.000Z&toDate=2013-06-17T03:32:00.000Z
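The same time-bound slice can be built programmatically. This is only a sketch: the helper name `list_objects_url` is hypothetical, and only the `fromDate` and `toDate` parameters shown in the URL above are used.

```python
from urllib.parse import urlencode


def list_objects_url(base_url: str, from_date: str, to_date: str) -> str:
    """Build a listObjects URL restricted to a modification-time window."""
    # safe=":" keeps the colons in the ISO 8601 timestamps readable.
    query = urlencode({"fromDate": from_date, "toDate": to_date}, safe=":")
    return f"{base_url}/object?{query}"


slice_url = list_objects_url(
    "https://cn.dataone.org/cn/v2",
    "2013-06-17T03:31:00.000Z",
    "2013-06-17T03:32:00.000Z",
)
print(slice_url)
```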
