Task #3140
Story #3136: Inconsistent data in production 1.0.3 release
Missing UCSB xml_revisions
100%
Description
This affects one set of docs. I believe it is due to errors encountered when replicating "deleted" documents from one Metacat instance to another.
The access log (query below) shows the initial "insert" events and the subsequent "delete" events, issued either by the actual node or propagated by replication.
During "delete" (CN.archive()) it is conceivable that migrating content from xml_documents to xml_revisions failed, but the exception did not prevent the routine from continuing (and deleting the original xml_documents entry). Now we are in a state where a "deleted" docid cannot propagate via replication because it does not exist in xml_documents.
-- look at what happened to these PIDs on UCSB
select id.guid, id.docid, id.rev, log.*
from identifier id, access_log log
where id.guid in (
'doi:10.6073/AA/knb-lter-gce.144.3',
'doi:10.6073/AA/knb-lter-gce.172.3',
'doi:10.6073/AA/knb-lter-gce.170.1',
'doi:10.6073/AA/knb-lter-gce.178.1',
'doi:10.6073/AA/knb-lter-gce.181.2',
'doi:10.6073/AA/knb-lter-gce.57.14',
'doi:10.6073/AA/knb-lter-gce.50.5',
'doi:10.6073/AA/knb-lter-gce.66.3',
'doi:10.6073/AA/knb-lter-gce.83.5',
'doi:10.6073/AA/knb-lter-gce.71.12',
'doi:10.6073/AA/knb-lter-gce.80.6',
'doi:10.6073/AA/knb-lter-gce.87.4',
'doi:10.6073/AA/knb-lter-gce.87.5',
'doi:10.6073/AA/knb-lter-gce.97.4',
'doi:10.6073/AA/knb-lter-gce.8.3',
'doi:10.6073/AA/knb-lter-kbs.35.6',
'doi:10.6073/AA/knb-lter-kbs.38.2',
'doi:10.6073/AA/knb-lter-kbs.39.2'
)
and id.docid||'.'||id.rev = log.docid
order by guid, date_logged;
I'm not sure how best to restore the missing xml_revisions entries on UCSB.
History
#1 Updated by Ben Leinfelder over 12 years ago
- Status changed from New to In Progress
#2 Updated by Ben Leinfelder over 12 years ago
- Parent task changed from #3138 to #3136
#3 Updated by Ben Leinfelder over 12 years ago
- Status changed from In Progress to Closed
Manually forced replication for all 18 of these documents using curl, for example:
sudo curl --capath /etc/ssl/certs --cert /etc/dataone/client/certs/cn-orc-1.dataone.org.pem --key /etc/dataone/client/private/cn-orc-1.dataone.org.key "https://cn-ucsb-1.dataone.org/knb/servlet/replication?server=cn-orc-1.dataone.org/knb/servlet/replication&action=forcereplicate&docid=autogen.2012062921414281310.1"
We have all docs on UCSB now.
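For reference, repeating the forced replication for a batch of docids could be scripted roughly as below. This is only a sketch, not what was actually run: the hosts, certificate paths, and URL format are copied from the curl command above, while `docids.txt`, `build_url`, `force_replicate`, and the `DRY_RUN` switch are hypothetical names introduced for illustration. It assumes docids contain no characters that need URL-encoding, and it defaults to printing the commands rather than executing them.

```shell
#!/bin/sh
# Sketch: force replication of a list of docids from cn-orc-1 to cn-ucsb-1.
# Hosts and certificate paths are taken from the manual curl command;
# everything else (file name, function names, DRY_RUN) is an assumption.

TARGET="cn-ucsb-1.dataone.org"   # node that is missing the documents
SOURCE="cn-orc-1.dataone.org"    # node that still has them
CERT="/etc/dataone/client/certs/cn-orc-1.dataone.org.pem"
KEY="/etc/dataone/client/private/cn-orc-1.dataone.org.key"

# Build the forcereplicate URL for one docid ($1).
build_url() {
  printf 'https://%s/knb/servlet/replication?server=%s/knb/servlet/replication&action=forcereplicate&docid=%s' \
    "$TARGET" "$SOURCE" "$1"
}

# Print (DRY_RUN=1, the default) or execute (DRY_RUN=0) the request for one docid.
force_replicate() {
  url=$(build_url "$1")
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf 'curl --capath /etc/ssl/certs --cert %s --key %s "%s"\n' "$CERT" "$KEY" "$url"
  else
    # Reading the private key may require root, as with the manual command.
    curl --capath /etc/ssl/certs --cert "$CERT" --key "$KEY" "$url"
  fi
}

# docids.txt is a hypothetical file with one docid per line.
DOCID_FILE="${DOCID_FILE:-docids.txt}"
if [ -r "$DOCID_FILE" ]; then
  while IFS= read -r docid; do
    if [ -n "$docid" ]; then
      force_replicate "$docid"
    fi
  done < "$DOCID_FILE"
fi
```

Running it with the default `DRY_RUN=1` first makes it possible to review the generated commands before sending any request to the replication servlet.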