Project

General

Profile

Support #8692

MNDeployment #3213: University of Illinois, Chicago member node

UIC problem with replica node status

Added by Amy Forrester over 6 years ago. Updated over 6 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Target version:
-
Start date:
2018-08-31
Due date:
% Done:

100%

Story Points:
Sprint:

Description

Issue: The replication queue sent from the Coordinating Node to our Member Node stays in the requested status forever.

Discoveries / Actions:
1. Discovered that the logs are in /var/log/messages and not the tomcat logs
2. The root logger in /var/lib/tomcat/webapps/metacat/WEB-INF was changed from WARN to DEBUG
Log4j.rootlogger=DEBUG, stdout
3. Discovered that the above issue is caused by SELinux preventing java from accessing the UIC node certificate.
Aug 26 04:32:59 dataone-membernode setroubleshoot: SELinux is preventing /usr/lib/jvm/java-1.7.0-openjdk-1.7.0.151-2.6.11.1.el7_4.x86_64/jre-abrt/bin/java from getattr access on the file /var/metacat/cert/urn_node_UIC.pem
Deliverables:
1. Ade will ensure that the SELinux issue is fixed
2. Ade will contact Chris to setup another meeting, for the queue to be cleared and the replication request sent again to our node.


Related issues

Blocks Member Nodes - Task #8607: ESSDIVE: Register in Production Closed 2018-06-12

History

#1 Updated by Amy Forrester over 6 years ago

  • Blocks Task #8607: ESSDIVE: Register in Production added

#2 Updated by Amy Forrester over 6 years ago

  • % Done changed from 0 to 100
  • Status changed from New to Closed

9/5/185- per Chris

After looking at the log messages from tomcat, Ade figured out that the SELinux security rules were denying create/read/write permissions for tomcat in the /var/metacat directory (which is outside of its default allowed policies). Ade updated the SELinux policies to include this directory, and replication has begun to succeed again.

UIC currently has around .5TB of space, so Ade will monitor the usage as it fulfills replica requests over the next few days. ESS-DIVE requests will likely total around 30-50GB, but there are multiple other MNs making requests too. Ade may be able to provision more disk space.

#3 Updated by Amy Forrester over 6 years ago

9/13/18: update from Ade

As discussed, attached is the disk usage on our DataONE membernode monitored over a few days. At the moment there is a total available space of 356GB on our member node server and the status has remained same since 7th of September as indicated in the document.

per Chris: It looks like the UIC disk space is doing fine. The ESS_DIVE replicas on UIC are basically complete, with a few that have issues on the CN so won’t replicate over yet.

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 14.8 MB)