Project

General

Profile

Task #635

Select the monitoring framework and design the deployment of the selected framework

Added by Dave Vieglais over 14 years ago. Updated over 14 years ago.

Status:
Closed
Priority:
Low
Assignee:
Category:
Documentation
Target version:
Start date:
Due date:
% Done:

100%

Milestone:
None
Product Version:
*
Story Points:
Sprint:

Description

Select the monitoring framework and design the deployment of the selected framework, including gathering the necessary information for configuration.

Nagios seems like an excellent candidate for this, along with Cacti and Munin. Relevant email message from Nick:


I'm using a combination of Nagios, Munin, and Cacti to do monitoring and stats collecting. My Nagios setup is doing service and status checks and sending out alerts, while Cacti and Munin are doing graphing and stats.

I'm using Munin because it's dead simple to get running on a machine and has a good number of default plugins, while Cacti does the more advanced data collection. I have Cacti collecting data from several hundred switch ports through SNMP, and running some custom Bash and Python scripts to get data from some hardware without any good plugins or SNMP access.

While I don't have any experience collecting data with Nagios, I definitely recommend running Cacti along with it. Cacti is extremely customizable, and it's pretty easy to have a certain set of graphs display for certain users. It also has the ability to zoom in on graphs, and download data in csv format.

Here are some examples (the public username:password is visitor:NITscoc):

Cacti, with a limited set of graphs:
http://raptor.nceas.ucsb.edu/cacti/

Munin, with stats for the UCSB dataone server:
http://raptor.nceas.ucsb.edu/munin/nceas.ucsb.edu/dataone.nceas.ucsb.edu.html

Nagios alerts go out by email and IRC (check out #sysnotify our IRC server). I have the admin web interface running, but it's not currently set up for anyone except admin users.

I have all three installed from the Ubuntu-supported repositories on Ubuntu 8.04, so they're two years old at this point. The new versions probably look nicer.


History

#1 Updated by Rob Nahf over 14 years ago

First pass analysis two main options for Nagios, Nagios Core OSS & Nagios XI. XI is an enterprise bundle: pros - integrated visualizations, packaged as a VM, configuration wizards; cons: cost is $1300 for first year, perpetual license though). Sustainability points us towards the Core OSS (free open-source option). NagVis offers more monitoring graphs, also open-source(www.nagvis.org)

#2 Updated by Rob Nahf over 14 years ago

As with metrics, specifications will be in the file monitoringSpecifications.txt (documents > Projects > cicore > development) in the repository. Nagios seems to be for real-time monitoring, Cacti for long-term reporting, and Munin to fill in gaps.

#3 Updated by Rob Nahf over 14 years ago

Documenting which metrics go with which system (Nagios, Munin, Cacti) will be easier after packages installed, so keeping 1 hour on time remaining until after installation work done.

#4 Updated by Rob Nahf over 14 years ago

  • Status changed from Closed to 4

#5 Updated by Rob Nahf over 14 years ago

accidentally closed the ticket...

#6 Updated by Rob Nahf over 14 years ago

committed finished documentation to the repository under documents/Projects/cicore/operations/monitoring/. Includes specifications, analysis, install and config, as well as some config files.

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 14.8 MB)