git.michaelhowe.org Git - packages/a/afs-monitor.git/log
Russ Allbery [Fri, 2 Apr 2004 16:18:29 +0000 (16:18 +0000)]
Up the network timeout limit to five minutes, since afssvr11 is still
timing out every night.

Russ Allbery [Thu, 25 Mar 2004 06:52:06 +0000 (06:52 +0000)]
Check for vos partinfo failing.

Russ Allbery [Thu, 25 Mar 2004 06:19:52 +0000 (06:19 +0000)]
Initial version.

Russ Allbery [Thu, 25 Mar 2004 05:45:05 +0000 (05:45 +0000)]
Increased the timeout further to 120 seconds.  volserver is sometimes just
really slow.

Russ Allbery [Thu, 25 Mar 2004 02:50:26 +0000 (02:50 +0000)]
Change the default timeout for the AFS space check to a minute, since the
volserver is single-threaded and sometimes doesn't respond quickly.

Russ Allbery [Thu, 25 Mar 2004 01:09:34 +0000 (01:09 +0000)]
Fix the SYNOPSIS in the documentation to have the correct flag to get the
version.

Russ Allbery [Thu, 25 Mar 2004 00:55:18 +0000 (00:55 +0000)]
Completely rewritten to avoid keeping any state.  The monitor now checks
all of the output from bos status against a set of known-okay regexes and
throws an alert if there's any line in the bos output that isn't okay.
This means that this check will no longer catch a server restart that
successfully completed before the probe ran, but on the plus side it also
won't throw additional errors when the file server has come back up (since
the correct output is still different than the old incorrect output).

Also redid the coding style, added real option parsing, required the
standard Nagios -H option, and added full documentation.
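
The stateless check described above can be sketched roughly as follows. This is a minimal illustration in Python, not the plugin's actual code: the pattern list, the function names, and the status strings are assumptions made for the example, and the real set of known-okay regexes is not shown in this log.

```python
import re

# Hypothetical known-okay patterns (an assumption for illustration only;
# the plugin's real list is not reproduced in this log).
OK_PATTERNS = [
    re.compile(r"^Instance \S+, currently running normally\.$"),
    re.compile(r"^\s+Auxiliary status is: file server running\.$"),
]


def unexpected_lines(output, ok_patterns=OK_PATTERNS):
    """Return every non-blank line that matches none of the known-okay patterns."""
    bad = []
    for line in output.splitlines():
        if not line.strip():
            continue  # blank lines carry no status information
        if not any(p.search(line) for p in ok_patterns):
            bad.append(line.strip())
    return bad


def check(output):
    """Map captured bos status output to a (Nagios exit code, message) pair."""
    bad = unexpected_lines(output)
    if bad:
        return 2, "BOS CRITICAL - " + "; ".join(bad)
    return 0, "BOS OK"
```

Because nothing is remembered between runs, a restart that finished before the probe leaves only okay lines behind and passes unnoticed, which is the trade-off the commit message describes.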

Russ Allbery [Thu, 25 Mar 2004 00:47:52 +0000 (00:47 +0000)]
Fix a few very minor issues in the documentation.

Russ Allbery [Wed, 24 Mar 2004 23:56:49 +0000 (23:56 +0000)]
Extensively reworked to do regular option parsing, support -h and -V
options, support configuration of the critical and warning levels, support
a timeout value, provide a bit of information for okay results, and add
complete documentation.

Russ Allbery [Wed, 24 Mar 2004 22:43:35 +0000 (22:43 +0000)]
Print errors rather than using warn since Nagios wants things on standard
output.  Fix the initial comment.
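
As a rough sketch of the convention this change follows: a Nagios plugin writes its status line to standard output and signals severity through its exit code. The Python below is illustrative only; the helper name `report` and the message text are assumptions, while the four exit codes are the standard Nagios plugin values.

```python
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def report(code, message):
    """Write a one-line status to standard output and return the exit code.

    The message goes to stdout rather than stderr because Nagios reads
    the plugin's standard output for the status text.
    """
    print(message, file=sys.stdout)
    return code


# Typical use at the end of a plugin (hypothetical message):
#     sys.exit(report(CRITICAL, "AFS CRITICAL - cannot contact server"))
```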

Russ Allbery [Wed, 24 Mar 2004 20:10:12 +0000 (20:10 +0000)]
Reorganized extensively, simplified the code a little bit, simplified and
shortened the output since Nagios will add the relevant host information
itself on errors, added full documentation, added better options parsing,
and made the output and options comply better with the Nagios plugin
standards.  Added a timeout option.

Russ Allbery [Fri, 13 Feb 2004 01:08:27 +0000 (01:08 +0000)]
Don't use -allconn -rxstats.  We aren't paying any attention to the
statistics, just the mode of the connection, and -allconn just adds in the
(thousands of) idle connections.  Let's assume that an idle connection
can't also be blocked.

Russ Allbery [Fri, 19 Dec 2003 05:33:29 +0000 (05:33 +0000)]
Don't use the -long flag when checking afsdb servers so that we don't get
noise from the nightly kaserver restarts.

Quanah Gibson-Mount [Sat, 13 Dec 2003 01:20:28 +0000 (01:20 +0000)]
Fix bug in warning bits

Quanah Gibson-Mount [Sat, 13 Dec 2003 01:19:04 +0000 (01:19 +0000)]
Remove \n's from critical/warning arrays, so all problem partitions (if there
is more than one) will print out on the nagios status page
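
A minimal sketch of the idea, in Python rather than the plugin's own language: strip the newline from each entry and join everything onto one status line, presumably because the status page displays only a single line of plugin output. The function name, entry format, and message prefixes here are assumptions for illustration.

```python
def summarize(critical, warning):
    """Build a single-line (exit code, message) pair from problem partitions."""
    # Strip stray newlines so every entry survives on the one displayed line.
    crit = [entry.strip() for entry in critical]
    warn = [entry.strip() for entry in warning]
    if crit:
        return 2, "CRITICAL: " + ", ".join(crit + warn)
    if warn:
        return 1, "WARNING: " + ", ".join(warn)
    return 0, "OK"
```

With the newlines left in, only the first problem partition would appear; joined this way, all of them do.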

Quanah Gibson-Mount [Sat, 13 Dec 2003 00:29:08 +0000 (00:29 +0000)]
Print out percentages instead

Quanah Gibson-Mount [Sat, 13 Dec 2003 00:22:22 +0000 (00:22 +0000)]
Print out partition data in a nice format

Quanah Gibson-Mount [Fri, 12 Dec 2003 23:56:54 +0000 (23:56 +0000)]
Add -H opt, strip out for loop

Quanah Gibson-Mount [Fri, 12 Dec 2003 23:47:30 +0000 (23:47 +0000)]
Fully working nagios style check_afsspace check

Quanah Gibson-Mount [Fri, 12 Dec 2003 23:32:14 +0000 (23:32 +0000)]
First pass at making this look like a nagios plugin:

-w,-c,-H options all added
Print out full partition table if no errors found (like check_disk)

Quanah Gibson-Mount [Fri, 12 Dec 2003 00:54:01 +0000 (00:54 +0000)]
Add in timeout flag (essentially to fool nagios)

Quanah Gibson-Mount [Fri, 12 Dec 2003 00:07:56 +0000 (00:07 +0000)]
check_bos command for AFS servers

Quanah Gibson-Mount [Thu, 11 Dec 2003 23:59:47 +0000 (23:59 +0000)]
rxdebug check for AFS

Quanah Gibson-Mount [Thu, 11 Dec 2003 23:45:27 +0000 (23:45 +0000)]
Remove reliance on pubsw perl for both
use local vos for afsspace