documentation of error levels
Message-ID: <51ed4960.vstmQGYuYtXNeTId%bs(a)mathias-kettner.de>
User-Agent: Heirloom mailx 12.4 7/29/08
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Module: check_mk
Branch: master
Commit: 01b0c9a110bfb6f9425c161d86949da043abe734
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=01b0c9a110bfb6…
Author: Bernd Stroessenreuther <bs(a)mathias-kettner.de>
Date: Mon Jul 22 17:01:44 2013 +0200
check manpage for ibm_rsa_health: documentation of error levels
---
checkman/ibm_rsa_health | 7 ++-----
1 file changed, 2 insertions(+), 5 deletions(-)
diff --git a/checkman/ibm_rsa_health b/checkman/ibm_rsa_health
index 200e5ea..2f7db99 100644
--- a/checkman/ibm_rsa_health
+++ b/checkman/ibm_rsa_health
@@ -4,16 +4,13 @@ catalog: hw/server/ibm
license: GPL
distribution: check_mk
description:
- BERND, FIXME: Ask Mathias in which cases WARN and CRIT are issued.
-
Checks systems health via IBM Remote Supervisor Adapter (RSA) by SNMP.
The system health includes health of fans, power supplys, etc.
Returns {OK} if all of the included devices report OK status.
- Returns {WARN} if ...
-
- Returns {CRIT} if ...
+ Returns {WARN} if the RSA reports a non critical problem and {CRIT} on
+ a system level problem.
inventory:
Returns one check per RSA.
Module: check_mk
Branch: master
Commit: 4c057669c6e2de60cb87b400e8875da65a2c4512
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=4c057669c6e2de…
Author: Bernd Stroessenreuther <bs(a)mathias-kettner.de>
Date: Mon Jul 22 16:54:37 2013 +0200
initial version of check manpage for statgrab_cpu and statgrab_load
---
checkman/statgrab_cpu | 7 +++++--
checkman/statgrab_load | 41 +++++++++++++++++++++++++++++++++--------
2 files changed, 38 insertions(+), 10 deletions(-)
diff --git a/checkman/statgrab_cpu b/checkman/statgrab_cpu
index 1a74d29..fbc06b6 100644
--- a/checkman/statgrab_cpu
+++ b/checkman/statgrab_cpu
@@ -5,9 +5,12 @@ license: GPL
distribution: check_mk
description:
This check measures the CPU utilization during the last check
- cycle. A level can be set only on {wait} (disk wait).
+ cycle.
- BERND: Hinweis auf statgrab-Plugin (HP) bzw. statgrab-Befehl (beide).
+ Make sure you have the statgrab binary available on the monitored
+ machine. Under HP-UX you also need to activate the hpux_statgrab agent plugin.
+
+ The check allways returns {OK} status.
perfdata:
Three variables: The percentage of time spent in {user}, {system} and {wait}
diff --git a/checkman/statgrab_load b/checkman/statgrab_load
index 1896dbb..009e4a9 100644
--- a/checkman/statgrab_load
+++ b/checkman/statgrab_load
@@ -4,16 +4,41 @@ catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This check measures the CPU utilization during the last check
- cycle. A level can be set only on {wait} (disk wait).
+ This check measures and checks the averaged CPU load during the last check
+ cycle by using the statgrab tool of the operating system. The values
+ for 1, 5 and 15 minute average are sent, although the PNP template
+ shipped with check_mk only displays the 1 and 15 min average load.
- BERND: Hinweis auf statgrab-Plugin (HP) bzw. statgrab-Befehl (beide).
+ Make sure you have the statgrab binary available on the monitored
+ machine. Under HP-UX you also need to activate the hpux_statgrab agent plugin.
+ Note: The CPU load is the average number of processes that are
+ currently in the state "running". Do not mix this up with the
+ CPU "utiliziation" (which measures the current usage of the CPU
+ in percent).
+
+examples:
+ # Set default levels to a load of 4 and 8
+ cpuload_default_levels = (4.0, 8.0).
+
+ checks += [
+ # Use levels 4 (warning) and 8 (critical)
+ # on all hosts with the tag "unix"
+ (["unix"], ALL_HOSTS, "statgrab_load", None, (4, 8)),
+ ]
perfdata:
- Three variables: The percentage of time spent in {user}, {system} and {wait}
- (disk wait). The idle time can be computed by substracting those three values
- from 100%. The user value contains also {nice}.
+ Three values: The load average for 1, 5 and 15 minutes.
inventory:
- One check per host is created if the agent contains a {statgrab_cpu}
- section containing cpu information.
+ One check per host is created if the agent contains a {statgrab_load}
+ section.
+
+[parameters]
+warning (int): 15 minute average load at which a {WARN} is triggered.
+critical (int): 15 minute average load at which a {CRIT} state is triggered.
+
+[configuration]
+cpuload_default_levels (int, int): A pair of floats or integers: The default
+ levels. These levels are used for inventorized checks. This variable
+ is preset to {(5, 10)}
+
Module: check_mk
Branch: master
Commit: 09fdb7ab46a8cef619097dcec2f0efa847b3eb33
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=09fdb7ab46a8ce…
Author: Lars Michelsen <lm(a)mathias-kettner.de>
Date: Mon Jul 22 16:42:57 2013 +0200
FIX: automation users can now be deleted again (bug was introduced in 1.2.2p1)
---
ChangeLog | 1 +
web/htdocs/userdb.py | 8 ++++++++
2 files changed, 9 insertions(+)
diff --git a/ChangeLog b/ChangeLog
index b168a8d..3232296 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -273,6 +273,7 @@
* FIX: fix layout of Auxiliary tags table
* FIX: avoid exception when called first time and first page ist host tags
* FIX: fix validation of time-of-day input field (24:00)
+ * FIX: automation users can now be deleted again (bug was introduced in 1.2.2p1)
1.2.2p2:
diff --git a/web/htdocs/userdb.py b/web/htdocs/userdb.py
index 73d7c90..1b60826 100644
--- a/web/htdocs/userdb.py
+++ b/web/htdocs/userdb.py
@@ -394,6 +394,14 @@ def save_users(profiles):
# entry = dir + "/" + e
# if os.path.isdir(entry):
# shutil.rmtree(entry)
+ # But for the automation.secret this is ok, since automation users are not
+ # created by other sources in common cases
+ dir = defaults.var_dir + "/web"
+ for user_dir in os.listdir(defaults.var_dir + "/web"):
+ if user_dir not in ['.', '..'] and user_dir not in profiles:
+ entry = dir + "/" + user_dir
+ if os.path.isdir(entry) and os.path.exists(entry + '/automation.secret'):
+ os.unlink(entry + '/automation.secret')
# Release the lock to make other threads access possible again asap
# This lock is set by load_users() only in the case something is expected
Module: check_mk
Branch: master
Commit: 224be905641ff522f27194649614e3d9113d6afc
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=224be905641ff5…
Author: Bernd Stroessenreuther <bs(a)mathias-kettner.de>
Date: Mon Jul 22 16:20:22 2013 +0200
initial version of check manpage for ibm_rsa_health and lnx_thermal
---
checkman/ibm_rsa_health | 14 +++++++++++++-
checkman/lnx_thermal | 23 ++++++++++++++++++++++-
2 files changed, 35 insertions(+), 2 deletions(-)
diff --git a/checkman/ibm_rsa_health b/checkman/ibm_rsa_health
index ae4ca9e..200e5ea 100644
--- a/checkman/ibm_rsa_health
+++ b/checkman/ibm_rsa_health
@@ -4,4 +4,16 @@ catalog: hw/server/ibm
license: GPL
distribution: check_mk
description:
- BERND
+ BERND, FIXME: Ask Mathias in which cases WARN and CRIT are issued.
+
+ Checks systems health via IBM Remote Supervisor Adapter (RSA) by SNMP.
+ The system health includes health of fans, power supplys, etc.
+
+ Returns {OK} if all of the included devices report OK status.
+
+ Returns {WARN} if ...
+
+ Returns {CRIT} if ...
+
+inventory:
+ Returns one check per RSA.
diff --git a/checkman/lnx_thermal b/checkman/lnx_thermal
index 0e14d3b..9aed910 100644
--- a/checkman/lnx_thermal
+++ b/checkman/lnx_thermal
@@ -4,4 +4,25 @@ catalog: os/hardware
license: GPL
distribution: check_mk
description:
- BERND: verwendet /proc/sys/wasweisich...
+ This check uses the temperature information provided by the kernel in
+ /sys/class/thermal/thermal_zone* and reports values in °C
+
+ The measured value is matched against levels delivered by the kernel.
+ Returns {CRIT} if the kernel reports {hot} or {critical} state,
+ {WARN} on any other non ok state and {OK} if the temperature is
+ considered to be within normal operation temperature levels.
+
+item:
+ The ID of the thermal zone
+
+examples:
+ # adding a check for /sys/class/thermal/thermal_zone0
+ checks += [
+ ( "localhost", "lnx_thermal", "0", None )
+ ]
+
+perfdata:
+ One value: The current temperature in °C
+
+inventory:
+ Creates one check for earch thermal zone.
Module: check_mk
Branch: master
Commit: 8176cdc5cad335faa5f4c5638871a6aa7af60357
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=8176cdc5cad335…
Author: Mathias Kettner <mk(a)mathias-kettner.de>
Date: Mon Jul 22 11:55:10 2013 +0200
Fix problem with outage_statistics in availability
---
web/plugins/views/availability.py | 1 +
1 file changed, 1 insertion(+)
diff --git a/web/plugins/views/availability.py b/web/plugins/views/availability.py
index 2b4f147..df09db3 100644
--- a/web/plugins/views/availability.py
+++ b/web/plugins/views/availability.py
@@ -334,6 +334,7 @@ def render_availability_options():
# Make sure that parameters are set that have not been present in the
# original version. This code can be dropped in a couple of years.
avoptions.setdefault("notification_period", "honor")
+ avoptions.setdefault("outage_statistics", ([], []))
is_open = False
html.begin_form("avoptions")
Module: check_mk
Branch: master
Commit: c56e37d679c0a319e1ce02776b650ea81e9e89e1
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=c56e37d679c0a3…
Author: Bernd Stroessenreuther <bs(a)mathias-kettner.de>
Date: Mon Jul 22 11:43:15 2013 +0200
initial version of check manpage for hp_proliant and hpux_tunables.*
---
checkman/hp_proliant | 9 +++------
checkman/hpux_tunables.maxfiles_lim | 19 ++++++++++++++-----
checkman/hpux_tunables.nkthread | 20 +++++++++++++++-----
checkman/hpux_tunables.nproc | 20 +++++++++++++++-----
checkman/hpux_tunables.semmni | 20 +++++++++++++++-----
checkman/hpux_tunables.semmns | 20 +++++++++++++++-----
checkman/hpux_tunables.shmseg | 19 +++++++++++++++----
7 files changed, 92 insertions(+), 35 deletions(-)
diff --git a/checkman/hp_proliant b/checkman/hp_proliant
index 7bcad3d..fe03fcd 100644
--- a/checkman/hp_proliant
+++ b/checkman/hp_proliant
@@ -1,15 +1,12 @@
-title: HP Proliant servers: general status
+title: HP Proliant servers: General Status
agents: snmp
catalog: hw/server/hp
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
-
This check connects to the SNMP agent installed on an operating system
- on a HP Proliant server and reads the information from the {CPQHLTH-MIB}
- MIB. In addition to the general state it also returns the serial number
- and firmware version.
+ on a HP Proliant server and reads the general status information.
+ In addition it also returns the serial number and firmware version.
The status is {OK} if the agent signals {Unknown} or {Ok}, {WARN} on
{Degraded} and {CRIT} on {Failed}.
diff --git a/checkman/hpux_tunables.maxfiles_lim b/checkman/hpux_tunables.maxfiles_lim
index 38824d5..8b89985 100644
--- a/checkman/hpux_tunables.maxfiles_lim
+++ b/checkman/hpux_tunables.maxfiles_lim
@@ -1,13 +1,22 @@
-title: Kernel tunables: Maximum file limit
+title: Kernel tunables: Number of open files
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
- Title correct?
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current number of open files (maxfiles_lim). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
diff --git a/checkman/hpux_tunables.nkthread b/checkman/hpux_tunables.nkthread
index 83d7d60..ce1291e 100644
--- a/checkman/hpux_tunables.nkthread
+++ b/checkman/hpux_tunables.nkthread
@@ -1,13 +1,23 @@
-title: Kernel tunables: nkthread
+title: Kernel tunables: Number of threads
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
- Title to be improved
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current number of threads (nkthread). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
+
diff --git a/checkman/hpux_tunables.nproc b/checkman/hpux_tunables.nproc
index 1624158..9794752 100644
--- a/checkman/hpux_tunables.nproc
+++ b/checkman/hpux_tunables.nproc
@@ -1,13 +1,23 @@
-title: Kernel tunables: nproc
+title: Kernel tunables: Number of Processes
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
- Title to be improved
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current Number of processes (npoc). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
+
diff --git a/checkman/hpux_tunables.semmni b/checkman/hpux_tunables.semmni
index 2ebcb6c..ea77b39 100644
--- a/checkman/hpux_tunables.semmni
+++ b/checkman/hpux_tunables.semmni
@@ -1,13 +1,23 @@
-title: Kernel tunables: semmni
+title: Kernel tunables: Number of IPC Semaphore IDs
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
- Title to be improved
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current number of IPC semaphore IDs (semmni). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
+
diff --git a/checkman/hpux_tunables.semmns b/checkman/hpux_tunables.semmns
index 31350c0..07426b4 100644
--- a/checkman/hpux_tunables.semmns
+++ b/checkman/hpux_tunables.semmns
@@ -1,13 +1,23 @@
-title: Kernel tunables: semmns
+title: Kernel tunables: Number of IPC Semaphores
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
- Title to be improved
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current number of IPC semaphores (semmns). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
+
diff --git a/checkman/hpux_tunables.shmseg b/checkman/hpux_tunables.shmseg
index 1f57f8d..017dbd7 100644
--- a/checkman/hpux_tunables.shmseg
+++ b/checkman/hpux_tunables.shmseg
@@ -1,12 +1,23 @@
-title: Kernel tunables: Shared memory segments
+title: Kernel tunables: Number of Shared Memory Segments
agents: hpux
catalog: os/kernel
license: GPL
distribution: check_mk
description:
- This is a copy only, FIXME, TODO: BERND
-
The check uses the output of {kcusage -l} for monitoring the
- current usage of shared memory segments. Per default this
+ current number of shared memory segments (semmns). Per default this
check is always {OK}, but you can set levels for the used
percentage.
+
+ See check manpage for hpux_tunables too.
+
+perfdata:
+ See check manpage for hpux_tunables.
+
+inventory:
+ See check manpage for hpux_tunables.
+
+[parameters]
+warn(float): See check manpage for hpux_tunables.
+crit(float): See check manpage for hpux_tunables.
+
Module: check_mk
Branch: master
Commit: f57fbff1275fb9a5e760c86d8685f641864f8fb4
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=f57fbff1275fb9…
Author: Mathias Kettner <mk(a)mathias-kettner.de>
Date: Mon Jul 22 11:40:14 2013 +0200
FIX: fix validation of time-of-day input field (24:00)
---
ChangeLog | 1 +
web/htdocs/valuespec.py | 10 ++++++----
2 files changed, 7 insertions(+), 4 deletions(-)
diff --git a/ChangeLog b/ChangeLog
index e149c02..b168a8d 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -272,6 +272,7 @@
WATO:
* FIX: fix layout of Auxiliary tags table
* FIX: avoid exception when called first time and first page ist host tags
+ * FIX: fix validation of time-of-day input field (24:00)
1.2.2p2:
diff --git a/web/htdocs/valuespec.py b/web/htdocs/valuespec.py
index 8129be0..2803593 100644
--- a/web/htdocs/valuespec.py
+++ b/web/htdocs/valuespec.py
@@ -1597,10 +1597,12 @@ class Timeofday(ValueSpec):
def validate_value(self, value, varprefix):
if not self._allow_empty and value == None:
raise MKUserError(varprefix, _("Please enter a time."))
- if self._allow_24_00 and value > (24, 0):
- raise MKUserError(varprefix, _("The time must not be greater than 24:00."))
- elif not self._allow_empty and value > (23, 59):
- raise MKUserError(varprefix, _("The time must not be greater than 23:59."))
+ if self._allow_24_00:
+ max_value = (24, 0)
+ else:
+ max_value = (23, 59)
+ if value > max_value:
+ raise MKUserError(varprefix, _("The time must not be greater than %02d:%02d." % max_value))
elif value[0] < 0 or value[1] < 0 or value[0] > 24 or value[1] > 59:
raise MKUserError(varprefix, _("Hours/Minutes out of range"))
ValueSpec.custom_validate(self, value, varprefix)
Module: check_mk
Branch: master
Commit: b06eec20e94dabc5b67ab7b8a850ed7664e99655
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=b06eec20e94dab…
Author: Bernd Stroessenreuther <bs(a)mathias-kettner.de>
Date: Mon Jul 22 11:02:28 2013 +0200
initial version of check manpage for bluecoat_diskcpu and bluecoat_sensors
---
checkman/bluecoat_diskcpu | 19 ++++++++++++++++++-
checkman/bluecoat_sensors | 19 ++++++++++++++++---
2 files changed, 34 insertions(+), 4 deletions(-)
diff --git a/checkman/bluecoat_diskcpu b/checkman/bluecoat_diskcpu
index e8f754a..fd4b0dd 100644
--- a/checkman/bluecoat_diskcpu
+++ b/checkman/bluecoat_diskcpu
@@ -1,6 +1,23 @@
title: Blue Coat Systems: Security Gateway: Disk and CPU utilization
+agents: snmp
catalog: hw/network/bluecoat
license: GPL
distribution: check_mk
description:
- BERND
+ The check checks CPU utilization and Disk usage on Blue Coat Security
+ Gateways by SNMP.
+
+ Returns {OK} if SNMP reports OK status for the item and {CRIT} on
+ every other status.
+
+item:
+ The item {Disk} represents the disk usage.
+
+ The item {CPU} represents the CPU utilization.
+
+perfdata:
+ One value representing the usage (for the Disk check) or utilization
+ (for the CPU check)
+
+inventory:
+ Returns two checks for each appliance (machine): {Disk} and {CPU}
diff --git a/checkman/bluecoat_sensors b/checkman/bluecoat_sensors
index ea123e0..923590d 100644
--- a/checkman/bluecoat_sensors
+++ b/checkman/bluecoat_sensors
@@ -1,7 +1,20 @@
-title: Blue Coat Systems: Security Gateway: Sensors
+title: Blue Coat Systems: Security Gateway: Hardware Sensors
+agents: snmp
catalog: hw/network/bluecoat
license: GPL
distribution: check_mk
description:
- BERND
- title to be improved
+ Checks by SNMP the hardware sensors of Blue Coat Security Gateways, such as
+ different temperature sensors, voltage levels and fan speeds.
+
+ Returns {OK} if an ok status is reported by SNMP and {CRIT} on any
+ other status.
+
+item:
+ The sensor description as returned by SNMP
+
+perfdata:
+ One number: The measured value.
+
+inventory:
+ Finds one service for each hardware sensor.