Module: check_mk
Branch: master
Commit: 46caba322a0d2a3706074025d186e88bd21d6fa2
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=46caba322a0d2a…
Author: Simon Betz <si(a)mathias-kettner.de>
Date: Thu Feb 11 15:24:23 2016 +0100
updated manpages
---
checkman/hp_msa_volume | 1 +
checkman/hp_msa_volume.df | 1 +
checkman/hp_msa_volume.io | 1 +
3 files changed, 3 insertions(+)
diff --git a/checkman/hp_msa_volume b/checkman/hp_msa_volume
index 44b13d2..16f6c5e 100644
--- a/checkman/hp_msa_volume
+++ b/checkman/hp_msa_volume
@@ -6,6 +6,7 @@ distribution: check_mk
description:
This check reports the health states of the volumes
of a HP MSA 2040 storage system.
+ To make it work you have to configure the hp_msa datasource program via WATO.
If the state of the volumes are "OK" then the service state is OK.
Otherwise the service shows the health status and reason of the volume
diff --git a/checkman/hp_msa_volume.df b/checkman/hp_msa_volume.df
index ab14435..008436a 100644
--- a/checkman/hp_msa_volume.df
+++ b/checkman/hp_msa_volume.df
@@ -5,6 +5,7 @@ license: GPL
distribution: check_mk
description:
This check monitors the disk usage of each volume of a HP MSA 2040 storage system.
+ To make it work you have to configure the hp_msa datasource program via WATO.
The levels are configurable.
Default levels are set.
diff --git a/checkman/hp_msa_volume.io b/checkman/hp_msa_volume.io
index 643e905..0fd7c46 100644
--- a/checkman/hp_msa_volume.io
+++ b/checkman/hp_msa_volume.io
@@ -5,6 +5,7 @@ license: GPL
distribution: check_mk
description:
This check monitors the IO summary of all volume of a HP MSA 2040 storage system.
+ To make it work you have to configure the hp_msa datasource program via WATO.
With the WATO rule "Discovery mode for Disk IO check" the check
monitors each volume IO.
Module: check_mk
Branch: master
Commit: b04fbb3ff02cb0daaee34e4700ee5ceddd3d98cd
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=b04fbb3ff02cb0…
Author: Sebastian Herbord <sh(a)mathias-kettner.de>
Date: Thu Feb 11 15:02:02 2016 +0100
gitbugs
---
.bugs/2177 | 7 +++++--
.bugs/2400 | 7 +++++--
2 files changed, 10 insertions(+), 4 deletions(-)
diff --git a/.bugs/2177 b/.bugs/2177
index e43860c..558356f 100644
--- a/.bugs/2177
+++ b/.bugs/2177
@@ -1,9 +1,12 @@
Title: new windows agent: eventlog monitoring should be able to process cleared logfiles
Component: checks
-State: open
+Class: bug
+State: closed
Date: 2015-07-30 16:04:54
Targetversion: 1.2.8
-Class: bug
If someone clears the event log with the event viewer, the windows agent
is unable to start from event log record 0. It insists to continue at 51212, for example.
+
+2016-02-11 15:01:36: changed state open -> closed
+This report is invalid? Clearing the event log does not reset the event id counter. If your newest record has it 42 and you clear the event console, the next record will still have the id 43. Seeking in the event log is based on that record id so clearing the event log has no effect except that check_mk may miss cleared events it hadn't read yet.
diff --git a/.bugs/2400 b/.bugs/2400
index e0021d1..0682410 100644
--- a/.bugs/2400
+++ b/.bugs/2400
@@ -1,11 +1,14 @@
Title: Windows Agent may fail to read large event logs
Component: checks
-State: open
+Class: bug
+State: done
Date: 2015-11-06 11:48:19
Targetversion: 1.2.8
-Class: bug
Due to a bug in the ReadEventLog function (kb 177199) reading large event logs
may fail. This could be worked around in the agent by using sequential read
and a manual seek to the right offset but we'd have to check the performance
implications.
+
+2016-02-11 14:57:03: changed state open -> done
+EventLog code was overhauled and should fall back to sequential reading of event logs if necessary. Unfortunately wasn't possible to reproduce this problem internally.
Module: check_mk
Branch: master
Commit: 6ebb235b65b99074173e9749b881edbf16943bd8
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=6ebb235b65b990…
Author: Andreas Boesl <ab(a)mathias-kettner.de>
Date: Thu Feb 11 14:58:17 2016 +0100
#2991 Snapshot restore: Improved robustness and logging
The restore operation tries to complete the snapshot extraction with the rest of
the data when it encounters critical errors in the crucial extract phase.
Error information is logged in the file <tt>~/var/log/web.log</tt>.
This should never happen anyway - there are several mechanisms to prevent this.
---
.werks/2991 | 12 ++++++++++++
ChangeLog | 1 +
web/htdocs/multitar.py | 25 ++++++++++++++++++++-----
3 files changed, 33 insertions(+), 5 deletions(-)
diff --git a/.werks/2991 b/.werks/2991
new file mode 100644
index 0000000..f3186ae
--- /dev/null
+++ b/.werks/2991
@@ -0,0 +1,12 @@
+Title: Snapshot restore: Improved robustness and logging
+Level: 1
+Component: wato
+Compatible: compat
+Version: 1.2.7i4
+Date: 1455198375
+Class: feature
+
+The restore operation tries to complete the snapshot extraction with the rest of
+the data when it encounters critical errors in the crucial extract phase.
+Error information is logged in the file <tt>~/var/log/web.log</tt>.
+This should never happen anyway - there are several mechanisms to prevent this.
diff --git a/ChangeLog b/ChangeLog
index c717b67..2aaafc9 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -334,6 +334,7 @@
* 2923 SNMP community in WATO is now always displayed with ******...
* 2924 WATO can now add contact groups from hosts also to services...
* 2902 Hosts can now be added by regular network scans of configured IP ranges...
+ * 2991 Snapshot restore: Improved robustness and logging...
* 2666 FIX: Fix search for global configuration variables: ignore case now
* 2715 FIX: Fixed visibility of BI aggregations in editor
* 2716 FIX: Fixed error reporting on disabled checks page in case of broken man pages
diff --git a/web/htdocs/multitar.py b/web/htdocs/multitar.py
index d635965..d020186 100644
--- a/web/htdocs/multitar.py
+++ b/web/htdocs/multitar.py
@@ -197,7 +197,7 @@ def extract_domains(tar, domains):
try:
target_dir = domain.get("prefix")
if not target_dir:
- return
+ return []
# The complete tar.gz file never fits in stringIO buffer..
tar.extract(tar_member, restore_dir)
@@ -217,6 +217,8 @@ def extract_domains(tar, domains):
except Exception, e:
return [ "%s - %s" % (domain["title"], str(e)) ]
+ return []
+
def execute_restore(domain, is_pre_restore = True):
if is_pre_restore:
@@ -227,8 +229,9 @@ def extract_domains(tar, domains):
return domain["post_restore"]()
return []
-
total_errors = []
+ logger(LOG_INFO, "Restoring snapshot: %s" % tar.name)
+ logger(LOG_INFO, "Domains: %s" % ", ".join(tar_domains.keys()))
for what, abort_on_error, handler in [
("Permissions", True, lambda domain, tar_member: check_domain(domain, tar_member)),
("Pre-Restore", True, lambda domain, tar_member: execute_restore(domain, is_pre_restore = True)),
@@ -239,9 +242,21 @@ def extract_domains(tar, domains):
errors = []
for name, tar_member in tar_domains.items():
if name in domains:
- dom_errors = handler(domains[name], tar_member)
- if dom_errors:
- errors.extend(dom_errors)
+ try:
+ dom_errors = handler(domains[name], tar_member)
+ errors.extend(dom_errors or [])
+ except Exception, e:
+ # This should NEVER happen
+ err_info = "Restore-Phase: %s, Domain: %s\nError: %s" % (what, name, format_exception())
+ errors.append(err_info)
+ logger(LOG_CRIT, err_info)
+ if abort_on_error == False:
+ # At this state, the restored data is broken.
+ # We still try to apply the rest of the snapshot
+ # Hopefully the log entry helps in identifying the problem..
+ logger(LOG_ALERT, "Snapshot restore FAILED! (possible loss of snapshot data)")
+ continue
+ break
if errors:
if what == "Permissions":
Module: check_mk
Branch: master
Commit: 64e4fd76cd18660749a53910a8948f5948e00568
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=64e4fd76cd1866…
Author: Andreas Boesl <ab(a)mathias-kettner.de>
Date: Thu Feb 11 11:39:37 2016 +0100
NetApp datasource program: updated help text
---
web/plugins/wato/datasource_programs.py | 13 ++++++-------
1 file changed, 6 insertions(+), 7 deletions(-)
diff --git a/web/plugins/wato/datasource_programs.py b/web/plugins/wato/datasource_programs.py
index 5ced95c..8dc3f61 100644
--- a/web/plugins/wato/datasource_programs.py
+++ b/web/plugins/wato/datasource_programs.py
@@ -248,13 +248,12 @@ register_rule(group,
),
title = _("Check NetApp via WebAPI"),
help = _("This rule set selects the NetApp special agent instead of the normal Check_MK Agent "
- "and allows monitoring via the NetApp API. Right now only <i>7-Mode</i> is supported, "
- "<i>Cluster Mode</i> will follow soon. Important: To make this special agent NetApp work "
- "you will have to provide two additional python files (<tt>NaServer.py</tt>, <tt>NaElement.py</tt>) "
- "from the NetApp Manageability SDK. They need to be put into the site directory "
- "into <tt>~/local/lib/python</tt>. The user requires a number of permissions for specific API classes. "
- "They are displayed if you call the agent with <tt>agent_netapp --help</tt>. The agent itself "
- "is located in the site directory under <tt>~/share/check_mk/agents/special</tt>."),
+ "and allows monitoring via the NetApp Web API. Important: To get this agent running, you need to "
+ "install two additional python files (<tt>NaServer.py</tt>, <tt>NaElement.py</tt>) from the "
+ "NetApp Manageabiltiy SDK into the <tt>~/local/lib/python</tt> directory. To access the data the "
+ "user requires permissions to several API classes. They are shown when you call the agent with "
+ "<tt>agent_netapp --help</tt>. The agent itself is located in the site directory under "
+ "<tt>~/share/check_mk/agents/special</tt>."),
match = 'first')
register_rule(group,