Module: check_mk
Branch: master
Commit: 3113d9985d346e3380d9a7a81f476b6f11c80fd2
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=3113d9985d346e…
Author: Lars Michelsen <lm(a)mathias-kettner.de>
Date: Mon Jul 24 16:59:19 2017 +0200
5038 FIX Datasource programs: Prevent zombie processes in case of timeouts
When datasource programs (such as special agents or SSH command line calls) are
used to fetch agent data from hosts, these programs can remain behind as zombie
processes. Check_MK has been extended to handle this situation and clean up
these processes.
Details: When the program execution takes too long, Check_MK sends a SIGTERM to
the process group of the executed program. After sending the signal, Check_MK
now waits for the process to finish, which collects its exit status.
Change-Id: I9f2dbfa0839f8089bcc86bf67ec270aedf7adf3a
---
.werks/5038 | 18 ++++++++++++++++++
cmk_base/agent_data.py | 1 +
2 files changed, 19 insertions(+)
diff --git a/.werks/5038 b/.werks/5038
new file mode 100644
index 0000000..8e17bb7
--- /dev/null
+++ b/.werks/5038
@@ -0,0 +1,18 @@
+Title: Datasource programs: Prevent zombie processes in case of timeouts
+Level: 1
+Component: core
+Class: fix
+Compatible: compat
+Edition: cre
+State: unknown
+Version: 1.5.0i1
+Date: 1500907925
+
+When datasource programs (such as special agents or SSH command line calls) are
+used to fetch agent data from hosts, these programs can remain behind as zombie
+processes. Check_MK has been extended to handle this situation and clean up
+these processes.
+
+Details: When the program execution takes too long, Check_MK sends a SIGTERM to
+the process group of the executed program. After sending the signal, Check_MK
+now waits for the process to finish, which collects its exit status.
diff --git a/cmk_base/agent_data.py b/cmk_base/agent_data.py
index 456bf5d..5183715 100644
--- a/cmk_base/agent_data.py
+++ b/cmk_base/agent_data.py
@@ -812,6 +812,7 @@ def get_agent_info_program(commandline):
         # On timeout exception try to stop the process to prevent child process "leakage"
         if p:
             os.killpg(os.getpgid(p.pid), signal.SIGTERM)
+            p.wait()
         raise
     except Exception, e:
         raise MKAgentError("Could not execute '%s': %s" % (exepath, e))
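
The pattern described in werk 5038 (signal the process group on timeout, then
wait for the child) can be sketched in isolation. This is a minimal Python 3
sketch, not the code from cmk_base/agent_data.py: the function name
run_with_timeout is hypothetical, the timeout built into subprocess stands in
for Check_MK's own timeout handling, and starting the child in a new session is
an assumption made here so that it has its own process group to signal.

```python
import os
import signal
import subprocess


def run_with_timeout(commandline, timeout):
    """Run a shell command; on timeout, SIGTERM its process group and reap it.

    Hypothetical helper for illustration only (not part of Check_MK).
    """
    p = subprocess.Popen(
        commandline,
        shell=True,
        stdout=subprocess.PIPE,
        stderr=subprocess.DEVNULL,
        # Assumption: give the child its own session and process group so
        # that killpg() below does not signal the calling process as well.
        start_new_session=True,
    )
    try:
        stdout, _ = p.communicate(timeout=timeout)
    except subprocess.TimeoutExpired:
        # Stop the program and anything it started in the same group.
        os.killpg(os.getpgid(p.pid), signal.SIGTERM)
        # Collect the exit status; without this the terminated child
        # stays in the process table as a zombie.
        p.wait()
        raise
    return p.returncode, stdout
```

A call such as run_with_timeout("sleep 30", 0.2) raises
subprocess.TimeoutExpired, and by the time the exception reaches the caller the
child has already been reaped.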
Module: check_mk
Branch: master
Commit: 8ce9c073804494e19a284fa67a8e53a7b41daba5
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=8ce9c073804494…
Author: Lars Michelsen <lm(a)mathias-kettner.de>
Date: Mon Jul 24 17:02:47 2017 +0200
5039 Livestatus proxy: Connection parameters can now be configured globally
The connection parameters of the Livestatus Proxy can now be configured
globally to specify a default set of parameters for all sites. Each site
can now either be configured to use the global defaults or set its own
parameters.
For most environments it is sufficient to configure the global defaults
and make all sites use the defaults.
Using the global defaults makes it a lot easier to maintain the settings
since you don't need to configure the settings for each site individually
anymore.
Change-Id: I718aa6ccc6c5a7c4044f304f808c08178f8ac2e7
---
.werks/5039 | 20 +++++++
web/htdocs/wato.py | 44 +++++++-------
web/htdocs/watolib.py | 157 ++++++++++++--------------------------------------
3 files changed, 80 insertions(+), 141 deletions(-)
Diff: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commitdiff;h=8ce9c07380…
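
The choice described in werk 5039 (a site either uses the global defaults or
sets its own parameters) can be sketched as a small lookup. This is a
hypothetical Python illustration, not the code from wato.py or watolib.py: the
function name effective_proxy_params and the parameter keys are invented here,
and representing "use the global defaults" as None is an assumption.

```python
def effective_proxy_params(site_params, global_defaults):
    """Return the connection parameters that apply to one site.

    Hypothetical helper: None means "use the global defaults", anything
    else is treated as the site's own complete parameter set.
    """
    if site_params is None:
        return dict(global_defaults)
    return dict(site_params)


# Hypothetical parameter names, used only to make the example concrete.
GLOBAL_DEFAULTS = {"connect_timeout": 2.0, "heartbeat_interval": 5}

site_a = effective_proxy_params(None, GLOBAL_DEFAULTS)
site_b = effective_proxy_params({"connect_timeout": 10.0, "heartbeat_interval": 30},
                                GLOBAL_DEFAULTS)
```

In this sketch a site's own parameters replace the defaults as a whole rather
than being merged with them, which matches the either/or wording of the werk.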
Module: check_mk
Branch: master
Commit: a1c795a865aaa5115bfb49bbdb936c7f719d8f40
URL: http://git.mathias-kettner.de/git/?p=check_mk.git;a=commit;h=a1c795a865aaa5…
Author: Mathias Kettner <mk(a)mathias-kettner.de>
Date: Tue Jul 25 11:00:52 2017 +0200
5068 Add error message to report in case some sites are not reachable
When you create a report in a distributed environment with multiple sites
and not all of the sites are reachable while the report is being created,
the data of those sites was simply missing without any further notice.
Now an error message is added at the end of the report if there
was a problem with at least one site. That way the reader knows that
the data might be incomplete.
Change-Id: I78704f1f935f7d3e46a1066a19c81518289f70fc
---
.werks/5068 | 17 +++++++++++++++++
1 file changed, 17 insertions(+)
diff --git a/.werks/5068 b/.werks/5068
new file mode 100644
index 0000000..4330145
--- /dev/null
+++ b/.werks/5068
@@ -0,0 +1,17 @@
+Title: Add error message to report in case some sites are not reachable
+Level: 1
+Component: reporting
+Compatible: compat
+Edition: cee
+Version: 1.5.0i1
+Date: 1500973145
+Class: feature
+
+When you create a report in a distributed environment with multiple sites
+and not all of the sites are reachable while the report is being created,
+the data of those sites was simply missing without any further notice.
+
+Now an error message is added at the end of the report if there
+was a problem with at least one site. That way the reader knows that
+the data might be incomplete.
+