Zev Benjamin
c045644097
puppet: Run check_ntp_time against an NTP pool instead of time.mit.edu
...
MIT implemented NTP rate-limiting to defend against on-going reflection attacks,
which was causing our nagios checks to fail intermittently. When the attacks
die down or when external sites fix their NTP configurations, checking against
time.mit.edu will stop failing. However, there also isn't much of a reason to
stick with checking against a single server.
(imported from commit 2c2a1a04646b880b010cbb4b6d94016b1eccd1a0)
2014-01-06 17:30:09 -05:00
Tim Abbott
bdcc2e5c52
nagios: Set max_check_attempts to 3 for batched queue processors.
...
(imported from commit ec0ac86726cd6ff3d0fdfcfcb161d3329fca02ac)
2013-12-19 17:31:41 -05:00
Kevin Mehall
f929e51776
puppet: Make Camo Nagios check waste less bandwidth
...
Use http://www.google.com/favicon.ico instead of a 1.7MB animated gif from
imgur.
(imported from commit 94993af35bf87b0f22e6e743a9ba1cc1c5c9a78f)
2013-12-13 17:27:01 -05:00
Tim Abbott
606d8a4f9b
Add Nagios check for queue worker memory usage.
...
This is detect future memory leaks.
(imported from commit 75fd4c2ad41ea71e87a53fb33e2106c5773909d5)
2013-12-04 10:27:44 -05:00
Zev Benjamin
7af4b92b98
puppet: Rename app to prod0 in nagios
...
(imported from commit c2d1c2c06276a816ef33e057d3f859c755490cb3)
2013-11-25 11:43:16 -05:00
Zev Benjamin
9f2af6fd0d
puppet: Fix postgres_primary alias
...
(imported from commit 1cd199224e45700fac03e68c99f9d4f7d9212b45)
2013-11-25 11:43:16 -05:00
Zev Benjamin
847d4dfbca
puppet: Specify hosts for the postgres autovac_freeze check via a hostgroup
...
(imported from commit d0afc1b78015740fa9638563a5672d3400dd5002)
2013-11-23 12:08:49 -05:00
Zev Benjamin
139518ccbe
puppet: Remove postgres0 from nagios and munin configs
...
(imported from commit 6a4eb208b2a344d65d684cf904ba882a5400056d)
2013-11-23 12:06:27 -05:00
Zev Benjamin
bf8fb3c0df
puppet: Add postgres2 to nagios monitoring
...
(imported from commit 799b1304eebe49cf6d8153fb2bfd0b11a3bcab00)
2013-11-23 08:10:44 -05:00
Zev Benjamin
658972dda3
[manual] puppet: Add postgres2 to munin monitoring
...
You must run
autossh -2 -fN -M 20018 -L 5009:localhost:4949 nagios@postgres2.zulip.net
as nagios on nagios.zulip.net after deploying this commit.
(imported from commit bd8a61f99555ccf0a0010d79dbd89017aaafbb8f)
2013-11-23 08:10:44 -05:00
Tim Abbott
b50db26a18
puppet: Add monitoring for camo.
...
(imported from commit b3cf29b02de285cf860fc173183cb6f4f3a17c74)
2013-11-19 15:25:14 -05:00
acrefoot
0175440afc
[manual] fixup nagios postmaster configuration
...
(imported from commit e3c00b31bbb0ced38e62d31ae80b58e8c6374c7f)
2013-11-13 17:37:54 -05:00
acrefoot
6d38285a2e
fixup supervisor oops related to postmaster config
...
(imported from commit 8b5c39f0d13abb5e1def9f88a2ab82cfa67b42f6)
2013-11-13 17:15:55 -05:00
acrefoot
eab6a1d190
[manual] add nagios checks for email_deliver
...
manual step: puppet apply, make sure that these nagios checks are working properly
(imported from commit abc75b8a5b153510243c14035b820fbc864b7776)
2013-11-13 16:41:36 -05:00
Tim Abbott
b5979a3fed
[manual]: Rename zulip-internal puppet module to zulip_internal.
...
(imported from commit 64ac7ec0f3495b1fe7810da3d4d41263c52b9b3b)
2013-11-05 17:06:32 -05:00