Leo Franchi
c81c418478
Aggregate stats_counts.app.cache.* as well as stats
...
(imported from commit 675d7b10f982bd180bf049d6552d3d0b75da8b80)
2013-09-17 14:23:46 -04:00
Leo Franchi
e13c94b904
Add aggregation for reading stats
...
(imported from commit c344b9c2897bf95287c0f0df281b0567261c87f7)
2013-09-10 10:24:15 -04:00
Leo Franchi
7aedfbba14
Include all configuration bits for stats in puppet
...
(imported from commit 439c062dafd1167f465b6a1a0d8e6767dc6babeb)
2013-09-10 10:24:15 -04:00
Leo Franchi
e5a39e5b4f
Fix apache2 nagios confing redirect
...
(imported from commit 88775e874e2852e0c8b7846bdde04d7d1d2a1afe)
2013-09-09 10:43:04 -04:00
Zev Benjamin
f666e168e1
nagios: Fix argument expansion in check_postgres_alert_args command
...
This was apparently causing some quoting weirdness.
(imported from commit 680d3cb9f4b1cf47cc1652feeb9f19855e7aa3e0)
2013-09-05 20:02:02 -04:00
Zev Benjamin
abe07b34d5
Up the thresholds for nagios alerts about Postgres locks
...
We can hopefully lower this again when we figure turn on autocommit
or figure out why we're suddenly taking more locks. For now, though,
the number of locks does not seem to be harmful to performance.
(imported from commit 0f8915d4a44aa28d1fa0969b93beea691d70d8d7)
2013-09-05 19:32:33 -04:00
Zev Benjamin
11a503667d
Replace configuration references to trac.humbughq.com with trac.zulip.net
...
(imported from commit aa042a10ada3f04205fd8d3e94c2b13f12a5a1a7)
2013-09-05 15:43:08 -04:00
Luke Faraone
d95da86299
Apache etc configuration changes to stats to support stats1.zulip.net
...
(imported from commit 3a20e31b6a5bf897aa700b55eba1bb0e8346c36c)
2013-09-05 15:43:07 -04:00
Luke Faraone
1c48aa7b8c
Switch nagios to .zulip.net and add cert
...
We maintain a legacy hhq.c cert to avoid breaking links.
(imported from commit 4e8878185b02c0ed6c9193e21ff94ac96a2ef7b4)
2013-09-05 15:43:07 -04:00
Luke Faraone
4f6159049d
Switch trac to .zulip.net and add cert
...
We maintain a legacy hhq.c cert to avoid breaking links.
(imported from commit e9953a3cc93f057abd79d7ce652e0942051ff5dd)
2013-09-05 15:43:07 -04:00
Tim Abbott
a575ea997f
puppet: Run GitHub Camo on lb0 for external-content.zulipcdn.net.
...
(imported from commit c03f2018fee1d88b747f45fc7841949f61e5c06c)
2013-09-05 14:35:29 -04:00
Tim Abbott
ddc9c53e1b
nagios: Set max_check_attempts to 3 for rabbitmq consumers.
...
This works around the fact that we seem to have a mysterous extra
checkup 40s after an error first occurs with these checks, which
always fails because the data is updated by a cron job that runs every
minute.
(imported from commit e7fe9c85e8399115443269287e695b140b4443ff)
2013-09-05 14:35:29 -04:00
Steve Howell
360cc754cc
Extract last_check_ts var in nagios script
...
(imported from commit b391ef2832100a8140789b0d1b553b028e0b7af6)
2013-09-03 22:30:00 -04:00
Tim Abbott
221f6e6199
puppet: Add cron job to clear expired Django sessions.
...
(imported from commit 1426430785f483d450f3d044147fe0fad50cb898)
2013-09-03 17:10:40 -04:00
Tim Abbott
4951ad2ce7
check_personal_zephyr_mirrors: Use newlines in display format.
...
(imported from commit d4114cde3575fafd0b37722681baf2e61517a6ad)
2013-09-03 11:40:42 -04:00
Jessica McKellar
8176429328
Un-break PagerDuty and go back to using the e-mail integration.
...
(imported from commit 1b33336ad56d56e6d9e48a5b2497a5115fbd310f)
2013-08-30 11:32:12 -04:00
Jessica McKellar
c125db4917
email-mirror: put Nagios configuration in puppet.
...
(imported from commit a6ecac877628cf7918e55d553cf509aa5c0635cb)
2013-08-28 16:43:40 -04:00
Jessica McKellar
7737434dbe
email-mirror: run the management command as the humbug user.
...
We don't set up certs for root to be able to make database queries.
(imported from commit 0236745fb1d5aace3097acbe0ed4b5772a19b255)
2013-08-28 16:43:39 -04:00
Zev Benjamin
ade55dc39c
Fix PATH for Postgres backup cron job
...
(imported from commit 9e4b868d444d73f9eab1507851b1b3ab47655977)
2013-08-28 15:16:25 -04:00
Leo Franchi
cfb623ae5c
Bump os x app to 0.3.6
...
(imported from commit 51ddf87797a94b97dec8ce998a08a83ed8837196)
2013-08-28 14:20:20 -04:00
Zev Benjamin
1d13c091ef
pg_backup_and_purge.py: Set up logging
...
(imported from commit 0948f548b005505a7f0c38c3af245815ec9efd6c)
2013-08-28 13:30:51 -04:00
Leo Franchi
9d895e53d5
Add SHELL, PATH, and ROOT variables to crontab
...
(imported from commit 3e556b12bb1c5332bd08ba267d24e0dd524bff7d)
2013-08-27 17:28:07 -04:00
Leo Franchi
071415ae93
Update desktop apps to 0.3.5 in sparkle
...
(imported from commit cf7c1b9dbbc0cbda5c4776d7b4f523f9f4f885a8)
2013-08-27 15:42:06 -04:00
Tim Abbott
611955e9fa
zephyr_mirror: Use the new reasonable --log-path option.
...
(imported from commit 2208e4c47cf49e3005cc3b93314b8c2a0ffc23c6)
2013-08-27 14:21:26 -04:00
Tim Abbott
1ece1c17c3
zephyr_mirror: Stop logging to the old resend log.
...
We were intending to just be directing the logger logs, not logging
every message for potentially resending.
(imported from commit 9dd53e3968880745a8c01244db5d2c2247dfe85c)
2013-08-27 14:21:26 -04:00
Zev Benjamin
92623c74e9
Fix comment typo
...
(imported from commit 6da9e7d98f790a0728dfe3338f18e91d93afb464)
2013-08-27 13:44:26 -04:00
Zev Benjamin
7409e81775
Add nagios alert for Postgres backups
...
(imported from commit 1ffe019b898751aea215dda1826113c1df5bee5c)
2013-08-27 13:29:41 -04:00
Tim Abbott
ee0a3e55b7
puppet: Don't restart all services when doing supervisor config update.
...
(imported from commit 767dc26293f301ee18fc149156ad43ad1bd4b43c)
2013-08-26 18:17:26 -04:00
Tim Abbott
3152a8ef6e
nagios: Update stats to new stats.zulip.net host.
...
(imported from commit 1053750c9ba90ac21a9b523f84eefe5525e18e6d)
2013-08-26 18:17:26 -04:00
Tim Abbott
7014a661d2
zmirror: Add monitoring for personals mirrors.
...
(imported from commit acb7f2222076d2f3884a2e52c7032cb48fde1757)
2013-08-26 18:17:26 -04:00
Tim Abbott
c162127386
puppet: Move supervisord configuration to better path.
...
(imported from commit e243bb4c8ea180a3c6e74447b575e398169f7b67)
2013-08-26 18:17:26 -04:00
Tim Abbott
64a629f086
puppet: Make sure the humbug user owns supervisord socket properly.
...
This prevents the humbug user from losing access to the socket every
time that we restart supervisord.
(imported from commit 14dba25101d5e55bda4be46f4faa1d1d1db9a9be)
2013-08-26 18:17:26 -04:00
Tim Abbott
e2ea7c3c14
Import supervisord config into puppet.
...
(imported from commit e3247d11c8054f07fc44172eb21b7d76fd230615)
2013-08-26 18:17:25 -04:00
Tim Abbott
98aa534d3d
puppet: Add zmirror_personals machine type.
...
(imported from commit 3712d4ac1c2faa53cd9211441cc52da98de19e9b)
2013-08-26 18:17:25 -04:00
Tim Abbott
9893b37520
puppet: Fix zmirror supervisor configuration.
...
(imported from commit 3998cf36d6f9a60e6464e910db9cce57581710af)
2013-08-26 18:17:25 -04:00
Zev Benjamin
50c98c2da2
check_pg_replication_lag: Check for the critical conditions before the warning condition
...
Otherwise critical conditions are only reported as warnings.
(imported from commit 3e0e21f952e206e8df5b971633a8b0981540efc7)
2013-08-26 15:49:28 -04:00
Zev Benjamin
14f2b41c5d
Add Nagios plugin for monitoring the Postgres replication lag
...
(imported from commit 0196cca357ba58b08ac74065ce84f0e49141d30d)
2013-08-22 17:36:19 -04:00
Zev Benjamin
1bd4093e9b
Update Postgres pg_hba to match reality
...
(imported from commit 953fcff4df31cf7a1b14d674fe6df381f348fe72)
2013-08-22 11:47:03 -04:00
Tim Abbott
33872fc427
puppet: Fix path for rabbitmq-numconsumers.
...
(imported from commit 8d893057e837c8e8f45bc262cc7f3477ea5da415)
2013-08-21 12:01:55 -04:00
Tim Abbott
b8873e3283
Use /var/lib/nagios_state over /var/run/nagios for check status files.
...
The previous path would get destroyed on reboot.
(imported from commit 27e8fa54fc1e4001166db93b1fb59e87e0b1ccf5)
2013-08-20 17:09:09 -04:00
Tim Abbott
605171d1da
zephyr_mirror: Renew kerberos tickets when starting up class mirror.
...
(imported from commit 34b0a78558c6f20043efe986a16064387f36f1c0)
2013-08-20 15:27:24 -04:00
Tim Abbott
a8e28bb99a
puppet: Set correct permissions for supervisor.sock.
...
We can't do this with a "file" entry because it doesn't handle objects
of type socket.
(imported from commit 5f0f79f1a7905c4d70cc7e2f4c474b0629841c12)
2013-08-20 15:27:24 -04:00
Tim Abbott
12ec430cb8
puppet: Fix typo in email mirror configuration.
...
(imported from commit f81ca5ba28046be701f672df1fc3409e09841aec)
2013-08-20 15:27:24 -04:00
Tim Abbott
6850f25907
[manual] puppet: Move rabbitmq cron jobs into puppet.
...
When we deploy this, we should remove the relevant jobs from root's
crontab on our app servers.
(imported from commit 749be952d504f5a4d243cf59f6430acc689fc821)
2013-08-20 15:27:24 -04:00
Tim Abbott
9e6e57f349
puppet: Run sync-public-streams via supervisor.
...
(imported from commit f515cd36cd3138bcc8a13310276f3a6bb961ce33)
2013-08-20 15:27:23 -04:00
Tim Abbott
1a7d946bd1
puppet: Move zephyr_mirror crontab into /etc/cron.d.
...
(imported from commit e93e630704ce7e85346f54f095c642010058fc09)
2013-08-20 15:27:21 -04:00
Tim Abbott
c7be859d21
puppet: Run zephyr_mirror.py via supervisord.
...
(imported from commit 09086a1dd298e7ee9b655ccd6c58ac720bbc7798)
2013-08-20 15:26:50 -04:00
Tim Abbott
00279ae705
puppet: Add debathena.list to zmirror configuration.
...
(imported from commit 3d54bd495b5f8f582ec4d1f7a859ac0c0b63930b)
2013-08-20 15:26:50 -04:00
Luke Faraone
d3e1d0301d
Remove dummy 10.0.0.0/8 routes on interface configure.
...
(imported from commit 8a01137550b668a71871d20b1ebc1e62f0fdbaf4)
2013-08-19 16:19:47 -04:00
Luke Faraone
91ff010853
Configuration to create schroots.
...
(imported from commit 27288bfdc5b39aa5c8804ac9628dc960b621ca7f)
2013-08-19 16:19:45 -04:00