Commit Graph

429 Commits

Author SHA1 Message Date
Leo Franchi c81c418478 Aggregate stats_counts.app.cache.* as well as stats
(imported from commit 675d7b10f982bd180bf049d6552d3d0b75da8b80)
2013-09-17 14:23:46 -04:00
Leo Franchi e13c94b904 Add aggregation for reading stats
(imported from commit c344b9c2897bf95287c0f0df281b0567261c87f7)
2013-09-10 10:24:15 -04:00
Leo Franchi 7aedfbba14 Include all configuration bits for stats in puppet
(imported from commit 439c062dafd1167f465b6a1a0d8e6767dc6babeb)
2013-09-10 10:24:15 -04:00
Leo Franchi e5a39e5b4f Fix apache2 nagios confing redirect
(imported from commit 88775e874e2852e0c8b7846bdde04d7d1d2a1afe)
2013-09-09 10:43:04 -04:00
Zev Benjamin f666e168e1 nagios: Fix argument expansion in check_postgres_alert_args command
This was apparently causing some quoting weirdness.

(imported from commit 680d3cb9f4b1cf47cc1652feeb9f19855e7aa3e0)
2013-09-05 20:02:02 -04:00
Zev Benjamin abe07b34d5 Up the thresholds for nagios alerts about Postgres locks
We can hopefully lower this again when we figure turn on autocommit
or figure out why we're suddenly taking more locks.  For now, though,
the number of locks does not seem to be harmful to performance.

(imported from commit 0f8915d4a44aa28d1fa0969b93beea691d70d8d7)
2013-09-05 19:32:33 -04:00
Zev Benjamin 11a503667d Replace configuration references to trac.humbughq.com with trac.zulip.net
(imported from commit aa042a10ada3f04205fd8d3e94c2b13f12a5a1a7)
2013-09-05 15:43:08 -04:00
Luke Faraone d95da86299 Apache etc configuration changes to stats to support stats1.zulip.net
(imported from commit 3a20e31b6a5bf897aa700b55eba1bb0e8346c36c)
2013-09-05 15:43:07 -04:00
Luke Faraone 1c48aa7b8c Switch nagios to .zulip.net and add cert
We maintain a legacy hhq.c cert to avoid breaking links.

(imported from commit 4e8878185b02c0ed6c9193e21ff94ac96a2ef7b4)
2013-09-05 15:43:07 -04:00
Luke Faraone 4f6159049d Switch trac to .zulip.net and add cert
We maintain a legacy hhq.c cert to avoid breaking links.

(imported from commit e9953a3cc93f057abd79d7ce652e0942051ff5dd)
2013-09-05 15:43:07 -04:00
Tim Abbott a575ea997f puppet: Run GitHub Camo on lb0 for external-content.zulipcdn.net.
(imported from commit c03f2018fee1d88b747f45fc7841949f61e5c06c)
2013-09-05 14:35:29 -04:00
Tim Abbott ddc9c53e1b nagios: Set max_check_attempts to 3 for rabbitmq consumers.
This works around the fact that we seem to have a mysterous extra
checkup 40s after an error first occurs with these checks, which
always fails because the data is updated by a cron job that runs every
minute.

(imported from commit e7fe9c85e8399115443269287e695b140b4443ff)
2013-09-05 14:35:29 -04:00
Steve Howell 360cc754cc Extract last_check_ts var in nagios script
(imported from commit b391ef2832100a8140789b0d1b553b028e0b7af6)
2013-09-03 22:30:00 -04:00
Tim Abbott 221f6e6199 puppet: Add cron job to clear expired Django sessions.
(imported from commit 1426430785f483d450f3d044147fe0fad50cb898)
2013-09-03 17:10:40 -04:00
Tim Abbott 4951ad2ce7 check_personal_zephyr_mirrors: Use newlines in display format.
(imported from commit d4114cde3575fafd0b37722681baf2e61517a6ad)
2013-09-03 11:40:42 -04:00
Jessica McKellar 8176429328 Un-break PagerDuty and go back to using the e-mail integration.
(imported from commit 1b33336ad56d56e6d9e48a5b2497a5115fbd310f)
2013-08-30 11:32:12 -04:00
Jessica McKellar c125db4917 email-mirror: put Nagios configuration in puppet.
(imported from commit a6ecac877628cf7918e55d553cf509aa5c0635cb)
2013-08-28 16:43:40 -04:00
Jessica McKellar 7737434dbe email-mirror: run the management command as the humbug user.
We don't set up certs for root to be able to make database queries.

(imported from commit 0236745fb1d5aace3097acbe0ed4b5772a19b255)
2013-08-28 16:43:39 -04:00
Zev Benjamin ade55dc39c Fix PATH for Postgres backup cron job
(imported from commit 9e4b868d444d73f9eab1507851b1b3ab47655977)
2013-08-28 15:16:25 -04:00
Leo Franchi cfb623ae5c Bump os x app to 0.3.6
(imported from commit 51ddf87797a94b97dec8ce998a08a83ed8837196)
2013-08-28 14:20:20 -04:00
Zev Benjamin 1d13c091ef pg_backup_and_purge.py: Set up logging
(imported from commit 0948f548b005505a7f0c38c3af245815ec9efd6c)
2013-08-28 13:30:51 -04:00
Leo Franchi 9d895e53d5 Add SHELL, PATH, and ROOT variables to crontab
(imported from commit 3e556b12bb1c5332bd08ba267d24e0dd524bff7d)
2013-08-27 17:28:07 -04:00
Leo Franchi 071415ae93 Update desktop apps to 0.3.5 in sparkle
(imported from commit cf7c1b9dbbc0cbda5c4776d7b4f523f9f4f885a8)
2013-08-27 15:42:06 -04:00
Tim Abbott 611955e9fa zephyr_mirror: Use the new reasonable --log-path option.
(imported from commit 2208e4c47cf49e3005cc3b93314b8c2a0ffc23c6)
2013-08-27 14:21:26 -04:00
Tim Abbott 1ece1c17c3 zephyr_mirror: Stop logging to the old resend log.
We were intending to just be directing the logger logs, not logging
every message for potentially resending.

(imported from commit 9dd53e3968880745a8c01244db5d2c2247dfe85c)
2013-08-27 14:21:26 -04:00
Zev Benjamin 92623c74e9 Fix comment typo
(imported from commit 6da9e7d98f790a0728dfe3338f18e91d93afb464)
2013-08-27 13:44:26 -04:00
Zev Benjamin 7409e81775 Add nagios alert for Postgres backups
(imported from commit 1ffe019b898751aea215dda1826113c1df5bee5c)
2013-08-27 13:29:41 -04:00
Tim Abbott ee0a3e55b7 puppet: Don't restart all services when doing supervisor config update.
(imported from commit 767dc26293f301ee18fc149156ad43ad1bd4b43c)
2013-08-26 18:17:26 -04:00
Tim Abbott 3152a8ef6e nagios: Update stats to new stats.zulip.net host.
(imported from commit 1053750c9ba90ac21a9b523f84eefe5525e18e6d)
2013-08-26 18:17:26 -04:00
Tim Abbott 7014a661d2 zmirror: Add monitoring for personals mirrors.
(imported from commit acb7f2222076d2f3884a2e52c7032cb48fde1757)
2013-08-26 18:17:26 -04:00
Tim Abbott c162127386 puppet: Move supervisord configuration to better path.
(imported from commit e243bb4c8ea180a3c6e74447b575e398169f7b67)
2013-08-26 18:17:26 -04:00
Tim Abbott 64a629f086 puppet: Make sure the humbug user owns supervisord socket properly.
This prevents the humbug user from losing access to the socket every
time that we restart supervisord.

(imported from commit 14dba25101d5e55bda4be46f4faa1d1d1db9a9be)
2013-08-26 18:17:26 -04:00
Tim Abbott e2ea7c3c14 Import supervisord config into puppet.
(imported from commit e3247d11c8054f07fc44172eb21b7d76fd230615)
2013-08-26 18:17:25 -04:00
Tim Abbott 98aa534d3d puppet: Add zmirror_personals machine type.
(imported from commit 3712d4ac1c2faa53cd9211441cc52da98de19e9b)
2013-08-26 18:17:25 -04:00
Tim Abbott 9893b37520 puppet: Fix zmirror supervisor configuration.
(imported from commit 3998cf36d6f9a60e6464e910db9cce57581710af)
2013-08-26 18:17:25 -04:00
Zev Benjamin 50c98c2da2 check_pg_replication_lag: Check for the critical conditions before the warning condition
Otherwise critical conditions are only reported as warnings.

(imported from commit 3e0e21f952e206e8df5b971633a8b0981540efc7)
2013-08-26 15:49:28 -04:00
Zev Benjamin 14f2b41c5d Add Nagios plugin for monitoring the Postgres replication lag
(imported from commit 0196cca357ba58b08ac74065ce84f0e49141d30d)
2013-08-22 17:36:19 -04:00
Zev Benjamin 1bd4093e9b Update Postgres pg_hba to match reality
(imported from commit 953fcff4df31cf7a1b14d674fe6df381f348fe72)
2013-08-22 11:47:03 -04:00
Tim Abbott 33872fc427 puppet: Fix path for rabbitmq-numconsumers.
(imported from commit 8d893057e837c8e8f45bc262cc7f3477ea5da415)
2013-08-21 12:01:55 -04:00
Tim Abbott b8873e3283 Use /var/lib/nagios_state over /var/run/nagios for check status files.
The previous path would get destroyed on reboot.

(imported from commit 27e8fa54fc1e4001166db93b1fb59e87e0b1ccf5)
2013-08-20 17:09:09 -04:00
Tim Abbott 605171d1da zephyr_mirror: Renew kerberos tickets when starting up class mirror.
(imported from commit 34b0a78558c6f20043efe986a16064387f36f1c0)
2013-08-20 15:27:24 -04:00
Tim Abbott a8e28bb99a puppet: Set correct permissions for supervisor.sock.
We can't do this with a "file" entry because it doesn't handle objects
of type socket.

(imported from commit 5f0f79f1a7905c4d70cc7e2f4c474b0629841c12)
2013-08-20 15:27:24 -04:00
Tim Abbott 12ec430cb8 puppet: Fix typo in email mirror configuration.
(imported from commit f81ca5ba28046be701f672df1fc3409e09841aec)
2013-08-20 15:27:24 -04:00
Tim Abbott 6850f25907 [manual] puppet: Move rabbitmq cron jobs into puppet.
When we deploy this, we should remove the relevant jobs from root's
crontab on our app servers.

(imported from commit 749be952d504f5a4d243cf59f6430acc689fc821)
2013-08-20 15:27:24 -04:00
Tim Abbott 9e6e57f349 puppet: Run sync-public-streams via supervisor.
(imported from commit f515cd36cd3138bcc8a13310276f3a6bb961ce33)
2013-08-20 15:27:23 -04:00
Tim Abbott 1a7d946bd1 puppet: Move zephyr_mirror crontab into /etc/cron.d.
(imported from commit e93e630704ce7e85346f54f095c642010058fc09)
2013-08-20 15:27:21 -04:00
Tim Abbott c7be859d21 puppet: Run zephyr_mirror.py via supervisord.
(imported from commit 09086a1dd298e7ee9b655ccd6c58ac720bbc7798)
2013-08-20 15:26:50 -04:00
Tim Abbott 00279ae705 puppet: Add debathena.list to zmirror configuration.
(imported from commit 3d54bd495b5f8f582ec4d1f7a859ac0c0b63930b)
2013-08-20 15:26:50 -04:00
Luke Faraone d3e1d0301d Remove dummy 10.0.0.0/8 routes on interface configure.
(imported from commit 8a01137550b668a71871d20b1ebc1e62f0fdbaf4)
2013-08-19 16:19:47 -04:00
Luke Faraone 91ff010853 Configuration to create schroots.
(imported from commit 27288bfdc5b39aa5c8804ac9628dc960b621ca7f)
2013-08-19 16:19:45 -04:00