Commit Graph

455 Commits

Author SHA1 Message Date
Zev Benjamin 8edbd64bb8 Monitor the queue processors for the missedmessage_emails and slow_queries queues
(imported from commit 266b8f19b87a025ab35bd6dd4017bdf8a7694b49)
2013-10-04 17:58:44 -04:00
Tim Abbott d188d829d7 Update UserActivity queries for monitoring Zephyr mirroring.
(imported from commit 04a9536da2891e905c6e14e0d452ca62d632641d)
2013-10-04 16:15:53 -04:00
Zev Benjamin 2547e0768f puppet: Remove rabbitmq consumer checks based on check_procs
These have been superceded by checks for the existance of consumers
of the relevant queues.

(imported from commit 68a0e79734366411e39e9e4346b5a61bdd34144b)
2013-10-04 14:19:16 -04:00
Zev Benjamin dc082cd96d puppet: Add nagios notifications for the rest of our rabbitmq queues
(imported from commit 9d21a0ca3662396c436b482c574113d0cbc714a0)
2013-10-04 14:19:16 -04:00
Zev Benjamin 61ca14b400 [manual] puppet: Consolidate check_rabbitmq_*_consumers commands
This temporarily breaks the rabbitmq consumer checks for
user_activity and notify_tornado on prod.  This should be deployed in
such a way to minimize the time that the alert needs to be ignored.

(imported from commit 08fa2f0e7d78fca1346c62824573263e42339a45)
2013-10-04 14:19:16 -04:00
Zev Benjamin 6e54ca3045 puppet: Factor out writing the rabbitmq consumer check state file into its own script
This temporarily breaks the rabbitmq consumer checks for the
user_activity and notify_tornado queues because their state files
were renamed to match their queue names.  It will be fixed for
staging in the next commit.

(imported from commit a6aaa330a1134d8ddffe8f4959deb12b219f241a)
2013-10-04 14:19:16 -04:00
Tim Abbott dd3281fea8 Rename some staging nginx config variables humbug => zulip.
(imported from commit 7937e0ee2b1ebbdf184be3ceec74afc206a56c83)
2013-10-04 11:45:40 -04:00
Leo Franchi 2614716fca Log slow queries to zulip so we notice them
(imported from commit 23f311ad881edda4c4495089ea3b55213470a059)
2013-09-30 17:41:56 -04:00
Zev Benjamin a906890b4d install-server: Run resize2fs
This allows us to have larger root filesystems than the AMI image.

(imported from commit 4e9698432b0c154a0bc635df07abd278c08a4905)
2013-09-30 11:09:26 -04:00
Jessica McKellar 03fe84aa6a nagios: use last Received date to determine message age.
If there are delays while routing the email, we don't want to get a
spurious alert.

(imported from commit 3a9e3abf0a4db2b026f797c929f1b46978f1e5e4)
2013-09-27 11:39:42 -04:00
Jessica McKellar 4acddabe10 nagios: parse dates using timezone-aware functions in the email mirror check.
Why does email.utils.parsedate also exist? To put bugs in peoples' software.

(imported from commit a3dca741e5274027ef177388b49061b9b3c5d29e)
2013-09-27 11:39:42 -04:00
Tim Abbott 4be5d81af1 [manual] Write logs to /var/log/zulip rather than /var/log/humbug.
This requires a puppet apply to update the supervisor configuration.

(imported from commit f2836b6d9c53791af6f6ceb1650d0e0740df70ab)
2013-09-25 16:52:41 -04:00
Tim Abbott 0a4a53211c [manual] Rename /var/log/humbug to /var/log/zulip.
This requires a "puppet apply" to be done to create /var/log/zulip
before we deploy anything using the new directory.

(imported from commit 2d7baedbf923df9f01b152cf0bda6494f0eac936)
2013-09-25 16:52:39 -04:00
Tim Abbott 0b9e54416d [manual] puppet: Rename humbug=>zulip in nginx configuration filenames.
We need to manually remove the old humbug and humbug-staging sites-*
files when we deploy this via puppet.

(imported from commit d25e0172a14032c5acf1501668602d34b1b13b85)
2013-09-25 15:40:21 -04:00
Tim Abbott 32a333be42 puppet: Rename humbug=>zulip in nginx configuration.
(imported from commit c5880c7fe9f7d709d39e788cc2b6be940c279972)
2013-09-25 15:40:21 -04:00
Tim Abbott 4505df407b puppet: Access apt repositories via apt.zulip.com hostname.
(imported from commit 2d4ecd7479ac9565d52bd85c324003e03495ebef)
2013-09-25 15:40:21 -04:00
Tim Abbott 351dd6dd5d nagios: Update monitoring to use only the .zulip.net hostnames.
(imported from commit fc1a91546d39f4915ee9b9b63dad284b2cd54443)
2013-09-25 15:40:21 -04:00
Tim Abbott 17a4c39234 Rename humbug=>zulip in check_send_receive_time.
(imported from commit b734056dfb9d69d5dfc06a67e47c3579bb557f51)
2013-09-25 15:40:21 -04:00
Tim Abbott dabb565179 puppet: Delete obsolete my.cnf file.
(imported from commit 7a61a367fe352f275f95cdceebd69fa46a7691ff)
2013-09-25 15:40:21 -04:00
Leo Franchi bafe43f363 Release os x 0.3.8 via sparkle
(imported from commit 4d8b7ff23c48952f28a968c977c87951258c3333)
2013-09-25 11:18:41 -04:00
Zev Benjamin 0d0655ebdf puppet: Fix our supervisor worker configuration
* I forgot to add a worker for user_activity_interval
* user_activity_interval and user_presence weren't in the humbug-workers group

(imported from commit 61c29e54a0dff621a913b3e8491db41f4d0a3909)
2013-09-24 20:44:42 -04:00
Zev Benjamin 90ee96d456 Split out presence and activity interval events from user_activity queue
(imported from commit 3c22448491867a9e72659629d5d0faaf12b85b2c)
2013-09-24 20:44:15 -04:00
Zev Benjamin e450fc80c9 Move existing queue processing workers to new framework
(imported from commit b448eeefcddb1840344bd337046be30714a7fdbc)
2013-09-24 20:44:15 -04:00
Zev Benjamin 963b62873d puppet: Show all nagios results on the same page
(imported from commit aff603faa93c5d24f372eb6afb541d870aaecf5b)
2013-09-24 16:19:52 -04:00
Leo Franchi d3487aef15 Bump windows & mac apps to 0.3.7
(imported from commit 507332336538fae8321a200196b2778861967f6d)
2013-09-19 15:32:41 -04:00
Jessica McKellar 4300d7441a Add the new html2text requirement for the email mirror to Puppet.
(imported from commit 8d5e9eda4e554a3b48d4fa352e37125f31cafaac)
2013-09-18 15:41:27 -04:00
Leo Franchi c81c418478 Aggregate stats_counts.app.cache.* as well as stats
(imported from commit 675d7b10f982bd180bf049d6552d3d0b75da8b80)
2013-09-17 14:23:46 -04:00
Leo Franchi e13c94b904 Add aggregation for reading stats
(imported from commit c344b9c2897bf95287c0f0df281b0567261c87f7)
2013-09-10 10:24:15 -04:00
Leo Franchi 7aedfbba14 Include all configuration bits for stats in puppet
(imported from commit 439c062dafd1167f465b6a1a0d8e6767dc6babeb)
2013-09-10 10:24:15 -04:00
Leo Franchi e5a39e5b4f Fix apache2 nagios confing redirect
(imported from commit 88775e874e2852e0c8b7846bdde04d7d1d2a1afe)
2013-09-09 10:43:04 -04:00
Zev Benjamin f666e168e1 nagios: Fix argument expansion in check_postgres_alert_args command
This was apparently causing some quoting weirdness.

(imported from commit 680d3cb9f4b1cf47cc1652feeb9f19855e7aa3e0)
2013-09-05 20:02:02 -04:00
Zev Benjamin abe07b34d5 Up the thresholds for nagios alerts about Postgres locks
We can hopefully lower this again when we figure turn on autocommit
or figure out why we're suddenly taking more locks.  For now, though,
the number of locks does not seem to be harmful to performance.

(imported from commit 0f8915d4a44aa28d1fa0969b93beea691d70d8d7)
2013-09-05 19:32:33 -04:00
Zev Benjamin 11a503667d Replace configuration references to trac.humbughq.com with trac.zulip.net
(imported from commit aa042a10ada3f04205fd8d3e94c2b13f12a5a1a7)
2013-09-05 15:43:08 -04:00
Luke Faraone d95da86299 Apache etc configuration changes to stats to support stats1.zulip.net
(imported from commit 3a20e31b6a5bf897aa700b55eba1bb0e8346c36c)
2013-09-05 15:43:07 -04:00
Luke Faraone 1c48aa7b8c Switch nagios to .zulip.net and add cert
We maintain a legacy hhq.c cert to avoid breaking links.

(imported from commit 4e8878185b02c0ed6c9193e21ff94ac96a2ef7b4)
2013-09-05 15:43:07 -04:00
Luke Faraone 4f6159049d Switch trac to .zulip.net and add cert
We maintain a legacy hhq.c cert to avoid breaking links.

(imported from commit e9953a3cc93f057abd79d7ce652e0942051ff5dd)
2013-09-05 15:43:07 -04:00
Tim Abbott a575ea997f puppet: Run GitHub Camo on lb0 for external-content.zulipcdn.net.
(imported from commit c03f2018fee1d88b747f45fc7841949f61e5c06c)
2013-09-05 14:35:29 -04:00
Tim Abbott ddc9c53e1b nagios: Set max_check_attempts to 3 for rabbitmq consumers.
This works around the fact that we seem to have a mysterous extra
checkup 40s after an error first occurs with these checks, which
always fails because the data is updated by a cron job that runs every
minute.

(imported from commit e7fe9c85e8399115443269287e695b140b4443ff)
2013-09-05 14:35:29 -04:00
Steve Howell 360cc754cc Extract last_check_ts var in nagios script
(imported from commit b391ef2832100a8140789b0d1b553b028e0b7af6)
2013-09-03 22:30:00 -04:00
Tim Abbott 221f6e6199 puppet: Add cron job to clear expired Django sessions.
(imported from commit 1426430785f483d450f3d044147fe0fad50cb898)
2013-09-03 17:10:40 -04:00
Tim Abbott 4951ad2ce7 check_personal_zephyr_mirrors: Use newlines in display format.
(imported from commit d4114cde3575fafd0b37722681baf2e61517a6ad)
2013-09-03 11:40:42 -04:00
Jessica McKellar 8176429328 Un-break PagerDuty and go back to using the e-mail integration.
(imported from commit 1b33336ad56d56e6d9e48a5b2497a5115fbd310f)
2013-08-30 11:32:12 -04:00
Jessica McKellar c125db4917 email-mirror: put Nagios configuration in puppet.
(imported from commit a6ecac877628cf7918e55d553cf509aa5c0635cb)
2013-08-28 16:43:40 -04:00
Jessica McKellar 7737434dbe email-mirror: run the management command as the humbug user.
We don't set up certs for root to be able to make database queries.

(imported from commit 0236745fb1d5aace3097acbe0ed4b5772a19b255)
2013-08-28 16:43:39 -04:00
Zev Benjamin ade55dc39c Fix PATH for Postgres backup cron job
(imported from commit 9e4b868d444d73f9eab1507851b1b3ab47655977)
2013-08-28 15:16:25 -04:00
Leo Franchi cfb623ae5c Bump os x app to 0.3.6
(imported from commit 51ddf87797a94b97dec8ce998a08a83ed8837196)
2013-08-28 14:20:20 -04:00
Zev Benjamin 1d13c091ef pg_backup_and_purge.py: Set up logging
(imported from commit 0948f548b005505a7f0c38c3af245815ec9efd6c)
2013-08-28 13:30:51 -04:00
Leo Franchi 9d895e53d5 Add SHELL, PATH, and ROOT variables to crontab
(imported from commit 3e556b12bb1c5332bd08ba267d24e0dd524bff7d)
2013-08-27 17:28:07 -04:00
Leo Franchi 071415ae93 Update desktop apps to 0.3.5 in sparkle
(imported from commit cf7c1b9dbbc0cbda5c4776d7b4f523f9f4f885a8)
2013-08-27 15:42:06 -04:00
Tim Abbott 611955e9fa zephyr_mirror: Use the new reasonable --log-path option.
(imported from commit 2208e4c47cf49e3005cc3b93314b8c2a0ffc23c6)
2013-08-27 14:21:26 -04:00