zulip

Commit Graph

Author	SHA1	Message	Date
Tim Abbott	cbbca8319b	Restart Tornado _after_ moving the deployment symlink. Otherwise we end up running Tornado against the _previous_ version of our code! Testing of using supervisorctl to stop and then start a process shows it takes about the same amount of time as doing a supervisorctl restart, so there's no reason not to split the two commands apart and make it super clear that nothing is running at the time that we move the deployment symlink. (imported from commit c38049da2bfc9fa94320a32dbf3240d1fcba67f7)	2013-06-13 16:32:23 -04:00
Zev Benjamin	8884e81bb3	Move the current deployment symlink in restart-server This will help minimize downtime. (imported from commit 47fb66f0d2e21fc12f62c69b7c59ca6828553309)	2013-06-05 16:33:43 -04:00
Zev Benjamin	300e57fcbd	Fill memcached caches synchronously before restarting the server (imported from commit a45fa845e94a1fc6e96a1aafca31e9a6fc2b7526)	2013-06-05 16:33:43 -04:00
Tim Abbott	795173ac86	update-deployment: Move color codes to humbug_tools.py. (imported from commit 96a79a699ea032714b24aa6572fd188433b979a8)	2013-06-04 18:42:00 -04:00
Zev Benjamin	cce8dfab84	[manual] Use the same socket across server restarts We let supervisor create the socket for us by making humbug-django a fcig-program. Unfortunately, supevisor doesn't support putting fcgi-programs in groups (see https://github.com/Supervisor/supervisor/issues/148), so we have to restart tornado and django separately. To deploy, copy the config files over and restart nginx and supervisor (via stopping and then starting it because restart is broken). I believe the automated restart as part of update-deployment will fail because of the way supervisor treats programs in groups. If so, after restarting supervisor, you will also need to run restart-server manually to fill the caches and then delete the lock directory in humbug-deployments. (imported from commit bfb5db7dd42dcbc4bfefa2944355b3cbb2ef9104)	2013-05-23 00:19:17 -04:00
Zev Benjamin	8fd72a09bc	Restart Django and Tornado separately from the other worker processes The amount of process downtime during a supervisord-mediated restart appears to be linear in the number of processes that are being restarted. Therefore, restarting just Django and Tornado causes less downtime than doing them at the same time as the other worker processes. (imported from commit 1fa9ef547bcd88caeec49800664e37d5f2fcb7a8)	2013-05-21 16:13:39 -04:00
acrefoot	9d8f847fed	[manual] Run server using supervisord This change will make it so that processes related to the app.humbughq.com server are run under supervisord, which uses a state machine model to ensure that programs are running. It also ensure process startup order. We will need to manually switch the old way of running server (in screen) into this new way of doing things, on both staging and prod (app_frontend.pp has been updated appropriately). This means: 1) cp servers/puppet/modules/humbug/files/supervisord/conf.d/humbug.conf /etc/supervisord/conf.d 2) installing the supervisor package. 3) killing those while loops in that screen session 4) mkdir /var/log/humbug (as root) 5) /etc/init.d/supervisord start 6) check that nothing broke (imported from commit 055269a70973db89acd69049e01b185fabdc8f90)	2013-05-20 23:42:28 -04:00
Leo Franchi	77fd5f1113	Also kill/restart send_missedmessage_reminders on a deploy (imported from commit 560f6aabb0daec63d55806f8caab76d3fb7d260a)	2013-05-10 09:41:49 -04:00
Leo Franchi	7b0423efc1	Use incr instead of gauge when sending events to drawAsInfinite to statsd (imported from commit 08a4b6920c7a4a8f472f147ddce7c04710fe5c0a)	2013-04-19 09:56:41 -04:00
Leo Franchi	d02f46fba7	Log a statsd event when restarting the server (imported from commit e9fa632a39f0a6b6aa7311e80e68faf4178a2cf3)	2013-04-18 18:05:52 -04:00
Tim Abbott	6ab480d9c6	restart-server: Call fill_memcached_caches via more direct path. The previous version ended up being (at least sometimes) wrong after the recent deployment system changes. (imported from commit dec3beb1b1bf8b9c9ad6820b93b0a5d730d020e8)	2013-04-17 10:19:13 -04:00
Tim Abbott	3ef78dd01c	Fix restarting of commands with new deployment loops. (imported from commit 139d80097d74b1c7f4ec92175a2052b4acf55af1)	2013-04-12 15:35:49 -04:00
Tim Abbott	1b3d93ad47	Fix restarting of various tools during deployments. (imported from commit b16b8b3d7ce159a91e7d9c3781f1e69861bd3ff3)	2013-04-12 15:35:49 -04:00
Tim Abbott	1b8cf16988	[manual] Update deployment process to run atomically. This requires manual steps on deploy to each of staging and prod: (1) Run the new update-deployment code to setup the initial deployment directory. (2) Restart all the programs running in screen sessions. (3) Deploy the nginx changes and restart nginx. (imported from commit 1ffe27933ee79274dc0a93d35c9938712de0ef36)	2013-04-12 11:54:50 -04:00
Tim Abbott	88e2949818	Move update-prod-static etc. out of restart-server. (imported from commit 763af1c8b1937cda860acc635d9d87c12d66a938)	2013-04-12 11:54:49 -04:00
Tim Abbott	c0d65124f8	restart-server: Add some output on what's happening as we go. restart-server has been relatively slow recently, and it'd be nice to know what it is spending its time doing when it hangs for a few seconds. (imported from commit a411c951f5a3f2a1366b6d5d3a40d0660ebec11b)	2013-03-14 15:07:42 -04:00
Tim Abbott	60c3cf826d	restart-server: Fill memcached caches after clearing the cache. Our previous code could in theory end up clearing the caches it had just filled, if Tornado's cache filling work happened to be faster than the memcached flush. (imported from commit 48174aadad398fb7a7c917a1df765c1261b12a55)	2013-03-14 15:07:41 -04:00
Tim Abbott	91fb10e7dc	restart-server: Move memcached flush to after server code restart. (imported from commit 12123beac108db54e7e8a0c62b65762920997f7e)	2013-03-04 11:30:00 -05:00
Keegan McAllister	75704ad1e8	Consistently use #!/usr/bin/env python At Ksplice we used /usr/bin/python because we shipped dependencies as Debian / Red Hat packages, which would be installed against the system Python. We were also very careful to use only Python 2.3 features so that even old system Python would still work. None of that is true at Humbug. We expect users to install dependencies themselves, so it's more likely that the Python in $PATH is correct. On OS X in particular, it's common to have five broken Python installs and there's no expectation that /usr/bin/python is the right one. The files which aren't marked executable are not interesting to run as scripts, so we just remove the line there. (In general it's common to have libraries that can also be executed, to run test cases or whatever, but that's not the case here.) (imported from commit 437d4aee2c6e66601ad3334eefd50749cce2eca6)	2013-02-20 16:02:30 -05:00
Keegan McAllister	6990260b59	[manual] Minify JavaScript and CSS in production Manual deployment steps: The same Nginx reload as for "Get rid of the static-access-control mechanism". If deploying both commits at once, just do it once. (imported from commit dd8dbbf14b95fce0a4b6f66f462fa0a6b50bfb8c)	2013-01-31 15:41:01 -05:00
Tim Abbott	97aec279b3	Add deployment auto-restart for process_user_activity. (imported from commit 7ff384629552e454a9b25577f17d06ac27ae5239)	2013-01-31 12:09:16 -05:00
Tim Abbott	9cc3e29de5	Split restart-server code out of update-deployment. (imported from commit 3ae913b950be0a0c94fbaf0173012ea315f36d62)	2013-01-31 12:09:16 -05:00

22 Commits