zulip

Commit Graph

Author	SHA1	Message	Date
Umair Khan	6c1d805495	travis: Fix production suite flakiness. Previously, we were doing this request to the production server before waiting for all the supervisord processes to start; it's possible this could cause failures where we hit the server before the Django uwsgi processes are up. Hopefully fixes #2723.	2016-12-15 22:04:57 -08:00
Tim Abbott	15bfedec99	travis: Improve debuggability of server wget failures. The main improvement here is causing `wget` errors to be ignored so that we see the server logs in the event of a `wget` failure.	2016-12-03 20:48:57 -08:00
Tim Abbott	1a8a329b44	production-helper: Expand the apt-mark hold list.	2016-12-01 12:29:31 -08:00
Vishnu Ks	a7ead9e99d	settings: Eliminate ADMIN_DOMAIN for creating initial realm. We now use `./manage.py generate_realm_creation_link` as the flow flow for creating one's first realm.	2016-08-25 09:37:33 -07:00
Tim Abbott	88a123d5e0	Fix excessive CPU usage by rabbitmq-numconsumers Nagios checks. The previous model for these Nagios checks was kinda crazy -- every minute, we'd run a full `rabbitmctl list_consumers` for each of the dozen+ consumers that we have, and then do the exact same parsing logic for each to determine whether the target queue has a running consumer to write out a state file. Because `rabbitmctl list_consumers` takes a small amount of resources, on systems where CPU is very limited (e.g. t2 style AWS instances), this minor CPU wastage could be problematic. Now we just do that `rabbitmqctl list_consumers` once per minute, and output all the state files from a single command. Further TODO items on this front include removing the hardcoded list of queues.	2016-08-12 14:09:36 -07:00
Tim Abbott	6496fe2a53	travis: Remove rabbitmq nodename dependency on hostname. Because rabbitmq doesn't support changing the nodename of a running rabbitmq node, Zulip installations suffered a plague of issues where e.g. a Zulip server would reboot, the hostname would change, and suddenly the local rabbitmq instance being used by Zulip would stop working. We address this problem by using, by default, a fixed rabbitmq nodename, but providing server administrators the option to set the rabbitmq nodename used by Zulip however they choose. To upgrade an existing server to use this new configuration, one will need to add something like the following to /etc/zulip/zulip.conf: [rabbitmq] nodename = zulip@localhost However, I don't believe we have the puppet code in place to make this work correctly at initial installation without rabbitmq-server being already installed (but off), as we can easily setup in Travis CI but I haven't been willing to do for the installer. So for now, this just fixes our Travis CI problems. Fixes: #1579.	2016-08-12 09:38:23 -07:00
Tim Abbott	c7059c9751	travis: Update success-http-headers to match current certs. Travis CI seems to have changed the way the snakeoil SSL certs are generated in their infrastructure, so we need to update our expected "success" HTTP headers accordingly.	2016-08-12 09:35:41 -07:00
Tim Abbott	a648513580	production-helper: Remove /root/zulip during setup process. This fixes a problem that caused production-helper to not be idempotent.	2016-08-11 22:21:13 -07:00
Tim Abbott	7011c94465	production-helper: Use ln -nsf to install snakeoil symlinks. This fixes a problem where production-helper was not idempotent.	2016-08-11 22:20:24 -07:00
Tim Abbott	b3a768f4b2	settings: Improve ALLOWED_HOSTS defaults logic and docs. This removes the requirement for the user to put localhost/127.0.0.1 in their ALLOWED_HOSTS list, since it is now added automatically. Fixes: #1358.	2016-08-05 21:25:29 -07:00
Umair Khan	1a6e8282c8	Run 'check_send_receive_time' as 'zulip' user. Run '/puppet/zulip/files/nagios_plugins/zulip_app_frontend/check_send_receive_time' script as 'zulip' user so that the connection to the database can be made correctly.	2016-07-28 13:39:29 -07:00
Tim Abbott	039c175d68	production-helper: Hold tons of packages. This saves almost a minute doing apt upgrades in the production test suite.	2016-06-22 10:41:09 -07:00
Tim Abbott	0f2729f5fb	production-helper: use dist-upgrade to match install script. Previously, we were wasting time every time we installed packages, because `apt-get upgrade` would only install most of the packages `apt-get dist-upgrade` would.	2016-06-22 10:38:27 -07:00
Tim Abbott	6c744564a7	travis: Add debugging code for rabbitmq nagios failures.	2016-05-09 09:55:18 -07:00
Tim Abbott	804dad42e6	travis: Run various Nagios checks in production tests.	2016-05-08 17:35:50 -07:00
Tim Abbott	744e8ad0e3	travis: Set prod EXTERNAL_HOST to resolve correctly. This is needed to use check_send_receive_time in the tests.	2016-05-08 17:35:50 -07:00
Tim Abbott	e4c098fba4	travis: Verify all supervisord jobs are running in production test. This requires a bit of complexity since supervisord automatically restarts failing jobs.	2016-05-08 17:35:50 -07:00
Tim Abbott	40de75d9e6	travis: Verify the server doesn't 500 in production test.	2016-05-08 17:35:50 -07:00
Tim Abbott	52c1e8ac7d	Run a local camo server in voyager production environments. Camo is a caching image proxy, used in Zulip to avoid mixed-content warnings by proxying HTTP image content over HTTPS. We've been using it in zulip.com production for years; this change makes it available in standalone Zulip deployments.	2016-05-02 17:21:31 -07:00
Tim Abbott	48a578d003	travis: hold expensive to upgrade packages in Travis CI. This should save a few minutes of time running the production test suite. This is part of solving #722.	2016-05-02 16:59:21 -07:00
Tim Abbott	79327a61ae	travis: Do an apt-get update before the apt upgrade. This should save several minutes off the Travis CI `production` suite's runtime, since previously we were doing the full apt upgrade process twice, resulting in things like multiple expensive rebuilds of the initramfs.	2016-05-02 16:35:46 -07:00
Tim Abbott	6943a142ea	Fix postgres errors in Travis CI again. Travis CI's model of installing every version of postgres on the test VM and then shutting all the versions other than the one requested down seems to not work very well with doing apt upgrades. It seems the best way to resolve this is to just uninstall the versions we don't need.	2016-01-21 22:07:10 -08:00
Tim Abbott	a98b0cf35d	travis: Workaround postgres 9.1 conflict issues on trusty. We ran into a bug with the Travis CI infrastructure where it postgres 9.1 is installed on the system, and so when we'd do an apt upgrade with a new version of 9.1, the 9.1 daemon would end up getting started and conflict with the 9.3 daemon we were trying to run.	2016-01-09 16:59:43 -08:00
Tim Abbott	2be7ac8d70	travis: Fix prompting for user input in production-helper.	2015-12-07 20:33:36 -08:00
Tim Abbott	6eb670097c	Expand testing done via Travis CI to cover production pipeline. With this change, we are now testing the production static asset pipeline and installation process in a new testing job (and also run the frontend/backend tests separately). This means that changes that break the Zulip static asset pipeline or production installation process are more likely to fail tests. The testing is imperfect in that it does not have proper isolation -- we build a complete Zulip development environment and then install a Zulip production environment on top of it, so e.g. any apt dependencies installed for Zulip development will still be available for the Zulip production environment. But, it's better than nothing! A good v2 of this would be to have the production setup process just install the minimum stuff needed to run `build-release-tarball` and then uninstall it / clean it up so that we can do a more clear production installation, but that's more work.	2015-11-01 18:11:39 -08:00

25 Commits