zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	0451d1e47f	zulip_tools: Replace universal_newlines with text. Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Anders Kaseorg	a543dcc8e3	Remove Debian 10 support. As a consequence: • Bump minimum supported Python version to 3.8. • Move Vagrant environment to Ubuntu 20.04, which has Python 3.8. • Move CI frontend tests to Ubuntu 20.04. • Move production build test to Ubuntu 20.04. • Move 3.4 upgrade test to Ubuntu 20.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-26 16:32:02 -07:00
Anders Kaseorg	63a1ef0e91	configure-rabbitmq: Remove use of sudo. It already runs as root everywhere except in provision_inner, so move the sudo there. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-19 12:36:31 -07:00
Anders Kaseorg	cc30ed8ec7	actions: Delete zerver.lib.actions. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-14 17:14:38 -07:00
Alex Vandiver	09860dc284	check-database-compatibility: Sort and prettify output.	2022-04-06 14:10:46 -07:00
Alex Vandiver	eb31681934	check-database-compatibility: Ignore squashed and renamed migrations. Fixes: #21596.	2022-04-01 16:15:41 -07:00
Alex Vandiver	0af00a3233	upgrade: Mark puppet as having started the server. We previously used restart-server if puppet was run, as a nod to the fact that `supervisor reread && supervisor update` will _start_ service groups that were modified, even if they were previously stopped; this is because they are marked as `autostart=true`, which is honored on service change. However, upgrades want to run while there are no services running. If puppet is run, explicitly set the server as potentially being "up", so that a `shutdown_server()` before migrations, if they exist, will stop services.	2022-03-31 17:21:39 -07:00
Alex Vandiver	e9596637e7	upgrade: Move the shutdown_server calls to where they are relevant. shutdown_server is a noop if the server is already stopped; placing these in each block makes the logic more apparent.	2022-03-31 17:21:39 -07:00
Alex Vandiver	65e19c4fbd	supervisor: 'foo:' also matches 'foo'. `7c4293a7d3` switched to checking if the service was already running, and use `supervisorctl start` if it was not. Unfortunately, `list_supervisor_processes("zulip-tornado:")` did not include `zulip-tornado`, and as such a non-sharded process was always considered to _not_ be running, and was thus started, not restarted. Starting an already-started service is a no-op, and thus non-sharded tornado processes were never restarted. The observed behaviour is that requests to the tornado process attempt to load the user from the cache, with a different prefix from Django, and immediately invalidate the session and eject the user back to the login page. Fix the `list_supervisor_processes` logic to match without the trailing `:*`.	2022-03-31 10:41:41 -07:00
Anders Kaseorg	55882fb343	python: Use modern set comprehension syntax. Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-25 10:45:12 -07:00
Anders Kaseorg	1f68c73e66	supervisor: Update superseded super(C, self) syntax to superior super(). Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-25 10:45:12 -07:00
Anders Kaseorg	2762121162	python: Convert last type comments to annotations. We had skipped these in #14693 so we could keep generating a friendly error on Python 3.5, but we gave that up in #19801. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-24 20:32:39 -07:00
Alex Vandiver	d7b59c86ce	puppet: Build wal-g from source for aarch64. Since wal-g does not provide binaries for aarch64, build them from source. While building them from source for amd64 would better ensure that the build process is tested, the build process takes 7min and 700M of temp files, which is an unacceptable cost; we thus only build on aarch64. Since the wal-g build process uses submodules, which are not in the GitHub export, we clone the full wal-g repository. Because the repository is relatively small, we clone it anew on each new version, rather than attempt to manage the remotes. Fixes #21070.	2022-03-22 15:02:35 -07:00
Alex Vandiver	a4d0f03319	scripts: Switch to stop-server/restart-server. stop-server and restart-server address all services which talk to the database, and are thus more correct than restarting or stopping everything in supervisor. This is possible now that the previous commit ensures that the zulip user can read the zulip installation directory during `create-database`; previously, that directory was still owned by root when `create-database` was run, whereas now it is in `~zulip/deployments/`.	2022-03-21 16:33:28 -07:00
Alex Vandiver	c0cc98c6a8	install: Re-order final steps. Move database creation to immediately before database initialization; this means it happens in a directory readable by the `zulip` user, as well as placing it alongside similar operations. It removes the check for the `zulip::postgresql_common` Puppet class; instead it keeps the check for `--no-init-db`, and switches to require `zulip::app_frontend_base`. This is a behavior change for any install of `zulip::postgresql_common`-only classes, but that is not a common form -- and such installs likely already pass `--no-init-db` because they are warm spare replicas. As a result, all non-`zulip::app_frontend_base` installs now skip database initialization, even without `--no-init-db`. This is clearly correct for, e.g. Redis-only hosts, and makes clearer that the frontend, not the database host, is responsible for database initialization.	2022-03-21 16:33:28 -07:00
Alex Vandiver	394f1eadde	setup: Rename postgresql-init-db to create-database. The old name was confusingly similar to initialize-database.	2022-03-21 16:33:28 -07:00
Anders Kaseorg	7d4b02738d	install-node: Upgrade Node.js from 16.14.0 to 16.14.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-17 15:24:46 -07:00
Anders Kaseorg	84e91a6e33	configure-rabbitmq: Use rabbitmqctl await_online_nodes. rabbitmqctl ping only checks that the Erlang process is registered with epmd. There’s a window after that where the rabbit app is still starting inside it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-14 16:26:05 -07:00
Alex Vandiver	52d363cada	upgrade: Skip re-checking of new bots on upgrade. This was added in `c770bdaa3a`, and we have not added any realm-internal bots since `c770bdaa3a`. Speed up the critical period during upgrades by skipping this step.	2022-03-14 14:14:53 -07:00
Alex Vandiver	d26a15b14d	setup-apt-repo: Make hashes file not contain full path. Using an absolute `ZULIP_SCRIPTS` path when computing sha256sums results in a set of hashes which varies based on the path that the script is called as. This means that each deploy _always_ has `setup-apt-repo --verify` fail, since it is a different base path. Make all paths passed to sha256sum be relative to the repository root, ensuring they can be compared across runs.	2022-03-12 17:24:19 -08:00
Alex Vandiver	7c4293a7d3	restart-server: Check if service is running before restart, vs start. In some instances (e.g. during upgrades) we run `restart-server` and not `start-server`, even though we expect the server to most likely already be stopped. `supervisorctl restart servicename` if the service is stopped produces the perhaps-alarming message: ``` restart-server: Restarting servicename servicename: ERROR (not running) servicename: started ``` This may cause operators to worry that something is broken, when it is not. Check if the service is already running, and switch from "restart" to "start" in cases where it is not. The race condition here is safe -- if the service transitions from stopped to started between the check and the `start` call, it will merely output: ``` servicename: ERROR (already started) ``` ...and continue, as that has exit status 0. If the service transitions from started to stopped between the check and the `restart` call, we are merely back in the current case, where it outputs: ``` servicename: ERROR (not running) servicename: started ``` In none of these cases does a call to "restart" fail to result in the service being stopped and then started.	2022-03-09 14:42:15 -08:00
Anders Kaseorg	646e466341	install: Desupport Ubuntu 22.04 for now. Ubuntu 22.04 pushed a post-feature-freeze update to Python 3.10, breaking virtual environments in a Debian patch (https://bugs.launchpad.net/ubuntu/+source/python3.10/+bug/1962791). Also, our antique version of Tornado doesn’t work in 3.10, and we’ll need to do some work to upgrade that. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-07 11:46:07 -08:00
Anders Kaseorg	60e943b92e	install-node: Upgrade Node.js from 16.13.2 to 16.14.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-01 23:09:46 -08:00
Anders Kaseorg	de1fb2b8d0	check-database-compatibility: Ignore guardian, django.contrib.sites. We can safely ignore the presence of the extra tables that could be left behind in the database from when we had these installed (before Zulip 1.7.0 and 2.0.0, respectively). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-01 10:30:23 -08:00
Tim Abbott	98a05257ea	scripts: Print names of missing migrations in compatibility check. This will make it much easier to debug any situations where this happens.	2022-02-28 11:09:52 -08:00
Anders Kaseorg	894a50b5c9	install: Support Ubuntu 22.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-25 14:49:07 -08:00
Anders Kaseorg	f9997e311c	generate-self-signed-cert: Remove RANDFILE. This was not needed for OpenSSL ≥ 1.1.1 (all our supported platforms), and breaks with OpenSSL ≥ 3.0.0 (Ubuntu 22.04). It was removed from the upstream configuration file too: https://bugs.debian.org/990228. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-25 14:49:07 -08:00
Anders Kaseorg	f852af0709	upgrade-zulip-stage-2: Set default PostgreSQL version for Debian 11. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-25 14:49:07 -08:00
Anders Kaseorg	1fa2761790	upgrade-zulip-stage-2: Remove create_large_indexes optimization. This was only used for upgrading from Zulip < 1.9.0, which is no longer possible because Zulip < 2.1.0 had no common supported platforms with current main. If we ever want this optimization for a future migration, it would be better implemented using Django merge migrations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-23 11:59:45 -08:00
Anders Kaseorg	1629d6bfb3	python: Reformat with Black 22 (stable). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-18 18:03:13 -08:00
Alex Vandiver	1d2582c899	upgrade: Log the commit hash and directory when upgrading.	2022-02-16 12:33:58 -08:00
Anders Kaseorg	f6a701090c	setup-apt-repos: Don’t install lsb_release. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-14 16:38:53 -08:00
Anders Kaseorg	9c8d2b7be3	apt-repos: Downgrade PostgreSQL to dodge PGroonga regression. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	43c4672deb	apt-repos: Remove groovy. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	fdc1294993	setup-apt-repo: Support installing an APT preferences file. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	7077a289ae	setup-apt-repo: Move supported release check earlier. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	c8bb98554e	setup-apt-repo: Use /etc/os-release instead of lsb_release. But still install lsb-release for now since Puppet acts funny without it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	d1241be496	configure-rabbitmq: Use rabbitmqctl ping. Our supported distributions now all have RabbitMQ ≥ 3.7.8. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:09:41 -08:00
Tim Abbott	1a7c4a0276	scripts: Fix typo in logging statement.	2022-02-11 13:47:24 -08:00
Alex Vandiver	8da6098631	upgrade: Catch "upgrade" attempts which would downgrade the database. Attempting to "upgrade" from `main` to 4.x should abort; Django does not prevent running old code against the new database (though it likely errors at runtime), and `./manage.py migrate` from the old version during the "upgrade" does not downgrade the database, since the migrations are entirely missing in that directory, so don't get reversed. Compare the list of applied migrations to the list of on-disk migrations, and abort if there are applied migrations which are not found on disk. Fixes: #19284.	2022-02-10 16:02:49 -08:00
Alex Vandiver	71e02d7893	zulip_tools: Factor out ZULIP_VERSION parsing.	2022-02-10 16:02:49 -08:00
Anders Kaseorg	e1f42c1ac5	docs: Add missing space to compound verbs “back up”, “log in”, etc. Noun: backup, login, logout, lookup, setup. Verb: back up, log in, log out, look up, set up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 19:20:54 -08:00
Anders Kaseorg	b0ce4f1bce	docs: Fix many spelling mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 18:51:06 -08:00
Alex Vandiver	2066860ab6	start-server: Start auxiliary services, if they exist. Services like go-camo and smokescreen are not stopped in stop-server, since they are upgraded and restarted by puppet application. As such, they also do not appear in start-server, despite the server relying on them to be running to function properly. Ensure those services are started, by starting them in start-server, if they are configured in supervisor on the host.	2022-01-26 12:39:54 -08:00
Alex Vandiver	88c3f560ae	supervisor: Add a filter for only(-not)-running.	2022-01-26 12:39:54 -08:00
Alex Vandiver	7243c3c73d	scripts: Re-implement list_supervisor_processes using API.	2022-01-26 12:39:54 -08:00
Alex Vandiver	8e35cdb3da	scripts: Add a supervisor package, to use the XMLRPC Supervisor API. For many uses, shelling out to `supervisorctl` is going to produce better error messages. However, for instances where we wish to parse the output of `supervisorctl`, using the API directly is less brittle.	2022-01-26 12:39:54 -08:00
Anders Kaseorg	aec6cd4cdb	reindex-textual-data: Find psycopg2 in the virtualenv. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-26 11:56:30 -08:00
Alex Vandiver	a5496f4098	CVE-2021-43799: Set a secure Erlang cookie. The RabbitMQ docs state ([1]): RabbitMQ nodes and CLI tools (e.g. rabbitmqctl) use a cookie to determine whether they are allowed to communicate with each other. [...] The cookie is just a string of alphanumeric characters up to 255 characters in size. It is usually stored in a local file. ...and goes on to state (emphasis ours): If the file does not exist, Erlang VM will try to create one with a randomly generated value when the RabbitMQ server starts up. Using such generated cookie files are appropriate in development environments only. The auto-generated cookie does not use cryptographic sources of randomness, and generates 20 characters of `[A-Z]`. Because of a semi-predictable seed, the entropy of this password is thus less than the idealized 26^20 = 94 bits of entropy; in actuality, it is 36 bits of entropy, or potentially as low as 20 if the performance of the server is known. These sizes are well within the scope of remote brute-force attacks. On provision, install, and upgrade, replace the default insecure 20-character Erlang cookie with a cryptographically secure 255-character string (the max length allowed). [1] https://www.rabbitmq.com/clustering.html#erlang-cookie	2022-01-25 02:13:53 +00:00
Alex Vandiver	93a344fc3c	configure-rabbitmq: Set -u, and not -x.	2022-01-25 01:52:36 +00:00
Alex Vandiver	ece96c9729	configure-rabbitmq: Factor out sudo, instead of rabbitmqctl.	2022-01-25 01:52:36 +00:00
Alex Vandiver	bd7deed691	upgrade: Show output from (re)starting zulip. `5c450afd2d`, in ancient history, switched from `check_call` to `check_output` and throwing away its result. Use check_call, so that we show the steps to (re)starting the server.	2022-01-25 01:52:34 +00:00
Alex Vandiver	e705883857	CVE-2021-43799: During upgrades, restart rabbitmq if necessary. Check if it is listening on a public interface on port 25672, and if so shut it down so it can pick up the new configuration.	2022-01-25 01:51:56 +00:00
Alex Vandiver	da5201b986	upgrade: Make calling shutdown_server twice, only try once.	2022-01-25 01:48:05 +00:00
Alex Vandiver	43d63bd5a1	puppet: Always set the RabbitMQ nodename to zulip@localhost. This is required in order to lock down the RabbitMQ port to only listen on localhost. If the nodename is `rabbit@hostname`, in most circumstances the hostname will resolve to an external IP, which the rabbitmq port will not be bound to. Installs which used `rabbit@hostname`, due to RabbitMQ having been installed before Zulip, would not have functioned if the host or RabbitMQ service was restarted, as the localhost restrictions in the RabbitMQ configuration would have made rabbitmqctl (and Zulip cron jobs that call it) unable to find the rabbitmq server. The previous commit ensures that configure-rabbitmq is re-run after the nodename has changed. However, rabbitmq needs to be stopped before `rabbitmq-env.conf` is changed; we use an `onlyif` on an `exec` to print the warning about the node change, and let the subsequent config change and notify of the service and configure-rabbitmq to complete the re-configuration.	2022-01-25 01:48:02 +00:00
Alex Vandiver	3bfcfeac24	puppet: Run configure-rabbitmq on nodename change. `/etc/rabbitmq/rabbitmq-env.conf` sets the nodename; anytime the nodename changes, the backing database changes, and this requires re-creating the rabbitmq users and permissions. Trigger this in puppet by running configure-rabbitmq after the file changes.	2022-01-25 01:46:51 +00:00
Alex Vandiver	b6cd89440e	setup: Remove unused RABBITMQ_NODE. This reverts commit `889547ff5e`. It is unused in the Docker container, as the configuration of the `zulip` user in the rabbitmq node is done via environment variables. The Zulip host in that context does not have `rabbitmqctl` installed, and would have needed to know the Erlang cookie to be able to run these commands.	2022-01-25 01:46:51 +00:00
Anders Kaseorg	21548ff7c0	install-node: Upgrade Node.js from 16.13.1 to 16.13.2. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-24 15:55:38 -08:00
Alex Vandiver	a3adaf4aa3	puppet: Fix standalone certbot configurations. This addresses the problems mentioned in the previous commit, but for existing installations which have `authenticator = standalone` in their configurations. This reconfigures all hostnames in certbot to use the webroot authenticator, and attempts to force-renew their certificates. Force-renewal is necessary because certbot contains no way to merely update the configuration. Let's Encrypt allows for multiple extra renewals per week, so this is a reasonable cost. Because the certbot configuration is `configobj`, and not `configparser`, we have no way to easily parse to determine if webroot is in use; additionally, `certbot certificates` does not provide this information. We use `grep`, on the assumption that this will catch nearly all cases. It is possible that this will find `authenticator = standalone` certificates which are managed by Certbot, but not Zulip certificates. These certificates would also fail to renew while Zulip is running, so switching them to use the Zulip webroot would still be an improvement. Fixes #20593.	2022-01-24 12:13:44 -08:00
Alex Vandiver	76ce8631c0	setup: Install a temporary certificate, before certbot runs. Installing certbot with --method=standalone means that the configuration file will be written to assume that the standalone method will be used going forward. Since nginx will be running, attempts to renew the certificate will fail. Install a temporary self-signed certificate, just to allow nginx to start, and then follow up (after applying puppet to start nginx) with the call to setup-certbot, which will use the webroot authenticator. The `setup-certbot --method=standalone` option is left intact, for use in development environments. Fixes part of #20593; it does not address installs which were previously improperly configured with `authenticator = standalone`.	2022-01-24 12:13:44 -08:00
Anders Kaseorg	97e4e9886c	python: Replace universal_newlines with text. This is supported in Python ≥ 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-23 22:16:01 -08:00
Anders Kaseorg	a58a71ef43	Remove Ubuntu 18.04 support. As a consequence: • Bump minimum supported Python version to 3.7. • Move Vagrant environment to Debian 10, which has Python 3.7. • Move CI frontend tests to Debian 10. • Move production build test to Debian 10. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-21 17:26:14 -08:00
Alex Vandiver	677467f040	upgrade-zulip-from-git: Fix upstream URL for existing deploys.	2022-01-18 21:10:38 -08:00
Alex Vandiver	bad58cdca6	upgrade-zulip-from-git: Fix the upstream URL to not be the custom remote.	2022-01-18 21:10:38 -08:00
Alex Vandiver	6bc5849ea8	puppet: Remove now-unused debathena apt repository.	2022-01-18 14:13:28 -08:00
Anders Kaseorg	e2cc554077	zulip_tools: Rename may_be_perform_purging to maybe_perform_purging. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-12 13:21:35 -08:00
Alex Vandiver	b31658482b	upgrade-zulip: Pass any arguments down to upgrade-zulip-stage-2. This is the equivalent of `93f3da4c05` but for the tarball codepath.	2022-01-11 14:26:54 -08:00
Alex Vandiver	06e115bb00	zulip_tools: Switch get_deploy_options to use shlex.split. This makes it honor quoting in the config file.	2022-01-11 14:26:54 -08:00
Anders Kaseorg	1cc1de82cd	reindex-textual-data: Reindex textual functional indexes too. This catches nine functional indexes that the previous query didn’t: upper_preregistration_email_idx upper_stream_name_idx upper_subject_idx upper_userprofile_email_idx zerver_message_recipient_upper_subject zerver_mutedtopic_stream_topic zerver_stream_realm_id_name_uniq zerver_userprofile_realm_id_delivery_email_uniq zerver_userprofile_realm_id_email_uniq Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-07 10:37:04 -08:00
Alex Vandiver	6218ed91c2	puppet: Use lazy-apps and uwsgi control sockets for rolling reloads. Restarting the uwsgi processes by way of supervisor opens a window during which nginx 502's all responses. uwsgi has a configuration called "chain reloading" which allows for rolling restart of the uwsgi processes, such that only one process at once is unavailable; see uwsgi documentation ([1]). The tradeoff is that this requires that the uwsgi processes load the libraries after forking, rather than before ("lazy apps"); in theory this can lead to larger memory footprints, since they are not shared. In practice, as Django defers much of the loading, this is not as much of an issue. In a very basic test of memory consumption (measured by total memory - free - caches - buffers; 6 uwsgi workers), both immediately after restarting Django, and after requesting `/` 60 times with 6 concurrent requests: \| Non-lazy \| Lazy app \| Difference ------------------+------------+------------+------------- Fresh \| 2,827,216 \| 2,870,480 \| +43,264 After 60 requests \| 3,332,284 \| 3,409,608 \| +77,324 ..................\|............\|............\|............. Difference \| +505,068 \| +539,128 \| +34,060 That is, "lazy app" loading increased the footprint pre-requests by 43MB, and after 60 requests grew the memory footprint by 539MB, as opposed to non-lazy loading, which grew it by 505MB. Using wsgi "lazy app" loading does increase the memory footprint, but not by a large percentage. The other effect is that processes may be served by either old or new code during the restart window. This may cause transient failures when new frontend code talks to old backend code. Enable chain-reloading during graceful, puppetless restarts, but only if enabled via a zulip.conf configuration flag. Fixes #2559. [1]: https://uwsgi-docs.readthedocs.io/en/latest/articles/TheArtOfGracefulReloading.html#chain-reloading-lazy-apps	2022-01-05 14:48:52 -08:00
Alex Vandiver	4aaa250623	zulip_tools: Fix a typo in a comment.	2022-01-05 14:48:52 -08:00
Alex Vandiver	9d85f64e5a	upgrade-zulip-stage-2: Pass through --skip-tornado and --less-graceful. These restart-server arguments are useful to be able to provide to `upgrade-zulip`.	2021-12-31 11:17:14 -08:00
Alex Vandiver	fb3368b482	restart-server: Factor out argparser, to allow reuse.	2021-12-31 11:17:14 -08:00
Alex Vandiver	93f3da4c05	upgrade-from-git: Pass unknown options through to the upgrade process.	2021-12-31 11:17:14 -08:00
Anders Kaseorg	82748d45d8	install-yarn: Use test -ef in case /srv is a symlink. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-30 13:42:07 -08:00
Anders Kaseorg	0b454dda12	install: Try apt-get update if the Ubuntu universe check fails. On a system where ‘apt-get update’ has never been run, ‘apt-cache policy’ may show no repositories at all. Try to correct this with ‘apt-get update’ before giving up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-16 17:56:23 -08:00
Alex Vandiver	f6520a97cd	setup-certbot: Reinstate nginx reload after installation. If nginx was already installed, and we're using the webroot method of initializing certbot, nginx needs to be reloaded. Hooks in `/etc/letsencrypt/renewal-hooks/deploy/` do not run during initial `certbot certonly`, so an explicit reload is required.	2021-12-10 16:43:53 -08:00
Alex Vandiver	01e8f752a8	puppet: Use certbot package timer, not our own cron job. The certbot package installs its own systemd timer (and cron job, which disables itself if systemd is enabled) which updates certificates. This process races with the cron job which Zulip installs -- the only difference being that Zulip respects the `certbot.auto_renew` setting, and that it passes the deploy hook. This means that occasionally nginx would not be reloaded, when the systemd timer caught the expiration first. Remove the custom cron job and `certbot-maybe-renew` script, and reconfigure certbot to always reload nginx after deploying, using certbot directory hooks. Since `certbot.auto_renew` can't have an effect, remove the setting. In turn, this removes the need for `--no-zulip-conf` to `setup-certbot`. `--deploy-hook` is similarly removed, as running deploy hooks to restart nginx is now the default; pass `--no-directory-hooks` in standalone mode to not attempt to reload nginx. The other property of `--deploy-hook`, of skipping symlinking into place, is given its own flag.	2021-12-09 13:47:33 -08:00
Tim Abbott	9aa2e0ad45	upgrade-zulip-from-git: Improve webpack failure error handling. We've had a number of unhappy reports of upgrades failing due to webpack requiring too much memory. While the previous commit will likely fix this issue for everyone, it's worth improving the error message for failures here. We avoid doing the stop+retry ourselves, because that could cause an outage in a production system if webpack fails for another reason. Fixes #20105.	2021-12-09 12:26:34 -08:00
Tim Abbott	72b381d749	upgrade-zulip-from-git: Require more memory to run webpack. Since the upgrade to Webpack 5, we've been seeing occasional reports that servers with roughly 4GiB of RAM were getting OOM kills while running webpack. Since we can't readily optimize the memory requirements for webpack itself, we should raise the RAM requirements for doing the lower-downtime upgrade strategy. Fixes #20231.	2021-12-09 12:23:25 -08:00
Alex Vandiver	939d2e2705	scripts: Only stop/start existing tornado processes. Stopping both `zulip-tornado` and `zulip-tornado:` causes errors on deploys with tornado sharding, as the plain `zulip-tornado` service does not exist. Pass `zulip-tornado:`, which matches both plain `zulip-tornado`, as well as the sharded `zulip-tornado:zulip-tornado-port-9800` cases.	2021-12-08 14:06:06 -08:00
Tim Abbott	73d503995a	scripts: Fix running compare-settings-to-template from any CWD. This matches the number of dirname() calls for other files in its directory. Fixes #20489.	2021-12-07 14:45:53 -08:00
Anders Kaseorg	2e5af073b7	install-node: Upgrade Node.js from 16.13.0 to 16.13.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-03 14:33:53 -08:00
Anders Kaseorg	2e1a8ff632	configure-rabbitmq: Increase startup timeout. Starting RabbitMQ at boot seems to have gotten slower, which broke ‘vagrant up --provision’. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-03 14:32:23 -08:00
Alex Vandiver	3455fc137a	upgrade-postgresql: Check for extension upgrade steps.	2021-11-20 07:13:50 -08:00
Alex Vandiver	544e8c569e	install: Switch default to PostgreSQL 14.	2021-11-08 18:21:46 -08:00
Alex Vandiver	f77bbd3323	upgrade-postgresql: Switch to vacuumdb --all --analyze-only --jobs 10. The `analyze_new_cluster.sh` script output by `pg_upgrade` just runs `vacuumdb --all --analyze-in-stages`, which runs three passes over the database, getting better stats each time. Each of these passes is independent; the third pass does not require the first two. `--analyze-in-stages` is only provided to get "something" into the database, on the theory that it could then be started and used. Since we wait for all three passes to complete before starting the database, the first two passes add no value. Additionally, PostgreSQL 14 and up stop writing the `analyze_new_cluster.sh` script as part of `pg_upgrade`, suggesting the equivalent `vacuumdb --all --analyze-in-stages` call instead. Switch to explicitly call `vacuumdb --all --analyze-only`, since we do not gain any benefit from `--analyze-in-stages`. We also enable parallelism, with `--jobs 10`, in order to analyze up to 10 tables in parallel. This may increase load, but will accelerate the upgrade process.	2021-11-08 18:21:46 -08:00
Anders Kaseorg	f2a443a736	install-node: Upgrade Node.js from 14.18.1 to 16.13.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-11-05 17:34:13 -07:00
Anders Kaseorg	458844a2f5	install-yarn: Verify that the install location is /srv/zulip-yarn. scripts.lib.node_cache expects Yarn to be in /srv/zulip-yarn, so if it’s installed somewhere else, even if it’s the right version, we need to reinstall it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-11-03 16:49:58 -07:00
rht	bb8504d925	lint: Fix typos found by codespell.	2021-10-19 16:51:13 -07:00
Anders Kaseorg	291087d70c	install-yarn: Upgrade Yarn from 1.22.11 to 1.22.17. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-17 07:15:09 -07:00
Anders Kaseorg	7df96b78c6	install-node: Upgrade Node.js from 14.17.6 to 14.18.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-17 07:15:09 -07:00
Anders Kaseorg	2f993f1a79	install-node: Stop using NVM. NVM doesn’t check hashes or signatures and really just adds complexity we don’t need. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-24 06:58:32 -07:00
Anders Kaseorg	902883d818	setup_venv: Skip virtualenv’s automatic download of setuptools. It recently started failing on Debian 10 (buster). We immediately follow this by replacing these packages with our own versions from pip.txt, anyway. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-23 14:29:04 -07:00
Anders Kaseorg	08e459b393	zulip_tools: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-22 13:58:46 -07:00
Anders Kaseorg	9bed17e0ab	install-node: Upgrade Node.js from 14.17.5 to 14.17.6. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-13 10:12:43 -07:00
Gaurav Pandey	502697d239	docs: Add documentation for bullseye support. Support for bullseye was added in #17951, but it was not documented because bullseye was frozen and did not have proper configuration files. Now that bullseye is released as a stable version, its support can be documented.	2021-09-09 11:05:16 -07:00
Anders Kaseorg	915884bff7	docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-08 12:06:24 -07:00
Anders Kaseorg	02582c6956	upgrade-zulip-from-git: Run git fetch with --prune. This prevents upgrading to an obsolete version of a branch that has been deleted or renamed. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-01 05:34:57 -07:00
Anders Kaseorg	3cb66d59ac	install: Remove /dev/null redirect for zulip-puppet-apply. The usual output from this command looks like Notice: Compiled catalog for localhost in environment production in 2.33 seconds Notice: /Stage[main]/Zulip::Apt_repository/Exec[setup_apt_repo]/returns: current_value 'notrun', should be ['0'] (noop) Notice: Class[Zulip::Apt_repository]: Would have triggered 'refresh' from 1 event Notice: Stage[main]: Would have triggered 'refresh' from 1 event Notice: Applied catalog in 1.20 seconds which doesn’t seem abnormally alarming, and hiding it makes failures harder to diagnose. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-31 16:30:53 -07:00
Alex Vandiver	faf71eea41	upgrade-postgresql: Do not remove other supervisor configs. We previously used `zulip-puppet-apply` with a custom config file, with an updated PostgreSQL version but more limited set of `puppet_classes`, to pre-create the basic settings for the new cluster before running `pg_upgradecluster`. Unfortunately, the supervisor config uses `purge => true` to remove all SUPERVISOR configuration files that are not included in the puppet configuration; this leads to it removing all other supervisor processes during the upgrade, only to add them back and start them during the second `zulip-puppet-apply`. It also leads to `process-fts-updates` not being started after the upgrade completes; this is the one supervisor config file which was not removed and re-added, and thus the one that is not re-started due to having been re-added. This was not detected in CI because CI added a `start-server` command which was not in the upgrade documentation. Set a custom facter fact that prevents the `purge` behaviour of the supervisor configuration. We want to preserve that behaviour in general, and using `zulip-puppet-apply` continues to be the best way to pre-set-up the PostgreSQL configuration -- but we wish to avoid that behaviour when we know we are applying a subset of the puppet classes. Since supervisor configs are no longer removed and re-added, this requires an explicit start-server step in the instructions after the upgrades complete. This brings the documentation into alignment with what CI is testing.	2021-08-24 19:00:58 -07:00
Anders Kaseorg	7b2e585213	install-yarn: Upgrade Yarn from 1.22.10 to 1.22.11. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-23 12:33:27 -07:00
Anders Kaseorg	ebb8e9109c	install-node: Upgrade Node.js from 14.17.3 to 14.17.5. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-23 12:29:04 -07:00
Anders Kaseorg	4206e5f00b	python: Remove locally dead code. These changes are all independent of each other; I just didn’t feel like making dozens of commits for them. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-19 01:51:37 -07:00
Alex Vandiver	c9bb2c16cc	restart-server: Add a --skip-tornado. Tornado restarts are the most user-visible; provide a means to restart everything but them, for changes which are known to not affect Tornado.	2021-08-04 10:57:53 -07:00
Tim Abbott	d439a2a53e	emails: Create wider marketing email base template. For our marketing emails, we want a width that's more appropriate for newsletter context, vs. the narrow emails we use for transactional content. I haven't figured out a cleaner way to do this than duplicating most of email_base_default.source.html. But it's not a big deal to duplicate, since we've been changing that base template only about once a year.	2021-08-03 11:57:31 -07:00
Anders Kaseorg	5483ebae37	python: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	ad5f0c05b5	python: Remove default "utf8" argument for encode(), decode(). Partially generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	1760897a8c	python: Remove default "r" mode for open(). Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	3665deb93a	python: Remove unnecessary intermediate lists. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
manavdesai27	572cef9a0f	provision: Add support for Fedora 34.	2021-07-20 12:10:41 -07:00
Alex Vandiver	91282ab490	reindex-textual-data: Provide a tool to reindex all text indices. The script is added to upgrade steps for 20.04 and Buster because those are the upgrades that cross glibc 2.28, which is most problematic. It will also be called out in the upgrade notes, to catch those that have already done that upgrade.	2021-07-19 16:34:23 -07:00
Anders Kaseorg	47897c76a2	scripts: Use curl -f (--fail). This makes curl exit with nonzero status on HTTP 4xx/5xx errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-13 16:47:49 -07:00
Alex Vandiver	16691110a6	scripts: Only stop/restart zulip_deliver_scheduled_* processes if known. Running `supervisorctl stop` or `supervisorctl restart` on a process name which is not known is an error: ``` $ supervisorctl stop nonexistent-process nonexistent-process: ERROR (no such process) $ echo $? 1 ``` `ef6d0ec5ca` moved zulip_deliver_scheduled_* out of the `workers:` group. Since upgrades run `stop-server` before applying puppet, the list of processes at that time is from the previous version of Zulip, so may not have the new `zulip_deliver_scheduled_` names -- and the `stop-server` will hence fail. If the upgrade is not applying puppet, it will `restart-server`. At that point, the old names will still be in the configuration, so relying on the current `supervisorctl status` is the best gauge of what exists to restart. In short, only ever stop/start/restart the `zulip_deliver_scheduled_` processes if `supervisorctl status` knows about them already.	2021-07-09 10:04:53 -07:00
Alex Vandiver	c94bdd8534	zulip_tools: Find missing processes/groups in list_supervisor_processes. Nonexistent processes and groups passed to `supervisorctl status` are printed to STDOUT as follows: ``` $ supervisorctl status zulip-django nonexistent-process nonexistent-group:* nonexistent-process: ERROR (no such process) nonexistent-group: ERROR (no such group) zulip-django RUNNING pid 16043, uptime 17:31:31 ``` On supervisor 4 and above, this exits with an exit code of 4; previously, it returned exit code 0. Ubuntu 18.04 has version 3.3.1, and Ubuntu 20.04 has version 4.1.0. Skip any lines with `ERROR (no such ...)`, and accept exit code 4 from `supervisorctl status`.	2021-07-09 10:04:53 -07:00
Alex Vandiver	85a9c0982a	zulip_tools: Extract out `list_supervisor_processes`.	2021-07-09 10:04:53 -07:00
Anders Kaseorg	d83c91526b	install-node: Upgrade Node.js from 14.17.0 to 14.17.3. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-05 14:51:24 -07:00
Anders Kaseorg	684dad8145	tools: Use root-based absolute import for tools.lib, etc. Mypy can’t follow absolute imports based on directories other than the root. This was hiding some type errors due to ignore_missing_imports. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-05 12:21:52 -07:00
Anders Kaseorg	7d71a1a31a	setup: Add missing __init__.py. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-05 12:20:39 -07:00
Alex Vandiver	88c19bf65a	puppet: Catch when a comma is left out of puppet_classes. With two space-separated classes in `puppet_classes`, the second one is silently ignored. With three or more, puppet generates the following very opaque error message: ``` Error: Could not parse for environment production: This Name has no effect. A value was produced and then forgotten (one or more preceding expressions may have the wrong form) ``` Catch when this has happened, and give an error message to the user. Fixes #18992.	2021-06-28 20:58:56 -04:00
Anders Kaseorg	0ba9114c22	install-yarn: Rewrite Yarn installer. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-27 16:58:17 -07:00
Gaurav Pandey	af08bcdb3f	management: Delete send_stats command. This command is part of a statsd infrastructure that we stopped supporting years ago. Its only purpose for some time has been to provide sample code for how the restart script might trigger a notification to a graphing system, which doesn't justify maintaining it. Fixes part of #18898.	2021-06-25 09:13:48 -07:00
Anders Kaseorg	91bfebca7d	install: Replace wget with curl. curl uses Happy Eyeballs to avoid long timeouts on systems with broken IPv6. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-25 09:05:07 -07:00
Anders Kaseorg	3b60b25446	ci: Remove bullseye hack. base-files 11.1 marked bullseye as Debian 11 in /etc/os-release. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-24 14:35:51 -07:00
Anders Kaseorg	bf361e9951	ci: Remove uses of VERSION_CODENAME. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-24 14:35:51 -07:00
Tim Abbott	83738f7e6d	install: Use a period at end of root error message.	2021-06-23 08:42:46 -07:00
Gaurav Pandey	faae845366	upgrade: Modify upgrade scripts to handle failure. The current `upgrade-zulip` and `upgrade-zulip-from-git` bash scripts exit with a zero status even if the upgrade commands exit with a non-zero status. Hence add a `set -e` command, which exits the script with the same status as the failing command. For pipe commands, however, the net status of a command is the status of the last command, so if the other parts fail, the net status is still determined only by the last command. This is the case with our main /lib/upgrade-zulip* command in the scripts, whose status is determined by the `tee` command instead. Hence add a small condition to get the status of the actual upgrade command and exit the script if it fails with a non-zero status. We also check whether the script is being run as root, matching the install script logic.	2021-06-23 08:42:20 -07:00
Tim Abbott	28d49edee3	script: Add --no-headings option to purge-old-deployments. This parameter is somewhat useful, and adding this also fixes a regression where purge-old-deployments would crash since the changes around `c5580607a7` because of inconsistent supported args lists.	2021-06-17 15:49:23 -07:00
Mateusz Mandera	06c0a29e47	email-mirror-postfix: Choose scheme based on http_only config. Fixes #16659. If the server is behind a reverse proxy with http_only=True, the requests made by email-mirror-postfix need to use http, as https doesn't work.	2021-06-17 09:06:09 -07:00
Alex Vandiver	d51272cc3d	puppet: Remove zulip_deliver_scheduled_* from zulip-workers:. Staging and other hosts that are `zulip::app_frontend_base` but not `zulip::app_frontend_once` do not have a /etc/supervisor/conf.d/zulip/zulip-once.conf and as such do not have `zulip_deliver_scheduled_emails` or `zulip_deliver_scheduled_messages` and thus supervisor will fail to reload. Making the contents of `zulip-workers` contingent on if the server is _also_ a `-once` server is complicated, and would involve using Concat fragments, which severely limit readability. Instead, expel those two from `zulip-workers`; this is somewhat reasonable, since they use an entirely different codepath from zulip_events_, using the database rather than RabbitMQ for their queuing.	2021-06-14 17:12:59 -07:00
Riken Shah	c5580607a7	purge-old-deployments: Use the `clean_unused_caches.main` function. We currently run the `clean_unused_caches.py` as a script to clean the unused caches. This commit replaces that with `clean_unused_caches.main` function as it would be faster.	2021-06-12 07:28:16 -07:00
Riken Shah	45af71e33b	clean_unused_caches: Allow the main function to accept `Namespace` args. This commit will allow us to pass the arguments in the 'clean...' functions when calling the `main` function (in `provision`). It also changes args parsing function location to `if __name__ == "__main__"` block as we wouldn't need it to parse args when we call the function.	2021-06-12 07:28:16 -07:00
Riken Shah	4f54e15993	refactor: Convert `clean-unused-caches` to `clean_unused_caches.py`. We convert the `clean-unused-caches` script to a python file so we can run it in provision by importing it instead of running the script, hence saving some time.	2021-06-12 07:28:16 -07:00
Anders Kaseorg	d8cb418586	zulip_tools: Flush ‘set -x’-style messages in run. Otherwise they often get buffered until after the command actually runs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-09 14:05:31 -07:00
Anders Kaseorg	342834ee9c	python: Simplify stdio flushing using print(…, flush=True). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-09 14:05:31 -07:00
Anders Kaseorg	bc169d63a7	install-node: Upgrade Node.js from 14.16.1 to 14.17.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-08 16:02:12 -07:00
Anders Kaseorg	61e1e38a00	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-07 17:57:51 -07:00
Alex Vandiver	1cdf14d195	puppet: Add a teleport server. See https://goteleport.com/docs/architecture/overview/ for the general architecture of a Teleport cluster. This commit adds a Teleport auth[1] and proxy[2] server. The auth server serves as a CA for granting time-bounded access to users and authenticating nodes on the cluster; the proxy provides access and a management UI. [1] https://goteleport.com/docs/architecture/authentication/ [2] https://goteleport.com/docs/architecture/proxy/	2021-06-02 18:38:38 -07:00
Alex Vandiver	e080a05b05	node_cache: Serialize to structured data before hashing. Appending data back-to-back without serializing it loses the information about where the breaks between them lie, which can lead to different inputs having the same hash.	2021-05-27 22:47:56 -07:00
Alex Vandiver	87a109e3e0	puppet: Pull in pinned puppet modules. Using puppet modules from the puppet forge judiciously will allow us to simplify the configuration somewhat; this specifically pulls in the stdlib module, which we were already using parts of.	2021-05-27 21:14:48 -07:00
Anders Kaseorg	cb8d9a1f8a	create-db: Default dbuser and dbname to zulip. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-26 17:19:11 -07:00
Alex Vandiver	f3eea72c2a	setup: Merge multiple setup-apt-repo scripts into one. This moves the `.asc` files into subdirectories, and writes out the according `.list` files into them. It moves from templates to written-out `.list` files for clarity and ease of implementation (Debian and Ubuntu need different templates for `zulip`), and as a way of making explicit which releases are supported for each list. For the special-case of the PGroonga signing key, we source an additional file within the directory. This simplifies the process for adding another class of `.list` file.	2021-05-26 14:42:29 -07:00
Adam Birds	4539899cae	installer: Add support for custom database user and dbname. Add support for custom database names and database users, which can be set with the `--postgresql-database-name` and `--postgresql-database-user` install script options. If these parameters aren't provided, then the defaults remain "zulip". Fixes #17662. Co-authored-by: Alex Vandiver <alexmv@zulip.com>	2021-05-25 13:56:05 -07:00
Alex Vandiver	7ff3c9f966	upgrade-zulip: Support arbitrary database user and dbname. Co-authored-by: Adam Birds <adam.birds@adbwebdesigns.co.uk>	2021-05-25 13:56:05 -07:00
Alex Vandiver	1d59330cbc	postgresql-init-db: Support arbitrary database user and dbname. Co-authored-by: Adam Birds <adam.birds@adbwebdesigns.co.uk>	2021-05-25 13:56:04 -07:00
Alex Vandiver	54c222d3f8	settings: Support arbitrary database user and dbname. This adds basic support for `postgresql.database_user` and `postgresql.database_name` settings in `zulip.conf`; the defaults if unspecified are left as `zulip`. Co-authored-by: Adam Birds <adam.birds@adbwebdesigns.co.uk>	2021-05-25 13:46:58 -07:00
Adam Birds	21cc186105	installer: Add run_psql_as_postgres function zulip_tools.py. Add a helper `run_psql_as_postgres` function in `scripts/lib/zulip_tools.py`. This is preparatory refactoring for the work to add custom database and user names.	2021-05-24 16:58:11 -07:00
Alex Vandiver	81644f110e	install: $ZULIP_ADMINISTRATOR may be unset for non-frontend hosts.	2021-05-23 13:29:23 -07:00
Anders Kaseorg	09f6ba1971	install: Run git config commands from a known readable cwd. Fixes this error when running the installer from a directory that isn’t world-readable: + su zulip -c 'git config --global user.email anders@zulip.com' fatal: cannot come back to cwd: Permission denied Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 22:00:29 -07:00
Anders Kaseorg	bc45525369	postgresql-init-db: Fix installation from world-unreadable directory. This reverts part of commit `476524c0c1` (#18215), to fix this error when running the installer from a directory that isn’t world-readable: + '[' -e /var/run/supervisor.sock ']' +++ dirname /root/zulip-server-4.1/scripts/setup/postgresql-init-db ++ dirname /root/zulip-server-4.1/scripts/setup + su zulip -c /root/zulip-server-4.1/scripts/stop-server bash: /root/zulip-server-4.1/scripts/stop-server: Permission denied Zulip installation failed (exit code 126)! Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 22:00:29 -07:00
Anders Kaseorg	6766a3f780	purge-old-deployments: Check /srv/zulip.git existence before pruning it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 20:36:27 -07:00
Tim Abbott	de47feab43	scripts: Fix check for services running when upgrading. When upgrading from a pre-4.0 release, scripts/stop-server logic would check whether supervisord configuration files were present to determine what it needed to restart, but only considered paths to those files that are introduced in Zulip 4.0. Fixed #18493.	2021-05-13 18:57:19 -07:00
Anders Kaseorg	3f83b843c2	upgrade-zulip-from-git: Create deployment directories with git worktree. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 13:42:05 -07:00
Tim Abbott	03420831b0	upgrade-zulip-from-git: Fetch tags from upstream repository. This ensures that the `git describe` queries that we run for caching Zulip's Git version are guaranteed to include recent releases. This change ensures that we have accurate output even if we're pointed at a fork of Zulip that never updates its tags. Additionally, it will make it possible to record the `git merge-base upstream/master` in future commits. Note that because we run this code before unpacking the new version, the pre-upgrade version of this code runs. As a result, we cannot assume that the upstream repository exists.	2021-05-13 11:17:25 -07:00
Alex Vandiver	3ccb77da74	install: Tell NVM to not change $PATH earlier. This removes a possible window where an installer error could leave `nvm` in a state where it had prepended the full path to the newly-installed `npm` to `$PATH`; we would like to avoid `nvm` fiddling with path whenever possible (ref `ebe930ab2c`).	2021-05-11 11:25:34 -10:00
Anders Kaseorg	9ba48c4ed3	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 22:42:39 -07:00
Anders Kaseorg	d0c6f4f400	python: Strip leading and trailing spaces from docstrings. This is enforced by Black ≥ 21.4b0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 22:42:39 -07:00
Robert Imschweiler	534d78232c	scripts: Add {start,stop,restart}-server support for postgresql role. During the upgrade process of a postgresql-only Zulip installation, (`puppet_classes = zulip::profile::postgresql` in `/etc/zulip/zulip.conf`) either `scripts/start-server` or `scripts/stop-server` fail because they try to handle supervisor services that are not available (e.g. Tornado) since only `/etc/supervisor/conf.d/zulip/zulip_db.conf` is present and not `/etc/supervisor/conf.d/zulip/zulip.conf`. While this wasn't previously supported, it's a pretty reasonable thing to do, and can be readily supported by just adding a few conditionals.	2021-05-07 09:41:05 -07:00
Anders Kaseorg	9d57fa9759	puppet: Use pgrep -x to avoid accidental matches. Matching the full process name (-x without -f) or full command line (-xf) is less prone to mistakes like matching a random substring of some other command line or pgrep matching itself. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 08:54:41 -07:00
Anders Kaseorg	405bc8dabf	requirements: Remove Thumbor. Thumbor and tc-aws have been dragging their feet on Python 3 support for years, and even the alphas and unofficial forks we’ve been running don’t seem to be maintained anymore. Depending on these projects is no longer viable for us. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-06 20:07:32 -07:00
Alex Vandiver	eda9ce2364	locale: Use `C.UTF-8` rather than `en_US.UTF-8`. The `en_US.UTF-8` locale may not be configured or generated on all installs; it also requires that the `locales` package be installed. If users generate the `en_US.UTF-8` locale without adding it to the permanent set of system locales, the generated `en_US.UTF-8` stops working when the `locales` package is updated. Switch to using `C.UTF-8` in all cases, which is guaranteed to be installed. Fixes #15819.	2021-05-04 08:51:46 -07:00
Mateusz Mandera	dd7f3a1dce	upgrade: Use restart-server unless --skip-puppet is used. In some cases, puppet can end up restarting supervisord services - which will use code from the old deployment, because when puppet runs, /home/zulip/deployments/current still points there. Thus restart-server needs to be used in favor of start-server, unless we know that puppet has been skipped.	2021-05-03 08:12:54 -07:00
Alex Vandiver	ebe930ab2c	upgrade: Set an explicit value for PATH. Previous versions of zulip used `nvm alias default ...` to have `nvm` prepend the full path to the latest `node` install to the `PATH` in root's shell. Unfortunately, this means that `update-prod-static`, when called from `upgrade-zulip-stage-2` after an upgrade of node in `install-node`, would still have the full path to the _old_ `node` at the start of its PATH, because the PATH of `upgrade-zulip-stage-2` would still be unchanged. Bootstrap out of this by setting a known-reasonable PATH during upgrade, and remove the problematic `nvm alias default` behaviour. Fixes #18258.	2021-05-01 07:16:45 -07:00
Alex Vandiver	49144247dd	install: Set explicit value for PATH. In Debian, becoming root as `su` does not alter the `$PATH`; this can lead to the root user not having `/usr/sbin` in its path, and thus the `useradd zulip` step of the installer fails. Fixes #17441.	2021-05-01 07:16:45 -07:00
Alex Vandiver	daabc52a78	restart-server: Reorder supervisorctl calls for less downtime. Instead of taking the "onion" approach, where all services are stopped, and then started back up again, default to a rolling restart across all processes. This draws out how long the overall "restart" takes, but minimizes the time that any of the services are down. This minimizes user-visible impact and queue buildup. In cases where speed is more important than minimal impact (for example, there is already a current outage), a --less-graceful flag is provided, which brings the services down more suddenly, and back up in a still-correct order.	2021-04-30 16:47:15 -07:00
Alex Vandiver	4c88da8ed9	scripts: Tool to find the diff to an original settings.py prod template. This hits the unauthenticated Github API to get the list of tags, which is rate-limited to 60 requests per hour. This means that the tool can only be run 60 times per hour before it starts to exit with errors, but that seems like a reasonable limit for the moment.	2021-04-27 21:50:33 -07:00
Alex Vandiver	ae2c377d13	postgresql: Switch to defaulting to PostgreSQL 13.	2021-04-27 16:55:04 -07:00
Robert Imschweiler	ba25580b19	clean-unused-caches: Handle non-existent yarn cache.	2021-04-27 10:02:49 -07:00
Riken Shah	1288dcbaaf	clean-unused-caches: Add script to remove redundant yarn cache. This commit removes redundant yarn cache by removing the old version directories, i.e. All the directory under `~/.cache/yarn` except `~/.cache/yarn/v6` (current version directory). Fixes #15964.	2021-04-26 16:28:08 -07:00
Anders Kaseorg	6060d0d364	docs: Add missing space to compound verbs “log in”, “set up”, etc. Noun: backup, checkout, cleanup, login, logout, setup, shutdown, signup, timeout. Verb: back up, check out, clean up, log in, log out, set up, shut down, sign up, time out. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-26 09:31:08 -07:00
Alex Vandiver	6db454b252	upgrade: Call start-server rather than restart-server if we stopped it. This saves a little time, and thus causes a shorter outage window, since we will not try to stop the services; we know they are already down.	2021-04-21 10:28:30 -07:00
Alex Vandiver	16650ba239	upgrade: Call ./scripts/stop-server rather than duplicate the logic.	2021-04-21 10:28:30 -07:00
Alex Vandiver	ec12a6128a	scripts: Add a start-server as well. In general, `./scripts/restart-server` will already work in any circumstance where the server is already stopped and needs to be started. However, it will output a couple minor warnings, and it is not readily obvious that it will work correctly. Add an alias for `restart-server` named `start-server`, for parallelism with `stop-server`, which omits the steps of `restart-server` which would stop the server first.	2021-04-21 10:24:08 -07:00
Alex Vandiver	476524c0c1	scripts: Add a script to stop the server. Using `supervisorctl stop all` to stop the server is not terribly discoverable, and may stop services which are not part of Zulip proper. Add an explicit tool which only stops the relevant services. It also more carefully controls the order in which services are stopped to minimize lost requests, and maximally quiesce the server. Locations which may be stopping _older_ versions of Zulip (without this script) are left with using `supervisorctl stop all`. Fixes #14959.	2021-04-21 10:24:08 -07:00
Alex Vandiver	31169526ec	scripts: Say "Zulip" rather than "Application".	2021-04-21 10:24:08 -07:00
Alex Vandiver	0de8357820	scripts: Fix path to additional Zulip supervisor files. The path which contains all of the Zulip supervisor files changed in `3ab9b31d2f` to make it easier to purge now-unwanted supervisor configuration files. However, the paths that the zulip upgrade process, and restart-server, look at were not adjusted. Fix the supervisor configuration file paths.	2021-04-21 10:24:08 -07:00
Alex Vandiver	de41a10d38	upgrade: Install python3-yaml as needed. `3314fefaec` started needing `python3-yaml`, but incorrectly claimed that it was always an indirect dependency; it is a dependency of `ubuntu-minimal` on 20.04, but not required on 18.04 or Debian. We cannot install it in puppet because that is definitionally too late; it is needed at load time by `zulip-puppet-apply`. Install `python3-yaml`, but guarded by a simple check so as to not further slow most installs. Fixes #18179.	2021-04-21 09:52:56 -07:00
Alex Vandiver	4c8502f7fd	upgrade: Show fewer stacktraces. The stacktraces here are seldom useful -- for the calls to upgrade-stage-2, we know precisely what was run. For the `run` wrapper, the output contains the command that failed, which is sufficient to identify where in the upgrade process it was. Showing more stacktrace below the actual error merely confuses users and scrolls the real error off of the screen.	2021-04-21 09:51:40 -07:00
Siddharth Asthana	d2706fa246	install: Create a .gitconfig file for the zulip user. For installs which use the `upgrade-zulip-from-git` process, the deployment directory is a git checkout. This means that an administrator can, as an emergency tool, run `git revert` and similar commands -- assuming there is a `~/.gitconfig` set up for the zulip user. Add commands to `scripts/lib/install` to create a `~/.gitconfig` file at installation time. The `user.name` and `user.email` fields are set to the hostname and passed-in `--email` value, respectively. Fixes #18039.	2021-04-20 22:47:20 -07:00
Gaurav Pandey	feb720b463	install: Add beta support for debian bullseye for production. This won't work on a real bullseye system until Bullseye actually officially releases. Fixes part of #17863.	2021-04-15 21:38:31 -07:00
Gaurav Pandey	78524d4f87	provision: Add support for debian bullseye. Fixes part of #17863.	2021-04-15 21:38:31 -07:00
Anders Kaseorg	b6b117274c	install-node: Upgrade Node.js to 14.16.1 and nvm to 0.38.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-07 21:05:01 -07:00
Ganesh Pawar	c1628e7605	provision: Upgrade support for Fedora to version 33. Note that the `overwrite_symlink` changes fix a bug introduced in `5c20ee998c`, that we need root permissions to do those operations.	2021-03-22 19:34:18 -07:00
Ganesh Pawar	666ab59b03	pgroonga: Bump pgroonga version to 2.2.8 when building from source.	2021-03-22 19:33:48 -07:00
Ganesh Pawar	7cdb26108c	minor: Avoid verbose tar output. It isn't very helpful and clutters the logs.	2021-03-22 19:33:48 -07:00
Anders Kaseorg	6364e1b5f3	requirements: Upgrade talon fork to 1.4.8. https://github.com/mailgun/talon/pull/200 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-18 17:10:18 -07:00
Alex Vandiver	3314fefaec	puppet: Do not require a venv for zulip-puppet-apply. `0663b23d54` changed zulip-puppet-apply to use the venv, because it began using `yaml` to parse the output of puppet to determine if changes would happen. However, not every install ends with a venv; notably, non-frontend servers do not have one. Attempting to run zulip-puppet-apply on them hence now fails. Remove this dependency on the venv, by installing a system python3-yaml package -- though in reality, this package is already an indirect dependency of the system. Especially since pyyaml is quite stable, we're not using it in any interesting way, and it does not actually add to the dependencies, it is preferable to parsing the YAML by hand in this instance.	2021-03-14 17:50:57 -07:00
Alex Vandiver	52f155873f	puppet: Ensure that all `scripts/lib/install` packages are installed. These have all been required packages for some time, but this helps keep the install-time list more clearly a subset of the upgrade-time list.	2021-03-14 17:50:57 -07:00
Anders Kaseorg	d393ac5034	update-prod-static: Remove unused --prev-deploy option. It’s unused since commit `079ddae4c8` (#12676). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-04 18:04:57 -08:00
Anders Kaseorg	25bb98dcf5	install-node: Upgrade Node.js from 14.15.1 to 14.16.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-03 21:46:42 -08:00
Anders Kaseorg	ccad00b7e9	provision: Suppress exception chaining for CalledProcessError retries. When an exception is raised inside an exception handler, Python 3 helpfully prints both tracebacks separated by “During handling of the above exception, another exception occurred:”. But when we’re using an exception handler to retry the same operation, multiple tracebacks are just noise. Suppress the earlier one using PEP 409 syntax. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-03 16:25:03 -08:00
Alex Vandiver	32149c6a1c	puppet: Add ksplice uptrack for kernel hotpatches.	2021-02-25 18:05:47 -08:00
Alex Vandiver	0663b23d54	puppet: Only prompt to apply if there are changes to apply. Since `yaml` is not a module in the standard library, this requires making `zulip-puppet-apply` use the venv.	2021-02-23 18:16:02 -08:00
Alex Vandiver	d15e6990e5	puppet: Only execute setup-apt-repo if necessary. This means that in steady-state, `zulip-puppet-apply` is expected to produce no changes or commands to execute. The verification step of `setup-apt-repo` is quite fast, so this cleans up the output for very little cost.	2021-02-23 18:16:02 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	5028c081cb	python: Merge concatenated string literals that Black would uglify. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	1a4f70f1bc	lint: Convert sudo exclusion to double quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 08:34:43 -08:00
Ganesh Pawar	7eeca9da46	provision: Add provision support for Ubuntu 20.10 (Groovy). PostgreSQL 13 is used when os_version is 20.10.	2021-02-05 09:30:34 -08:00
Anders Kaseorg	948f2ee2ad	manage: Quote commands correctly in log_management_command. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-01-26 13:26:57 -08:00
Alex Vandiver	c2526844e9	worker: Remove SignupWorker and friends. ZULIP_FRIENDS_LIST_ID and MAILCHIMP_API_KEY are not currently used in production. This removes the unused 'signups' queue and worker.	2021-01-17 11:16:35 -08:00
Sutou Kouhei	0d3f9fc855	install: Use PGroonga packages built for PostgreSQL packages by PGDG. This works because we have always used PostgreSQL packages by PGDG since Zulip 3.0. Fixes #16058.	2020-12-18 15:38:21 -08:00
Anders Kaseorg	77fdac3579	install-node: Upgrade Node.js to 14.15.1 and nvm to 0.37.2. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-09 23:07:40 -08:00
Vishnu KS	eb008fc864	emails: Use macros for email tags in invitation email.	2020-10-30 11:50:30 -07:00
Anders Kaseorg	aaa7b766d8	python: Use universal_newlines to get str from subprocess. We can replace ‘universal_newlines’ with ‘text’ when we bump our minimum Python version to 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	86e8d81c7f	python: Skip unnecessary decode before JSON parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Tim Abbott	c537912a77	puppet: Migrate postgres_backups puppet manifest name.	2020-10-29 11:29:44 -07:00
Alex Vandiver	2332113c97	upgrade: Adjust puppet class names even with --skip-puppet. The class names need to be renamed even if we are not about to run puppet ourselves; otherwise, deployments which rely on running puppet themselves will still have the wrong class names.	2020-10-28 17:49:14 -07:00
Alex Vandiver	6b9d7000b5	puppet: Set proxy environment variables. These are respected by `urllib`, and thus also `requests`. We set `HTTP_proxy`, not `HTTP_PROXY`, because the latter is ignored in situations which might be running under CGI -- in such cases it may be coming from the `Proxy:` header in the request.	2020-10-28 12:17:35 -07:00
Alex Vandiver	97745688ca	docs: Link to the new doc home of the email gateway.	2020-10-28 12:13:04 -07:00
Alex Vandiver	f1cf730c5b	restore-backup: Rename variables to postgresql.	2020-10-28 11:57:03 -07:00
Alex Vandiver	5ee3379ce0	upgrade: Rename variables to postgresql.	2020-10-28 11:57:03 -07:00
Alex Vandiver	2b0bbbb882	tools: Rename postgres to postgresql in tool names.	2020-10-28 11:57:02 -07:00
Alex Vandiver	5eb8064a1a	install: Rename postgres options to postgresql.	2020-10-28 11:55:32 -07:00
Alex Vandiver	1f7132f50d	docs: Standardize on PostgreSQL, not Postgres.	2020-10-28 11:55:16 -07:00
Anders Kaseorg	23a289ecd5	install-node: Upgrade Node.js to 12.19.0 and Yarn to 1.22.10. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-28 11:45:02 -07:00
Anders Kaseorg	de5282d2cf	install-node: Install npm and npx symlinks. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-28 11:45:02 -07:00
Alex Vandiver	5f3765b872	upgrade: Adjust puppet classes to new names.	2020-10-27 13:29:19 -07:00
Alex Vandiver	16d9dd84b8	upgrade: Switch to using crudini to update zulip.conf contents. Using `config_file.write()` only writes out what python stored of the file; as such, it strips all comments and whitespace. Use `crudini --set`, which only modifies the line whose contents are changed.	2020-10-27 13:29:19 -07:00
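The comment-stripping behaviour that motivated `16d9dd84b8` is easy to reproduce with the standard library. The sketch below is illustrative only: `ORIGINAL` and `rewrite_with_configparser` are hypothetical names, and the section, key, and class names are assumptions chosen to resemble a `zulip.conf`.

```python
import configparser
import io

# Hypothetical zulip.conf contents with a hand-written comment.
ORIGINAL = """\
# Settings managed by hand
[machine]
puppet_classes = zulip::voyager
"""


def rewrite_with_configparser(text):
    """Change one value by round-tripping the whole file through configparser."""
    parser = configparser.RawConfigParser()
    parser.read_string(text)
    parser.set("machine", "puppet_classes", "zulip::profile::standalone")
    out = io.StringIO()
    # write() only emits what the parser stored: sections, keys, and values.
    parser.write(out)
    return out.getvalue()
```

The rewritten text contains the new value but not the `# Settings managed by hand` line, which is the loss that a line-oriented edit avoids.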
Alex Vandiver	5365af544a	puppet: Rename zulip::profile::rabbit to ::rabbitmq.	2020-10-27 13:29:19 -07:00
Alex Vandiver	188af57296	puppet: Rename postgres_appdb to postgresql. There is only one PostgreSQL database; the "appdb" is irrelevant. Also use "postgresql," as it is the name of the software, whereas "postgres" is the name of the binary and colloquial name. This is minor cleanup, but enabled by the other renames in the previous commit.	2020-10-27 13:29:19 -07:00
Alex Vandiver	0f25acc7b3	puppet: Rename "voyager"/"dockervoyager" to "standalone"/"docker". The "voyager" name is non-intuitive and not significant. `zulip::voyager` and `zulip::dockervoyager` stubs are kept for back-compatibility with existing `zulip.conf` files.	2020-10-27 13:29:19 -07:00
Alex Vandiver	c2185a81d6	puppet: Move top-level zulip deployments into "profile" directory. This moves the puppet configuration closer to the "roles and profiles method"[1] which is suggested for organizing puppet classes. Notably, here it makes clear which classes are meant to be able to stand alone as deployments. Shims are left behind at the previous names, for compatibility with existing `zulip.conf` files when upgrading. [1] https://puppet.com/docs/pe/2019.8/the_roles_and_profiles_method	2020-10-27 13:29:19 -07:00
Alex Vandiver	7cf737988d	queue: Be more explicit about test/real queue division.	2020-10-26 12:32:47 -07:00
Anders Kaseorg	31d0141a30	python: Close opened files. Fixes various instances of ‘ResourceWarning: unclosed file’ with python -Wd. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-26 12:31:30 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	16aa48d9b2	configure-rabbitmq: Wait for RabbitMQ to start up. Fixes an occasional failure in ‘vagrant up --provision’. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-15 17:01:00 -07:00
Anders Kaseorg	f16aa8f264	configure-rabbitmq: Put the command and flags in one array. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-15 17:01:00 -07:00
Alex Vandiver	1fa4ef0271	upgrade-postgres: Catch failed pg_upgradecluster exit code. Because the command is part of a pipe sequence, the exitcode defaults to the last in the sequence, which is not the most important one here. Set pipefail, which sets the exit status to the exit code of the last program in the sequence to exit non-zero, or 0 if all succeeded. This prevents the upgrade from barreling onward and setting `postgres.version` improperly if the database upgrade step failed.	2020-10-15 15:21:30 -07:00
Anders Kaseorg	dfaea9df65	shfmt: Reformat shell scripts with shfmt. https://github.com/mvdan/sh Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-15 15:16:00 -07:00
Anders Kaseorg	dd48dbd912	docs: Add spaces to “check out”, “log in”, “set up”, “sign up” as verbs. “Checkout”, “login”, “setup”, and “signup” are nouns, not verbs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Anders Kaseorg	b7a94be152	python: Catch BaseException when we need to clean something up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:16:16 -07:00
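A minimal sketch of the pattern named in `b7a94be152`, assuming a cleanup that removes a partially written file. `write_output` and its `produce` callback are hypothetical and do not come from the Zulip codebase.

```python
import contextlib
import os


def write_output(path, produce):
    """Write produce() to path, removing the partial file on any failure."""
    try:
        with open(path, "w") as f:
            f.write(produce())
    except BaseException:
        # BaseException also covers KeyboardInterrupt and SystemExit, which
        # "except Exception" would let through without cleaning up.
        with contextlib.suppress(FileNotFoundError):
            os.remove(path)
        raise
```

The bare `raise` re-raises the original exception after cleanup, so an interrupt still stops the program.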
Tim Abbott	5de6f3523c	upgrade-postgres: Pass the requested postgres explicitly.	2020-10-01 14:29:24 -07:00
Alex Vandiver	4d65ea256a	rabbitmq: Consolidate check_rabbitmq_queue to call rabbitmqctl once. `rabbitmqctl` tends to be slow; this shaves half a second off the time to run `check-rabbitmq-consumers` in some cases.	2020-09-29 17:44:44 -07:00
Alex Vandiver	c0e240277b	tornado: Remove fingerprinting, write out .tmp files always. Fingerprinting the config is somewhat brittle -- it requires custom bootstrapping for old (fingerprint-less) configs, and may have false positives. Since generating the config is lightweight, do so into the .tmp files, and compare the output to the originals to determine if there are changes to apply. In order to both surface errors, as well as notify the user in case a restart is necessary, we must run it twice. The `onlyif` functionality cannot show configuration errors to the user, only determine if the command runs or not. We thus run the command once, judging errors as "interesting" enough to run the actual command, whose failure will be verbose in Puppet and halt any steps that depend on it. Removing the `onlyif` would result in `stage_updated_sharding` showing up in the output of every Puppet run, which obscures the important messages it displays when an update to sharding is necessary. Removing the `command` (e.g. making it an `echo`) would result in removing the ability to report configuration errors. We thus have no choice but to run it twice; this is thankfully low-overhead.	2020-09-25 10:52:40 -07:00
Alex Vandiver	4b3121db0b	certbot: Explicitly apt-get update before installing certbot. There is no guarantee that the apt data is up-to-date, unless we explicitly update. Fixes: zulip/docker-zulip#275	2020-09-21 15:26:28 -07:00
Mateusz Mandera	e2dcdc2758	queue: Increase allowed expected_time_to_clear_backlog for embed_links. It's okay for this queue to be a bit slow, and the default limits are kind of too low for it.	2020-09-21 15:24:04 -07:00
Mateusz Mandera	cd9b194d88	queue: Eliminate useless "burst" concept in monitoring. The reason higher expected_time_to_clear_backlog values were allowed for queues during "bursts" was, in simpler terms, that the queues to which this happens intrinsically have a higher acceptable "time until cleared" for new events. E.g. digests_email, where it's completely fine to take a long time to send them out after putting them in the queue. And that's already configurable without a normal/burst distinction. Thanks to this we can remove a bunch of overly complicated, and ultimately useless, logic.	2020-09-21 15:24:04 -07:00
Mateusz Mandera	2365a53496	queue: Fix a race condition in monitoring after queue stops being idle. The race condition is described in the comment block removed by this commit. This leaves room for another, remaining race condition that should be virtually impossible, but nevertheless it seems worthwhile to have it documented in the code, so we put a new comment describing it. As a final note, this is not a new race condition, it was hypothetically possible with the old code as well.	2020-09-21 15:22:56 -07:00
Alex Vandiver	2a12fedcf1	tornado: Remove explicit tornado_processes setting; compute it. We can compute the intended number of processes from the sharding configuration. In doing so, also validate that all of the ports are contiguous. This removes a discrepancy between `scripts/lib/sharding.py` and other parts of the codebase about if merely having a `[tornado_sharding]` section is sufficient to enable sharding. Having behaviour which changes merely based on if an empty section exists is surprising. This does require that a (presumably empty) `9800` configuration line exist, but making that default explicit is useful. After this commit, configuring sharding can be done by adding to `zulip.conf`: ``` [tornado_sharding] 9800 = # default 9801 = other_realm ``` Followed by running `./scripts/refresh-sharding-and-restart`.	2020-09-18 15:13:40 -07:00
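The computation described in `2a12fedcf1` can be sketched as below. This is an assumption-laden illustration, not the contents of `scripts/lib/sharding.py`: `tornado_ports` is a hypothetical helper, and treating a missing section as a single process on port 9800 is an assumption. `EXAMPLE_CONF` is the configuration quoted in the commit message.

```python
import configparser

# The example configuration from the commit message above.
EXAMPLE_CONF = """\
[tornado_sharding]
9800 = # default
9801 = other_realm
"""


def tornado_ports(conf_text):
    """Return the sorted Tornado ports, checking that they are contiguous."""
    parser = configparser.RawConfigParser()
    parser.read_string(conf_text)
    if not parser.has_section("tornado_sharding"):
        # Assumption: without the section there is one process on 9800.
        return [9800]
    ports = sorted(int(port) for port in parser.options("tornado_sharding"))
    expected = list(range(9800, 9800 + len(ports)))
    if not ports or ports != expected:
        raise ValueError(f"ports must be contiguous starting at 9800: {ports}")
    return ports
```

The number of processes is then `len(tornado_ports(conf_text))`, so no separate `tornado_processes` setting is needed. An empty section is rejected here because the message requires an explicit `9800` line.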
Anders Kaseorg	b7874ac82e	install-node: Upgrade Node.js to 12.18.4 and Yarn to 1.22.5. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-15 16:33:28 -07:00
Alex Vandiver	efdaa58c24	supervisor: Use more specific process_name than "port-9800". Making this include "zulip-tornado" makes it clearer in supervisor logs. Without this, one only sees: ``` 2020-09-14 03:43:13,788 INFO waiting for port-9807 to stop 2020-09-14 03:43:14,466 INFO stopped: port-9807 (exit status 1) 2020-09-14 03:43:14,469 INFO spawned: 'port-9807' with pid 24289 2020-09-14 03:43:15,470 INFO success: port-9807 entered RUNNING state, process has stayed up for > than 1 seconds (startsecs) ```	2020-09-14 22:17:51 -07:00
Alex Vandiver	13fb7875e2	nagios: Remove an unnecessary path.append.	2020-09-14 18:20:12 -07:00
Alex Vandiver	dd68cc98fd	upgrade: Stop in the same order as restart-server. restart-server explicitly stops the workers first, then the core services. Keep that ordering consistently.	2020-09-14 16:27:15 -07:00
Alex Vandiver	dc58dec231	restart-server: Start services in opposite order from stop. `supervisorctl` starts and stops its arguments sequentially, in the order they are passed[1]. Start them in the opposite order from the order in which they were stopped -- this puts the dependencies first, and the most core services (`zulip-django`) last. While the only "dependency" here is currently thumbor, this sets us up in case others are added later. [1] https://github.com/Supervisor/supervisor/blob/master/supervisor/supervisorctl.py#L782	2020-09-14 16:27:15 -07:00
Alex Vandiver	8adf530400	puppet: Generate sharding in puppet, then refresh-sharding-and-restart. This supports running puppet to pick up new sharding changes, which will warn of the need to finalize them via `refresh-sharding-and-restart`, or simply running that directly.	2020-09-14 16:27:15 -07:00
Alex Vandiver	bf029d99f1	sharding: Also mark sharding.json 644 for consistency. There is no reason to limit this to 640; mark it 644 for consistency with the other file.	2020-09-14 16:27:15 -07:00
Alex Vandiver	b5bcff04e5	sharding: Consistent mode for nginx sharding file. This disagreed between `tornado_sharding.pp` in puppet and `scripts/refresh-sharding-and-restart`.	2020-09-14 16:27:15 -07:00
Mateusz Mandera	aae84197e8	check-rabbitmq-queue: Use list_queues output for current backlog size. The value in the stats file can get outdated if the queue hasn't done enough iterations to update the stats file for a while. The queue size output by rabbitmqctl list_queues is more up to date, and empirically tends to agree with the value in the stats file (when the stats file is fresh).	2020-09-11 15:51:07 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
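The entropy figures in `b7b7475672` can be checked with a short sketch. `generate_key` below is a hypothetical stand-in for `confirmation.models.generate_key`, written only from the description above (base32 alphabet, 120 bits, one call for random bytes); the real function may differ in detail.

```python
import base64
import math
import secrets


def generate_key():
    """Return a 24-character base32 token carrying 120 bits of entropy."""
    # 15 random bytes are exactly 120 bits, which base32 encodes as
    # 24 characters (5 bits each) with no padding.
    return base64.b32encode(secrets.token_bytes(15)).decode()


# Entropy of 24 independent, uniform characters from a 32-symbol alphabet.
NEW_ENTROPY_BITS = 24 * math.log2(32)

# The old figure of 124.07820 bits is consistent with 24 characters drawn
# uniformly from a 36-symbol alphabet (an inference from the number alone).
OLD_ENTROPY_BITS = 24 * math.log2(36)
```

The same arithmetic matches the `get_bigbluebutton_url` figures: `10 * math.log2(36)` is about 51.69925 bits and `10 * math.log2(32)` is exactly 50.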
Anders Kaseorg	f91d287447	python: Pre-fix a few spots for better Black formatting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	bb4fc3c4c7	python: Prefer --flag=option over --flag option. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	9edcafb7a0	setup_venv: Add missing comma in COMMON_YUM_VENV_DEPENDENCIES. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:25:54 -07:00
Anders Kaseorg	a50fae89e2	python: Elide type=str from argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	fbfd4b399d	python: Elide action="store" for argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	1f2ac1962f	python: Elide default=None for argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	3c5b39da9c	python: Elide nargs for argparse flag arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	b4597a8ca8	python: Elide default for store_{true,false} argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	a276eefcfe	python: Rewrite dict() as {}. Suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	ab120a03bc	python: Replace unnecessary intermediate lists with generators. Mostly suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	1ded51aa9d	python: Replace list literal concatenation with * unpacking. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
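For readers unfamiliar with the rewrite in `1ded51aa9d`, the two forms look like this. The command and variable names are hypothetical and are not taken from the changed files.

```python
base_command = ["git", "log"]
extra_args = ["--oneline", "-n", "5"]

# Before: building the argument list by concatenating list literals.
concatenated = base_command + ["--no-color"] + extra_args

# After: a single list literal using * unpacking.
unpacked = [*base_command, "--no-color", *extra_args]
```

Both produce the same list. The unpacking form also accepts any iterable, such as a tuple or generator, where `+` would require every operand to be a list.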
Anders Kaseorg	a5dbab8fb0	python: Remove redundant dest for argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:04:10 -07:00
Sutou Kouhei	ebf4048dd4	create-db.sql: Ensure using en_US.UTF-8 encoding. PostgreSQL packages for Ubuntu run "initdb" without specifying a locale on installation. This means that the default template database (template1) is created with the system default locale. If the system default locale uses a non-UTF-8-compatible encoding such as en_US.ISO-8859-15, the "zulip" database is also created with a non-UTF-8-compatible encoding such as LATIN9. You can reproduce this case by running the following script: apt update apt install -y locales locale-gen en_US.ISO-8859-15 update-locale LANG=en_US.ISO-8859-15 LANGUAGE=en_US: apt install -y wget wget https://www.zulip.org/dist/releases/zulip-server-latest.tar.gz tar xf zulip-server-latest.tar.gz zulip-server-/scripts/setup/install \ --hostname=zulip-test.example.com \ --email=zulip-test-admin@example.com \ --self-signed-cert scripts/setup/install then fails with the following error: + ./manage.py migrate --noinput Operations to perform: Apply all migrations: analytics, auth, confirmation, contenttypes, otp_static, otp_totp, sessions, social_django, two_factor, zerver Running migrations: Applying contenttypes.0001_initial... OK Applying auth.0001_initial... 
OK Applying zerver.0001_initial...Traceback (most recent call last): File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/utils.py", line 82, in _execute return self.cursor.execute(sql) File "/home/zulip/deployments/2020-08-19-05-57-10/zerver/lib/db.py", line 33, in execute return wrapper_execute(self, super().execute, query, vars) File "/home/zulip/deployments/2020-08-19-05-57-10/zerver/lib/db.py", line 20, in wrapper_execute return action(sql, params) psycopg2.errors.UntranslatableCharacter: character with byte sequence 0xe2 0x80 0x99 in encoding "UTF8" has no equivalent in encoding "LATIN9" CONTEXT: line 4 of configuration file "/usr/share/postgresql/12/tsearch_data/en_us.affix" The above exception was the direct cause of the following exception: Traceback (most recent call last): File "./manage.py", line 50, in <module> execute_from_command_line(sys.argv) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/__init__.py", line 381, in execute_from_command_line utility.execute() File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/__init__.py", line 375, in execute self.fetch_command(subcommand).run_from_argv(self.argv) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 323, in run_from_argv self.execute(args, *cmd_options) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 364, in execute output = self.handle(args, *options) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 83, in wrapped res = handle_func(args, **kwargs) File 
"/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/commands/migrate.py", line 232, in handle post_migrate_state = executor.migrate( File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/executor.py", line 117, in migrate state = self._migrate_all_forwards(state, plan, full_plan, fake=fake, fake_initial=fake_initial) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/executor.py", line 147, in _migrate_all_forwards state = self.apply_migration(state, migration, fake=fake, fake_initial=fake_initial) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/executor.py", line 245, in apply_migration state = migration.apply(state, schema_editor) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/migration.py", line 124, in apply operation.database_forwards(self.app_label, schema_editor, old_state, project_state) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/operations/special.py", line 105, in database_forwards self._run_sql(schema_editor, self.sql) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/migrations/operations/special.py", line 130, in _run_sql schema_editor.execute(statement, params=None) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/base/schema.py", line 137, in execute cursor.execute(sql, params) File 
"/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/utils.py", line 67, in execute return self._execute_with_wrappers(sql, params, many=False, executor=self._execute) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/utils.py", line 76, in _execute_with_wrappers return executor(sql, params, many, context) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/utils.py", line 84, in _execute return self.cursor.execute(sql, params) File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/utils.py", line 89, in __exit__ raise dj_exc_value.with_traceback(traceback) from exc_value File "/srv/zulip-venv-cache/b4a27188142d80b2eeb64f5d5c05b1d94cc6b7b9/zulip-py3-venv/lib/python3.8/site-packages/django/db/backends/utils.py", line 82, in _execute return self.cursor.execute(sql) File "/home/zulip/deployments/2020-08-19-05-57-10/zerver/lib/db.py", line 33, in execute return wrapper_execute(self, super().execute, query, vars) File "/home/zulip/deployments/2020-08-19-05-57-10/zerver/lib/db.py", line 20, in wrapper_execute return action(sql, params) django.db.utils.DataError: character with byte sequence 0xe2 0x80 0x99 in encoding "UTF8" has no equivalent in encoding "LATIN9" CONTEXT: line 4 of configuration file "/usr/share/postgresql/12/tsearch_data/en_us.affix"	2020-08-24 12:24:38 -07:00
Anders Kaseorg	0f608176ad	install-node: Upgrade Node.js from 12.18.2 to 12.18.3. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-12 18:37:25 -07:00
Aman	7b9fe77bf1	provision: Fix missing <sasl/sasl.h> headers during provision.	2020-08-12 16:19:06 -07:00
Anders Kaseorg	60a25b2721	docs: Fix spelling errors caught by codespell. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Anders Kaseorg	3582183fba	setup_venv: Install libyaml-dev. This will let PyYAML link against LibYAML when PyYAML is next installed. Due to virtualenv-clone, that won’t happen until the next Python package removal anyway, so we don’t bother bumping PROVISION_VERSION. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-07 20:58:07 -07:00
Anders Kaseorg	dbdf67301b	memcached: Switch from pylibmc to python-binary-memcached. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-06 12:51:14 -07:00
Alex Vandiver	519b1e9b4d	upgrade: With `skip_puppet`, show what puppet changes are outstanding. This prevents puppet changes from building up over time.	2020-08-02 12:47:31 -07:00
Alex Vandiver	c1923e19b0	puppet: --noop implies --force (i.e. no prompt). The combination of `--force --noop` is potentially confusing, but currently `--noop` makes no sense without `--force`, as it will prompt and then not make changes. Make `--noop` skip the prompt as well.	2020-08-02 12:47:31 -07:00
Alex Vandiver	38d01cd4db	puppet: Generalize install-wal-g to be arbitrary tarballs.	2020-07-24 17:24:57 -07:00
Anders Kaseorg	b3da022bdf	install-node: Upgrade Node.js to 12.18.2. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-20 10:56:31 -07:00
Anders Kaseorg	c2f9db4602	logo: Update Zulip logo. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-16 01:37:08 -07:00
Tim Abbott	525b42cecc	setup_venv: Require same Python version for virtualenv-clone. This prevents us cloning a virtualenv in a way that would cause us to ignore a newly updated Python version on the system.	2020-07-13 13:06:15 -07:00
Aman Agrawal	685ec2a098	hash_reqs: Include python version when generating hash. Fixes #12868. We now also include python version in the format 'major.minor.patchlevel', when generating hash for a requirement file. This was necessary since packages tend to break on different versions of python, so it is important to track the version on which the venv was set up. WARN: This commit will force all zulip venvs to be recreated.	2020-07-13 13:06:15 -07:00
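The idea in `685ec2a098` can be sketched as follows. `requirements_hash` is a hypothetical function, and the use of SHA-1 and the exact way the lines are fed to the digest are assumptions; only the inclusion of the 'major.minor.patchlevel' version string comes from the message.

```python
import hashlib
import platform


def requirements_hash(requirement_lines, python_version=None):
    """Hash pinned requirements together with the Python version."""
    if python_version is None:
        # platform.python_version() returns "major.minor.patchlevel".
        python_version = platform.python_version()
    digest = hashlib.sha1()
    for line in sorted(requirement_lines):
        digest.update(line.encode("utf-8") + b"\n")
    # Mixing in the version means a Python upgrade yields a new hash,
    # and therefore a freshly built venv.
    digest.update(python_version.encode("utf-8"))
    return digest.hexdigest()
```

With identical requirements, `requirements_hash(lines, "3.8.2")` and `requirements_hash(lines, "3.8.5")` differ, which is why the commit warns that every existing venv is recreated once.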
Aman Agrawal	2668829c93	hash_reqs: Use combined package name and version to generate hash. We were already using package names along with their versions to generate hash for the requirement file, as we were passing the `.txt` files to the hash_reqs file instead of intended `.in` files for which the functions in this file were originally designed. Changed the expand_reqs_helper function to adapt for the `.txt` files.	2020-07-13 13:06:15 -07:00
Alex Vandiver	5ff3025411	upgrade: Add additional comments.	2020-07-13 12:47:49 -07:00
Alex Vandiver	47400cd04b	upgrade: Drop unnecessary memcached restart. The contents in the database are unchanged across the PostgreSQL restart; as such, there is no reason to invalidate the caches. This step was inherited from the general operating system upgrade documentation. When Python versions change, such as during OS upgrades, we must ensure that memcached is cleared. However, the `do-release-upgrade` process uninstalled and upgraded to a new memcached, as well as likely restarted the system; a separate step for OS upgrades to restart memcached is thus unnecessary.	2020-07-13 12:47:04 -07:00
Alex Vandiver	0502b7a8d5	upgrade: Drop the unnecessary step that stops the old cluster. The initial step in pg_upgradecluster stops the cluster for us; this removes the somewhat ugly hack we are otherwise forced into.	2020-07-13 12:45:50 -07:00
Alex Vandiver	bf0f712c81	upgrade: Use the in-place pg_upgrade, not a full dump/restore. pg_upgradecluster has two possibilities for `--method`: `dump`, and `upgrade`. The former is the default, and does a `pg_dump` of all of the databases in the old cluster and feeds them into the new cluster. This is a sure-fire way of getting the same information in both databases, but may be extremely slow on large databases, and is guaranteed to fail on servers whose databases take up >50% of their disk. The `--method=upgrade` method, by contrast, uses pg_upgrade to copy the raw database data files over to the new cluster, and then fiddles with their internal structure as needed by the upgrade to let them be correct for the new version[1]. This is slightly faster than the dump/load method, since it skips the serialization step, but still requires that there be enough space on disk for both old and new versions at once. `pg_upgrade` is currently supported for all versions of PostgreSQL from 8.4 to 12. Using `pg_upgrade` incurs slightly more risk, but since it is widely used by now, using it in the relatively-controlled Zulip server environment is reasonable. The expected worst failure is failure to upgrade, not corruption or data loss. Additionally passing `--link` uses hardlinks to link the data files into both the old and new directories simultaneously. This resolves both the runtime of the operation and the disk space usage. The only potential downside to this is that as soon as writes have occurred on the upgraded cluster, the old cluster can no longer be started. Since this tooling intends to remove the old cluster immediately after the upgrade completes successfully, this is not a significant drawback. Switch to using `--method=upgrade --link`. 
This technique spits out two shell scripts which are expected to be run after completion of the upgrade; one re-analyzes the statistics, the other does an `rm -rf` of the data where it is still hardlinked in the old cluster. Extract the location of these scripts from parsing the `pg_upgradecluster` output; since the path is not static, we must rely on it being relatively easy to parse. The risk of the path changing is lower, and has more obvious failure modes, than inserting the current contents of these upgrade steps into the overall `upgrade-postgres`. [1] https://www.postgresql.org/docs/12/pgupgrade.html	2020-07-13 12:45:50 -07:00
Mateusz Mandera	c231d88d9f	upgrade: Add management command to fix FTS indexes. Upgrading the base OS's dictionary files can corrupt our FTS indexes. We add a command for fixing them. Fixes #14982.	2020-07-13 12:40:44 -07:00
Anders Kaseorg	ff1622afcf	zulip_tools: Replace deprecated mktemp call. Although mktemp is deprecated due to security issues, this is not a security issue. The security problems with mktemp happen when you open the resulting filename (without O_EXCL) in a publicly writable directory, because then someone else might have predicted the filename and created or symlinked or hardlinked something there between the mktemp and the open, causing you to write to a file you didn’t expect. Here we don’t open the resulting filename, we symlink to it. symlink will refuse to clobber an existing file, and we handle the error that arises from this case. This is the normal way to atomically create a symlink. We should still replace mktemp because it’s deprecated, but we can’t replace it with a function that creates the temporary file. Instead we build a random filename ourselves. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-09 14:32:02 -07:00
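The approach described in `ff1622afcf` (build a random name, symlink to it, then rename over the target) can be sketched like this. The function body is an illustration written from the message, not a copy of the `zulip_tools` code, and the temporary-name format is an assumption.

```python
import os
import secrets


def overwrite_symlink(src, dst):
    """Atomically point the symlink at dst to src."""
    directory, base = os.path.split(dst)
    while True:
        # Guessable names are acceptable here: os.symlink refuses to
        # overwrite an existing path, so a collision just triggers a retry.
        tmp = os.path.join(directory, f".{base}.{secrets.token_hex(5)}")
        try:
            os.symlink(src, tmp)
        except FileExistsError:
            continue
        break
    try:
        # On POSIX, rename replaces dst in a single atomic step.
        os.rename(tmp, dst)
    except BaseException:
        os.remove(tmp)
        raise
```

A function that creates the temporary file itself cannot be used here, because the temporary path has to be created as a symlink rather than as a regular file.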
Anders Kaseorg	9900298315	zthumbor: Remove Python 2 residue. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-06 18:44:58 -07:00
Alex Vandiver	0d7dbd1b07	puppet: Apply basic PostgreSQL configuration before pg_upgradecluster. Running `pg_upgradecluster` runs the `CREATE TEXT SEARCH DICTIONARY` and `CREATE TEXT SEARCH CONFIGURATION` from `zerver/migrations/0001_initial.py` on the new PostgreSQL cluster; this requires that the stopwords file and dictionary exist _prior_ to `pg_upgradecluster` being run. This causes a minor dependency conflict -- we do not wish to duplicate the functionality from `zulip::postgres_appdb_base` which configures those files, but installing all of `zulip::postgres_appdb_tuned` will attempt to restart PostgreSQL -- which has not configured the cluster for the new version yet. In order to split out configuration of the prerequisites for the application database, and the steps required to run it, we need to be able to apply only part of the puppet configuration. Use the newly-added `--config` argument to provide a more limited `zulip.conf` which only applies `zulip::postgres_appdb_base` to the new version of Postgres, creating the required tsearch data files. This also preserves the property that a failure at any point prior to the `pg_upgradecluster` is easily recoverable, by re-running `zulip-puppet-apply`.	2020-07-06 18:30:16 -07:00
Alex Vandiver	17002f2a0e	puppet: Allow passing an alternate config path to zulip-puppet-apply. When temporary configuration changes are desired, this lets one set up an alternate `zulip.conf` to apply while leaving the true one in place.	2020-07-06 18:30:16 -07:00
Alex Vandiver	efe2b6e5cd	puppet: Switch `zulip-puppet-apply` to argparse. This allows additional arguments other than `-f` or `--force`.	2020-07-06 18:30:16 -07:00
Aman Agrawal	a486872a8e	requirements: Upgrade Thumbor to 7.0.0a5 on Python 3. Co-authored-by: Anders Kaseorg <anders@zulip.com> Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-06 16:09:53 -07:00
Vishnu KS	97403a09d0	install: Create zulip user only if required. Otherwise, the useradd command will fail during the DigitalOcean 1-Click App installation because the install script is called twice during the whole process. Plus the Zulip install script is designed to be idempotent and this bug compromises that.	2020-07-02 14:55:04 -07:00
Anders Kaseorg	e3835554a7	postgres-init-db: Read terminate-psql-sessions script as root. Fixes #15646. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-07-02 14:54:36 -07:00
Tim Abbott	ab1ee1f061	install: Add a comment on crudini deletion.	2020-07-01 15:13:00 -07:00
Alex Vandiver	6df99677d3	installer: Remove unnecessary nginx restart. Puppet takes care of this.	2020-07-01 15:07:52 -07:00
Alex Vandiver	2d4fae0ffe	installer: Remove out-of-date comment.	2020-07-01 15:07:52 -07:00
Alex Vandiver	2de8400a32	installer: Only set `deploy_type = production` in zulip.conf. The value is a holdover from when it controlled runtime behavior, which it no longer does. Stop taking a DEPLOYMENT_TYPE, which is unused; the Python code only cares whether the option exists, not its value.	2020-07-01 15:07:52 -07:00
Alex Vandiver	117d32cd8c	installer: Switch to checking dockervoyager as a class, not a deployment. The DEPLOYMENT_TYPE=dockervoyager is otherwise unused; and always happens in conjunction with a `zulip::dockervoyager` puppet class.	2020-07-01 15:07:52 -07:00
Alex Vandiver	8236cb52d2	installer: Switch has_* variables for has_class checks. These are more correct to the sense of "is this a service we configured for Zulip", and removes potential confusion around the 0/1 values being backwards from how binary is usually interpreted.	2020-07-01 15:07:52 -07:00
Alex Vandiver	2c79909a5d	installer: Switch other PUPPET_CLASSES check for has_class.	2020-07-01 15:07:52 -07:00
Alex Vandiver	ec2383dcde	installer: Move missing_dictionaries configuration to be with other config. It already has been made to explicitly conflict with `--no-overwrite-settings`, so moving it inside the else block is safe.	2020-07-01 15:07:52 -07:00
Alex Vandiver	9c0fd632bb	installer: Use `puppet --write-catalog-summary` to determine classes. Using checks of `,$PUPPET_CLASSES,` is repetitive and error-prone; it does not properly deal with `zulip_ops::` classes, for instance, which include the `zulip::` classes. As alluded to in `ca9d27175b`, this can be fixed by inspecting the classes that would be applied, using `puppet --write-catalog-summary`. We work around the chicken-and-egg problem alluded to therein by writing out as complete `zulip.conf` as would be necessary, before running puppet and removing the sections we then know to not be needed. Unfortunately, there are two checks for `$PUPPET_CLASSES` which cannot be switched to this technique, as they concern errors that we wish to catch quite early, and thus before we have puppet installed. Since we expect failures of those to only concern warnings, and only be mistakenly omitted for internal `zulip_ops::` classes, this seems a reasonable risk to admit in exchange for catching common errors early.	2020-07-01 15:07:51 -07:00
Alex Vandiver	64b44a12f5	puppet: Add an exec rule to reload the whole supervisor config. When supervisor is first installed, it is started automatically, and creates the socket, owned by root. Subsequent reconfiguration in puppet only calls `reread + update`, which is insufficient to apply the `chown = zulip:zulip` line in `supervisord.conf`, leaving the socket owned by `root` and the last part of the installation unable to restart `supervisor` services as the `zulip` user. The `chown` line in `scripts/lib/install` exists to paper over this. Add a separate exec target for changes to `supervisord.conf` itself, which restarts the full service. This leaves the default `restart` action on the service for the lightweight `reread + update` action, which is more common. We use `systemctl` only on redhat-esque builds, because CI runs Ubuntu, but init is not systemd in that context. `systemctl reload` is sufficient to re-apply the socket ownership, but a full `restart` and not `reload` is necessary under `/etc/init.d/supervisor`.	2020-07-01 10:40:54 -07:00
Anders Kaseorg	7f46886696	settings: Split hostname from port more carefully. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-29 22:19:47 -07:00
Anders Kaseorg	fa89d1b266	generate-self-signed-cert: Correct subjectAltName for an IP address. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-29 22:19:47 -07:00
Alex Vandiver	918fcb9f6f	upgrade: Make upgrade-postgres work without systemctl. The only postgres cluster which need be stopped is the one we are upgrading.	2020-06-29 17:18:47 -07:00
Alex Vandiver	cd290c2c66	installer: Be tighter about the search for postgres server packages.	2020-06-29 13:37:16 -07:00
Alex Vandiver	eb6802057a	upgrade: Don't prompt in the second apt-get upgrade.	2020-06-26 16:16:12 -07:00
Alex Vandiver	b7a135f037	upgrade: Add a tool to upgrade PostgreSQL. This is based on the existing steps in the documentation, with additional changes now that the PostgreSQL version is stored in `/etc/zulip/zulip.conf`.	2020-06-26 16:07:39 -07:00
Alex Vandiver	31f1f10501	installer: Halt if wrong version of PostgreSQL is already installed. `49a7a66004` and immediately previous commits began installing PostgreSQL 12 from their apt repository. On machines which already have the distribution-provided version of PostgreSQL installed, however, this leads to failure to apply puppet when restarting PostgreSQL 12, as both attempt to claim the same port. During installation, if we will be installing PostgreSQL, look for other versions than what we will install, and abort if they are found. This is safer than attempting to automatically uninstall or reconfigure existing databases.	2020-06-24 12:57:38 -07:00
Alex Vandiver	814198d649	installer: Abstract out version of postgres installed. This allows for installing from-scratch with a different pinned version of PostgreSQL, and provides a single place to change when the default should increase.	2020-06-24 12:57:38 -07:00
Alex Vandiver	ca9d27175b	installer: Write PostgreSQL version based on puppet classes. Using `/etc/init.d/postgresql` as the detection of if Postgres is on the server is incorrect, because this line runs _before_ puppet and any packages are installed. Thus, it cannot tell the difference between a new Ubuntu one-host first-time-install without PostgreSQL yet, and one which is merely a front-end and will never have PostgreSQL. This leads to failures in first-time installs: ``` Error: Evaluation Error: Error while evaluating a Function Call, Could not find template 'zulip/postgresql//postgresql.conf.template.erb' ``` The only way to detect if PostgreSQL will be present in the _end_ state of the install is to examine the puppet classes that are applied. To do this, we must inspect `PUPPET_CLASSES`. Unfortunately, this can be fragile to subclassing (e.g. `zulip_ops::postgres_appdb`). We might desire to use `puppet apply --write-catalog-summary` to deduce the _applied_ classes, which would unroll the inheritance; however, this causes a chicken-and-egg problem, because `zulip.conf` must be already written out (including a value for `postgresql.version`, if necessary!) before such a puppet run could successfully complete. Switch to predicating the `postgresql.version` key on the puppet classes that are known to install postgres.	2020-06-24 12:57:38 -07:00
Alex Vandiver	253246185f	installer: Update documentation. Where appropriate, documentation wording is shared with docs/production/install.md	2020-06-24 12:57:38 -07:00
Alex Vandiver	85dbb13c56	installer: Abstract out apt/yum divide into a variable. This check is done in several places, using a somewhat fragile `case` statement; move it into an explicit variable.	2020-06-24 12:57:38 -07:00
Alex Vandiver	876ee4a8ed	installer: Remove code specific to stretch or xenial. Support for Xenial and Stretch was removed (`5154ddafca`, `0f4b1076ad`, `8944e0ad53`, `79acd5ae40`, `1219a2e854`), but not all codepaths were updated to remove their conditionals on it. Remove all code predicated on Xenial or Stretch. debathena support was migrated to Bionic, since that appears to be the current state of existing debathena servers.	2020-06-24 12:57:38 -07:00
Alex Vandiver	e4899eae8b	installer: Sync the claimed supported distros with the check. `0f4b1076ad` removed Ubuntu 16.04 "xenial" and Debian 9 "stretch" from the printed list of supported operating systems, but left them in the verification check that controls if that message is printed, effectively continuing to support them. Conversely, `439f0d3004` added Ubuntu 20.04 "focal" to the check, but not to the printed list. Synchronize to check and print the right supported distributions: Ubuntu 18.04 "bionic", Ubuntu 20.04 "focal", and Debian 10 "buster".	2020-06-24 12:57:38 -07:00
Alex Vandiver	58cb7cecd8	installer: Remove `--remote-postgres`, redundant with `--no-init-db`. The previous commit removed the only behavior difference between the two flags; both of them skip user/database creation, and the tables therein. Of the two options `--no-init-db` is more explicit as to what it does, as opposed to just one facet of when it might be used; remove `--remote-postgres`.	2020-06-24 12:57:38 -07:00
Alex Vandiver	7c6a25a43d	installer: Group and unify ordering of installer options. This also adds the missing `--no-overwrite-settings` option to `--help`.	2020-06-24 12:57:38 -07:00
Alex Vandiver	b165b4144d	installer: Prevent flags which conflict with `--no-overwrite-settings`. Since `--postgres-missing-dictionaries` edits `/etc/zulip/zulip.conf`, it interferes with the intent of `--no-overwrite-settings`. Make the two settings conflict, to prevent this unclear state.	2020-06-23 13:40:28 -07:00
Alex Vandiver	7f4a2527c0	installer: Make `--no-overwrite-settings` also preserve `zulip.conf`. This allows a path through the installer for places that have already configured `zulip.conf`, by extending the existing flag and behavior.	2020-06-23 13:40:28 -07:00
Alex Vandiver	27100b4507	installer: Fix mis-indentation.	2020-06-23 13:36:26 -07:00
Alex Vandiver	5b7be7ba5d	installer: Do not initialize db with --no-init-db. The `--no-init-db` option previously only controlled if `initialize-database` was run, which sets up the tables inside the database. If PostgreSQL was installed locally, it still attempted to create the user and empty database. This fails on hosts which are remote PostgreSQL hosts, and not application hosts, as: - They may already have a local database, and while `initialize-database` will detect and offer to abort if one is found, `--no-init-db` seems like it should be the option to not overwrite it - `flush-memcached` requires that a local venv be installed, which it often is not on non-frontend machines. Skip the database configuration when run with `--no-init-db`.	2020-06-23 13:36:26 -07:00
Anders Kaseorg	a4f2704301	flush-memcached: Replace a type: ignore with an assert. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-23 11:29:54 -07:00
Tim Abbott	60b800b1ac	upgrade-zulip-from-git: Fix setting postgres_version. The new logic to set postgres_version when upgrading never wrote the configuration file after making its edit.	2020-06-18 22:01:01 -07:00
Alex Vandiver	49a7a66004	install: Pin new apt-based installs to the latest postgresql. Since we now support Postgres versions from 10 to 12, we might as well have new installations start on Postgres 12 to avoid unnecessary migration/upgrade work.	2020-06-16 17:08:16 -07:00
Alex Vandiver	6979ed9d97	install: Use the apt postgres server packages from postgres. This allows Debian and Ubuntu administrators to reasonably seamlessly swap over to more recent version of postgres than ships with their distribution.	2020-06-16 17:05:46 -07:00
Alex Vandiver	03bffd3938	upgrade-zulip: Pin the postgres version to the OS default. We would prefer to use the postgres packages from Postgres themselves, if available. However, this requires ensuring that, for existing installs, we preserve the same version of postgres as their base distribution installed. Move the version-determination logic from being computed at puppet interpolation time, to being computed at install time and pinned into zulip.conf.	2020-06-16 17:05:46 -07:00
Alex Vandiver	e788ea52d2	upgrade-zulip: Use existing config helper functions.	2020-06-16 17:05:46 -07:00
Aman Agrawal	da84b19aea	upgrade-zulip: Shutdown servers with <3GB RAM when building static. Fixes #14643. This is to avoid running out of memory when building static assets with webpack while the server is running on low-RAM systems.	2020-06-15 22:17:02 -07:00
Aman Agrawal	81195abdbd	upgrade-zulip: Extract shutdown call into a function. This will help us call it as needed.	2020-06-15 22:17:02 -07:00
Vishnu KS	18ecf9bcfa	backup: Make restore-backup work in docker. Co-authored-by: Anders Kaseorg <anders@zulip.com> Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-15 21:37:14 -07:00
Anders Kaseorg	fa2496c229	terminate-psql-sessions: Rely on the caller to set PGHOST, PGUSER. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-15 21:37:14 -07:00
Vishnu KS	f2ce856b8f	scripts: Don't terminate current session in terminate-psql-sessions. This is a prep commit. Running terminate-psql-sessions command on docker-zulip results in the script exiting with non-zero exit status 2. This is because the current session also gets terminated while running terminate-psql-sessions command. To prevent that from happening we don't terminate the session created by terminate-psql-sessions.	2020-06-15 21:37:14 -07:00
Anders Kaseorg	5dc9b55c43	python: Manually convert more percent-formatting to f-strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	3461db7ef5	python: Convert percent formatting to "".format in certain files. These files can’t use f-strings yet because they need to run in Python 2 or Python 3.5. Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	a803e68528	email-mirror-postfix: Handle 8-bit messages correctly. Since JSON can’t represent bytes, we encode them with base64. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 20:24:06 -07:00
Anders Kaseorg	5050fb19f6	nagios: Don’t crash on missing cron file. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 16:49:32 -07:00
Anders Kaseorg	57a80856a5	python: Convert more "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format. Now including %d, %i, %u, and multi-line strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:39:00 -07:00
Anders Kaseorg	0d6c771baf	python: Guard against default value mutation with read-only types. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:31:27 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Alex Vandiver	4fe0444108	puppet: Install wal-g, not wal-e.	2020-06-11 15:52:43 -07:00
Anders Kaseorg	0e5946ee5a	python: Add noqa comments for the specific star imports we allow. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 15:36:43 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
arpit551	9e8f1aacb3	certbot: Switch to use certbot from apt. certbot-auto doesn’t work on Ubuntu 20.04, and won’t be updated; we migrate to using the certbot package shipped with the OS instead. Also made sure that certbot gets installed when running zulip-puppet-apply, to handle existing systems.	2020-06-08 21:59:29 -07:00
Anders Kaseorg	523907fe1d	upgrade-zulip: Add umask override. We already override the umask in upgrade-zulip-stage-2, but that’s too late since we’ve already written a bunch of files in stage 1. I would have removed the stage 2 override, but the OS upgrade documentation references running stage 2 directly. Fixes #15164. Note that an affected installation will need to upgrade twice, because the first upgrade uses the old stage 1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 21:57:05 -07:00
Anders Kaseorg	8dd83228e7	python: Convert "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, but with the NamedTuple changes reverted (see commit `ba7906a3c6`, #15132). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 15:31:20 -07:00
rht	07fa25dcd3	setup-yum-repo: Update url of postgresql rpm repo. The old url is dead.	2020-06-08 11:26:07 -07:00
Anders Kaseorg	0f63753926	install-node: Upgrade Node.js to 12.18.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-07 11:06:57 -07:00
Anders Kaseorg	333f7d16c9	logging: Pass more format arguments to logging. Commit `bdc365d0fe` (#14852) missed this because of https://github.com/returntocorp/semgrep/issues/831. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-26 11:42:23 -07:00
arpit551	439f0d3004	install: Add production support for Zulip on Ubuntu Focal. Install script now runs on Focal. Python 2 is now installed via the `python2` package in Focal.	2020-05-25 16:58:42 -07:00
arpit551	3971824d04	puppet: suppress puppet warnings with ruby 2.7. Ubuntu Focal comes with ruby 2.7 and the latest puppet has some issues with it so to suppress puppet warnings with ruby 2.7 we added RUBYOPT = "-W0" in the environment.	2020-05-25 16:56:11 -07:00
Tim Abbott	220620e7cf	sharding: Add basic sharding configuration for Tornado. This allows straight-forward configuration of realm-based Tornado sharding through simply editing /etc/zulip/zulip.conf to configure shards and running scripts/refresh-sharding-and-restart. Co-Author-By: Mateusz Mandera <mateusz.mandera@zulip.com>	2020-05-20 13:47:20 -07:00
Mateusz Mandera	28a6983b34	check-rabbitmq-queue: Log queue size in "queue stuck" alert.	2020-05-14 11:55:20 -07:00

1503 Commits