zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	2007c75061	install-node: Upgrade Node.js from 16.14.1 to 16.15.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-06-02 12:03:49 -07:00
Alex Vandiver	30457ecd02	upgrade-zulip-from-git: Stop mirroring the remote. The local `/srv/zulip.git` directory has been cloned with `--mirror` since it was first created as a local cache in `dc4b89fb08`. This made some sense at the time, since it was purely a cache of the remote, and not a home to local branches of its own. That changed in `3f83b843c2`, when we began using `git worktree`, which caused the `deployment-...` branches to begin being stored in `/src/zulip.git`. This caused intermixing of local and remote branches. When `02582c6956` landed, the addition of `--prune` caused all but the most recent deployment branch to be deleted upon every fetch -- leaving previous deployments with non-existent branches checked out: ``` zulip@example-prod-host:~/deployments/last$ git status On branch deployment-2022-04-15-23-07-55 No commits yet Changes to be committed: (use "git rm --cached <file>..." to unstage) new file: .browserslistrc new file: .codecov.yml new file: .codespellignore new file: .editorconfig [...snip list of every file in repo...] ``` Switch `/srv/zulip.git` to no longer be a `--mirror` cache of the origin. We reconfigure the remote to drop `remote.origin.mirror`, and delete all refs under `refs/pulls/` and `refs/heads/`, while preserving any checked-out branches. `refs/pulls/`, if the remote is the canonical upstream, contains _tens of thousands_ of refs, so pruning those refs trims off 20% of the repository size. Those savings require a `git gc --prune=now`, otherwise the dangling objects are ejected from the packfiles, which would balloon the repository up to more than three times its previous size. Repacking the repository is reasonable, in general, after removing such a large number of refs -- and the `--prune=now` is safe and will not lose data, as the `--mirror` was good at ensuring that the repository could not be used for any local state. The refname in the upgrade process was previously resolved from the union of local and remote refs, since they were in the same namespace. We instead now only resolve arguments as tags, then origin branches; this means that stale local branches will be skipped. Users who want to deploy from local branches can use `--remote-url=.`. Because the `scripts/lib/upgrade-zulip-from-git` file is "stage 1" and run from the old version's code, this will take two invocations of `upgrade-zulip-from-git` to take effect. Fixes #21901.	2022-06-01 16:06:15 -07:00
Alex Vandiver	6337f17923	upgrade: Add --skip-restart which preps but does not restart. This adds a --skip-restart which makes `deployments/next` in a state where it can be restarted into, but holds off on conducting that restart. This requires many of the same guarantees as `--skip-tornado`, in terms of there being no Puppet or database schema changes between the versions. Enforce those with `--skip-restart`, and also broaden both flags to prevent other, less common changes which nonetheless potentially might affect the other deploy.	2022-05-22 15:07:37 -07:00
Alex Vandiver	86a4e64726	upgrade: Enforce that --skip-tornado does not have Puppet or DB changes.	2022-05-22 15:07:18 -07:00
Alex Vandiver	ef7c2ea0ea	upgrade: Copy cache prefix with --skip-tornado. Because Tornado and Django use memcached as a shared cache for checking session information, they must agree on the prefix used to store those values. Subsequent commits will work to ensure that it is always _safe_ to share that cache.	2022-05-22 14:52:38 -07:00
Alex Vandiver	fa77be6e6c	upgrade: Only run Django system checks once, explicitly. These are expensive, and moving them to one explicit call early has considerable time savings in the critical period: ``` $ hyperfine './manage.py fill_memcached_caches' './manage.py fill_memcached_caches --skip-checks' Benchmark #1: ./manage.py fill_memcached_caches Time (mean ± σ): 5.264 s ± 0.146 s [User: 4.885 s, System: 0.344 s] Range (min … max): 5.119 s … 5.569 s 10 runs Benchmark #2: ./manage.py fill_memcached_caches --skip-checks Time (mean ± σ): 3.090 s ± 0.089 s [User: 2.853 s, System: 0.214 s] Range (min … max): 2.950 s … 3.204 s 10 runs Summary './manage.py fill_memcached_caches --skip-checks' ran 1.70 ± 0.07 times faster than './manage.py fill_memcached_caches' ```	2022-05-22 14:52:38 -07:00
Alex Vandiver	2e5a079ef4	upgrade: Check with zulip-puppet-apply to see if we can skip it.	2022-05-22 14:52:38 -07:00
Alex Vandiver	b15d8e0118	upgrade: Skip the pre-work if the server is already stopped. This optimization makes sense if the server is already running, but if it is already stopped, it is just prolonging the downtime.	2022-05-22 14:52:38 -07:00
Alex Vandiver	05af4b0a11	upgrade: Fill caches before the critical period, if possible.	2022-05-22 14:52:38 -07:00
Alex Vandiver	2f7068ffbb	upgrade: Move puppet class renames earlier. These do not need to happen during the critical period when the server is stopped.	2022-05-22 14:52:38 -07:00
Anders Kaseorg	f8957863a2	Revert "apt-repos: Downgrade PostgreSQL to dodge PGroonga regression." This reverts commit `9c8d2b7be3` (#21115). The PostgreSQL fix was released 2022-05-12. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-05-17 15:07:37 -07:00
Anders Kaseorg	3cb7d3d1dc	node_cache: Remove node_modules/.cache when copying. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-05-04 09:56:07 -07:00
Alex Vandiver	ba1237119c	log-search: Add a tool to search nginx logs by IP/hostname. This is a script to search nginx log files by server hostname or client IP address, and output matching lines, all while skipping common and less-interesting request lines.	2022-05-03 13:44:29 -07:00
Anders Kaseorg	e952641013	install: Resupport Ubuntu 22.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-05-03 09:41:08 -07:00
Anders Kaseorg	25c87cc7da	zulip-puppet-apply: Work around broken Puppet on Ubuntu 22.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-05-03 09:41:08 -07:00
Anders Kaseorg	080a806d60	build-pgroonga: Update PGroonga to 2.3.6. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-29 16:02:45 -07:00
Anders Kaseorg	098a514599	python: Use Python 3.8 shlex.join function. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Anders Kaseorg	0451d1e47f	zulip_tools: Replace universal_newlines with text. Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Anders Kaseorg	a543dcc8e3	Remove Debian 10 support. As a consequence: • Bump minimum supported Python version to 3.8. • Move Vagrant environment to Ubuntu 20.04, which has Python 3.8. • Move CI frontend tests to Ubuntu 20.04. • Move production build test to Ubuntu 20.04. • Move 3.4 upgrade test to Ubuntu 20.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-26 16:32:02 -07:00
Anders Kaseorg	cc30ed8ec7	actions: Delete zerver.lib.actions. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-14 17:14:38 -07:00
Alex Vandiver	09860dc284	check-database-compatibility: Sort and prettify output.	2022-04-06 14:10:46 -07:00
Alex Vandiver	eb31681934	check-database-compatibility: Ignore squashed and renamed migrations. Fixes: #21596.	2022-04-01 16:15:41 -07:00
Alex Vandiver	0af00a3233	upgrade: Mark puppet as having started the server. We previously used restart-server if puppet was run, as a nod to the fact that `supervisor reread && supervisor update` will _start_ service groups that were modified, even if they were previously stopped; this is because they are marked as `autostart=true`, which is honored on service change. However, upgrades want to run while there are no services running. If puppet is run, explicitly set the server as potentially being "up", so that a `shutdown_server()` before migrations, if they exist, will stop services.	2022-03-31 17:21:39 -07:00
Alex Vandiver	e9596637e7	upgrade: Move the shutdown_server calls to where they are relevant. shutdown_server is a noop if the server is already stopped; placing these in each block makes the logic more apparent.	2022-03-31 17:21:39 -07:00
Alex Vandiver	65e19c4fbd	supervisor: 'foo:' also matches 'foo'. `7c4293a7d3` switched to checking if the service was already running, and use `supervisorctl start` if it was not. Unfortunately, `list_supervisor_processes("zulip-tornado:")` did not include `zulip-tornado`, and as such a non-sharded process was always considered to _not_ be running, and was thus started, not restarted. Starting an already-started service is a no-op, and thus non-sharded tornado processes were never restarted. The observed behaviour is that requests to the tornado process attempt to load the user from the cache, with a different prefix from Django, and immediately invalidate the session and eject the user back to the login page. Fix the `list_supervisor_processes` logic to match without the trailing `:*`.	2022-03-31 10:41:41 -07:00
Anders Kaseorg	55882fb343	python: Use modern set comprehension syntax. Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-25 10:45:12 -07:00
Anders Kaseorg	1f68c73e66	supervisor: Update superseded super(C, self) syntax to superior super(). Generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-25 10:45:12 -07:00
Anders Kaseorg	2762121162	python: Convert last type comments to annotations. We had skipped these in #14693 so we could keep generating a friendly error on Python 3.5, but we gave that up in #19801. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-24 20:32:39 -07:00
Alex Vandiver	d7b59c86ce	puppet: Build wal-g from source for aarch64. Since wal-g does not provide binaries for aarch64, build them from source. While building them from source for arm64 would better ensure that build process is tested, the build process takes 7min and 700M of temp files, which is an unacceptable cost; we thus only build on aarch64. Since the wal-g build process uses submodules, which are not in the Github export, we clone the full wal-g repository. Because the repository is relatively small, we clone it anew on each new version, rather than attempt to manage the remotes. Fixes #21070.	2022-03-22 15:02:35 -07:00
Alex Vandiver	c0cc98c6a8	install: Re-order final steps. Move database creation to immediately before database initialization; this means it happens in a directory readable by the `zulip` user, as well as placing it alongside similar operations. It removes the check for the `zulip::postgresql_common` Puppet class; instead it keeps the check for `--no-init-db`, and switches to require `zulip::app_frontend_base`. This is a behavior change for any install of `zulip::postgresql_common`-only classes, but that is not a common form -- and such installs likely already pass `--no-init-db` because they are warm spare replicas. As a result, all non-`zulip::app_frontend_base` installs now skip database initialization, even without `--no-init-db`. This is clearly correct for, e.g. Redis-only hosts, and makes clearer that the frontend, not the database host, is responsible for database initialization.	2022-03-21 16:33:28 -07:00
Alex Vandiver	394f1eadde	setup: Rename postgresql-init-db to create-database. The old name was confusingly similar to initialize-database.	2022-03-21 16:33:28 -07:00
Anders Kaseorg	7d4b02738d	install-node: Upgrade Node.js from 16.14.0 to 16.14.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-17 15:24:46 -07:00
Alex Vandiver	52d363cada	upgrade: Skip re-checking of new bots on upgrade. This was added in `c770bdaa3a`, and we have not added any realm-internal bots since `c770bdaa3a`. Speed up the critical period during upgrades by skipping this step.	2022-03-14 14:14:53 -07:00
Alex Vandiver	d26a15b14d	setup-apt-repo: Make hashes file not contain full path. Using an absolute `ZULIP_SCRIPTS` path when computing sha245sums results in a set of hashes which varies based on the path that the script is called as. This means that each deploy _always_ has `setup-apt-repo --verify` fail, since it is a different base path. Make all paths passed to sha256sum be relative to the repository root, ensuring they can be compared across runs.	2022-03-12 17:24:19 -08:00
Anders Kaseorg	646e466341	install: Desupport Ubuntu 22.04 for now. Ubuntu 22.04 pushed a post-feature-freeze update to Python 3.10, breaking virtual environments in a Debian patch (https://bugs.launchpad.net/ubuntu/+source/python3.10/+bug/1962791). Also, our antique version of Tornado doesn’t work in 3.10, and we’ll need to do some work to upgrade that. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-07 11:46:07 -08:00
Anders Kaseorg	60e943b92e	install-node: Upgrade Node.js from 16.13.2 to 16.14.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-01 23:09:46 -08:00
Anders Kaseorg	de1fb2b8d0	check-database-compatibility: Ignore guardian, django.contrib.sites. We can safely ignore the presence of the extra tables that could be left behind in the database from when we had these installed (before Zulip 1.7.0 and 2.0.0, respectively). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-03-01 10:30:23 -08:00
Tim Abbott	98a05257ea	scripts: Print names of missing migrations in compatibility check. This will make it much easier to debug any situations where this happens.	2022-02-28 11:09:52 -08:00
Anders Kaseorg	894a50b5c9	install: Support Ubuntu 22.04. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-25 14:49:07 -08:00
Anders Kaseorg	f852af0709	upgrade-zulip-stage-2: Set default PostgreSQL version for Debian 11. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-25 14:49:07 -08:00
Anders Kaseorg	1fa2761790	upgrade-zulip-stage-2: Remove create_large_indexes optimization. This was only used for upgrading from Zulip < 1.9.0, which is no longer possible because Zulip < 2.1.0 had no common supported platforms with current main. If we ever want this optimization for a future migration, it would be better implemented using Django merge migrations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-23 11:59:45 -08:00
Anders Kaseorg	1629d6bfb3	python: Reformat with Black 22 (stable). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-18 18:03:13 -08:00
Alex Vandiver	1d2582c899	upgrade: Log the commit hash and directory when upgrading.	2022-02-16 12:33:58 -08:00
Anders Kaseorg	f6a701090c	setup-apt-repos: Don’t install lsb_release. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-14 16:38:53 -08:00
Anders Kaseorg	9c8d2b7be3	apt-repos: Downgrade PostgreSQL to dodge PGroonga regression. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	fdc1294993	setup-apt-repo: Support installing an APT preferences file. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	7077a289ae	setup-apt-repo: Move supported release check earlier. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Anders Kaseorg	c8bb98554e	setup-apt-repo: Use /etc/os-release instead of lsb_release. But still install lsb-release for now since Puppet acts funny without it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-13 19:11:49 -08:00
Tim Abbott	1a7c4a0276	scripts: Fix typo in logging statement.	2022-02-11 13:47:24 -08:00
Alex Vandiver	8da6098631	upgrade: Catch "upgrade" attempts which would downgrade the database. Attempting to "upgrade" from `main` to 4.x should abort; Django does not prevent running old code against the new database (though it likely errors at runtime), and `./manage.py migrate` from the old version during the "upgrade" does not downgrade the database, since the migrations are entirely missing in that directory, so don't get reversed. Compare the list of applied migrations to the list of on-disk migrations, and abort if there are applied migrations which are not found on disk. Fixes: #19284.	2022-02-10 16:02:49 -08:00
Alex Vandiver	71e02d7893	zulip_tools: Factor out ZULIP_VERSION parsing.	2022-02-10 16:02:49 -08:00
Anders Kaseorg	e1f42c1ac5	docs: Add missing space to compound verbs “back up”, “log in”, etc. Noun: backup, login, logout, lookup, setup. Verb: back up, log in, log out, look up, set up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 19:20:54 -08:00
Alex Vandiver	88c3f560ae	supervisor: Add a filter for only(-not)-running.	2022-01-26 12:39:54 -08:00
Alex Vandiver	7243c3c73d	scripts: Re-implement list_supervisor_processes using API.	2022-01-26 12:39:54 -08:00
Alex Vandiver	8e35cdb3da	scripts: Add a supervisor package, to use the XMLRPC Supervisor API. For many uses, shelling out to `supervisorctl` is going to produce better error messages. However, for instances where we wish to parse the output of `supervisorctl`, using the API directly is less brittle.	2022-01-26 12:39:54 -08:00
Alex Vandiver	a5496f4098	CVE-2021-43799: Set a secure Erlang cookie. The RabbitMQ docs state ([1]): RabbitMQ nodes and CLI tools (e.g. rabbitmqctl) use a cookie to determine whether they are allowed to communicate with each other. [...] The cookie is just a string of alphanumeric characters up to 255 characters in size. It is usually stored in a local file. ...and goes on to state (emphasis ours): If the file does not exist, Erlang VM will try to create one with a randomly generated value when the RabbitMQ server starts up. Using such generated cookie files are appropriate in development environments only. The auto-generated cookie does not use cryptographic sources of randomness, and generates 20 characters of `[A-Z]`. Because of a semi-predictable seed, the entropy of this password is thus less than the idealized 26^20 = 94 bits of entropy; in actuality, it is 36 bits of entropy, or potentially as low as 20 if the performance of the server is known. These sizes are well within the scope of remote brute-force attacks. On provision, install, and upgrade, replace the default insecure 20-character Erlang cookie with a cryptographically secure 255-character string (the max length allowed). [1] https://www.rabbitmq.com/clustering.html#erlang-cookie	2022-01-25 02:13:53 +00:00
Alex Vandiver	bd7deed691	upgrade: Show output from (re)starting zulip. `5c450afd2d`, in ancient history, switched from `check_call` to `check_output` and throwing away its result. Use check_call, so that we show the steps to (re)starting the server.	2022-01-25 01:52:34 +00:00
Alex Vandiver	e705883857	CVE-2021-43799: During upgrades, restart rabbitmq if necessary. Check if it is listening on a public interface on port 25672, and if so shut it down so it can pick up the new configuration.	2022-01-25 01:51:56 +00:00
Alex Vandiver	da5201b986	upgrade: Make calling shutdown_server twice, only try once.	2022-01-25 01:48:05 +00:00
Alex Vandiver	43d63bd5a1	puppet: Always set the RabbitMQ nodename to zulip@localhost. This is required in order to lock down the RabbitMQ port to only listen on localhost. If the nodename is `rabbit@hostname`, in most circumstances the hostname will resolve to an external IP, which the rabbitmq port will not be bound to. Installs which used `rabbit@hostname`, due to RabbitMQ having been installed before Zulip, would not have functioned if the host or RabbitMQ service was restarted, as the localhost restrictions in the RabbitMQ configuration would have made rabbitmqctl (and Zulip cron jobs that call it) unable to find the rabbitmq server. The previous commit ensures that configure-rabbitmq is re-run after the nodename has changed. However, rabbitmq needs to be stopped before `rabbitmq-env.conf` is changed; we use an `onlyif` on an `exec` to print the warning about the node change, and let the subsequent config change and notify of the service and configure-rabbitmq to complete the re-configuration.	2022-01-25 01:48:02 +00:00
Alex Vandiver	3bfcfeac24	puppet: Run configure-rabbitmq on nodename change. `/etc/rabbitmq/rabbitmq-env.conf` sets the nodename; anytime the nodename changes, the backing database changes, and this requires re-creating the rabbitmq users and permissions. Trigger this in puppet by running configure-rabbitmq after the file changes.	2022-01-25 01:46:51 +00:00
Anders Kaseorg	21548ff7c0	install-node: Upgrade Node.js from 16.13.1 to 16.13.2. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-24 15:55:38 -08:00
Alex Vandiver	a3adaf4aa3	puppet: Fix standalone certbot configurations. This addresses the problems mentioned in the previous commit, but for existing installations which have `authenticator = standalone` in their configurations. This reconfigures all hostnames in certbot to use the webroot authenticator, and attempts to force-renew their certificates. Force-renewal is necessary because certbot contains no way to merely update the configuration. Let's Encrypt allows for multiple extra renewals per week, so this is a reasonable cost. Because the certbot configuration is `configobj`, and not `configparser`, we have no way to easily parse to determine if webroot is in use; additionally, `certbot certificates` does not provide this information. We use `grep`, on the assumption that this will catch nearly all cases. It is possible that this will find `authenticator = standalone` certificates which are managed by Certbot, but not Zulip certificates. These certificates would also fail to renew while Zulip is running, so switching them to use the Zulip webroot would still be an improvement. Fixes #20593.	2022-01-24 12:13:44 -08:00
Alex Vandiver	76ce8631c0	setup: Install a temporary certificate, before certbot runs. Installing certbot with --method=standalone means that the configuration file will be written to assume that the standalone method will be used going forward. Since nginx will be running, attempts to renew the certificate will fail. Install a temporary self-signed certificate, just to allow nginx to start, and then follow up (after applying puppet to start nginx) with the call to setup-certbot, which will use the webroot authenticator. The `setup-certbot --method=standalone` option is left intact, for use in development environments. Fixes part of #20593; it does not address installs which were previously improperly configured with `authenticator = standalone`.	2022-01-24 12:13:44 -08:00
Anders Kaseorg	97e4e9886c	python: Replace universal_newlines with text. This is supported in Python ≥ 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-23 22:16:01 -08:00
Anders Kaseorg	a58a71ef43	Remove Ubuntu 18.04 support. As a consequence: • Bump minimum supported Python version to 3.7. • Move Vagrant environment to Debian 10, which has Python 3.7. • Move CI frontend tests to Debian 10. • Move production build test to Debian 10. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-21 17:26:14 -08:00
Alex Vandiver	677467f040	upgrade-zulip-from-git: Fix upstream URL for existing deploys.	2022-01-18 21:10:38 -08:00
Alex Vandiver	bad58cdca6	upgrade-zulip-from-git: Fix the upstream URL not be the custom remote.	2022-01-18 21:10:38 -08:00
Anders Kaseorg	e2cc554077	zulip_tools: Rename may_be_perform_purging to maybe_perform_purging. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-01-12 13:21:35 -08:00
Alex Vandiver	b31658482b	upgrade-zulip: Pass any arguments down to upgrade-zulip-stage-2. This is the equivalent of `93f3da4c05` but for the tarball codepath.	2022-01-11 14:26:54 -08:00
Alex Vandiver	06e115bb00	zulip_tools: Switch get_deploy_options to use shlex.split. This makes it honor quoting in the config file.	2022-01-11 14:26:54 -08:00
Alex Vandiver	4aaa250623	zulip_tools: Fix a typo in a comment.	2022-01-05 14:48:52 -08:00
Alex Vandiver	9d85f64e5a	upgrade-zulip-stage-2: Pass through --skip-tornado and --less-graceful. These restart-server arguments are useful to be able to provide to `upgrade-zulip`.	2021-12-31 11:17:14 -08:00
Alex Vandiver	fb3368b482	restart-server: Factor out argparser, to allow reuse.	2021-12-31 11:17:14 -08:00
Alex Vandiver	93f3da4c05	upgrade-from-git: Pass unknown options through to the upgrade process.	2021-12-31 11:17:14 -08:00
Anders Kaseorg	82748d45d8	install-yarn: Use test -ef in case /srv is a symlink. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-30 13:42:07 -08:00
Anders Kaseorg	0b454dda12	install: Try apt-get update if the Ubuntu universe check fails. On a system where ‘apt-get update’ has never been run, ‘apt-cache policy’ may show no repositories at all. Try to correct this with ‘apt-get update’ before giving up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-16 17:56:23 -08:00
Alex Vandiver	01e8f752a8	puppet: Use certbot package timer, not our own cron job. The certbot package installs its own systemd timer (and cron job, which disabled itself if systemd is enabled) which updates certificates. This process races with the cron job which Zulip installs -- the only difference being that Zulip respects the `certbot.auto_renew` setting, and that it passes the deploy hook. This means that occasionally nginx would not be reloaded, when the systemd timer caught the expiration first. Remove the custom cron job and `certbot-maybe-renew` script, and reconfigure certbot to always reload nginx after deploying, using certbot directory hooks. Since `certbot.auto_renew` can't have an effect, remove the setting. In turn, this removes the need for `--no-zulip-conf` to `setup-certbot`. `--deploy-hook` is similarly removed, as running deploy hooks to restart nginx is now the default; pass `--no-directory-hooks` in standalone mode to not attempt to reload nginx. The other property of `--deploy-hook`, of skipping symlinking into place, is given its own flog.	2021-12-09 13:47:33 -08:00
Tim Abbott	9aa2e0ad45	upgrade-zulip-from-git: Improve webpack failure error handling. We've had a number of unhappy reports of upgrades failing due to webpack requiring too much memory. While the previous commit will likely fix this issue for everyone, it's worth improving the error message for failures here. We avoid doing the stop+retry ourselves, because that could cause an outage in a production system if webpack fails for another reason. Fixes #20105.	2021-12-09 12:26:34 -08:00
Tim Abbott	72b381d749	upgrade-zulip-from-git: Require more memory to run webpack. Since the upgrade to Webpack 5, we've been seeing occasional reports that servers with roughly 4GiB of RAM were getting OOM kills while running webpack. Since we can't readily optimize the memory requirements for webpack itself, we should raise the RAM requirements for doing the lower-downtime upgrade strategy. Fixes #20231.	2021-12-09 12:23:25 -08:00
Anders Kaseorg	2e5af073b7	install-node: Upgrade Node.js from 16.13.0 to 16.13.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-03 14:33:53 -08:00
Alex Vandiver	544e8c569e	install: Switch default to PostgreSQL 14.	2021-11-08 18:21:46 -08:00
Anders Kaseorg	f2a443a736	install-node: Upgrade Node.js from 14.18.1 to 16.13.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-11-05 17:34:13 -07:00
Anders Kaseorg	458844a2f5	install-yarn: Verify that the install location is /srv/zulip-yarn. scripts.lib.node_cache expects Yarn to be in /srv/zulip-yarn, so if it’s installed somewhere else, even if it’s the right version, we need to reinstall it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-11-03 16:49:58 -07:00
rht	bb8504d925	lint: Fix typos found by codespell.	2021-10-19 16:51:13 -07:00
Anders Kaseorg	291087d70c	install-yarn: Upgrade Yarn from 1.22.11 to 1.22.17. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-17 07:15:09 -07:00
Anders Kaseorg	7df96b78c6	install-node: Upgrade Node.js from 14.17.6 to 14.18.1. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-17 07:15:09 -07:00
Anders Kaseorg	2f993f1a79	install-node: Stop using NVM. NVM doesn’t check hashes or signatures and really just adds complexity we don’t need. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-24 06:58:32 -07:00
Anders Kaseorg	902883d818	setup_venv: Skip virtualenv’s automatic download of setuptools. It recently started failing on Debian 10 (buster). We immediately follow this by replacing these packages with our own versions from pip.txt, anyway. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-23 14:29:04 -07:00
Anders Kaseorg	08e459b393	zulip_tools: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-22 13:58:46 -07:00
Anders Kaseorg	9bed17e0ab	install-node: Upgrade Node.js from 14.17.5 to 14.17.6. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-13 10:12:43 -07:00
Gaurav Pandey	502697d239	docs: Add documentation for bullseye support. The support for bullseye was added in #17951 but it was not documented as bullseye was frozen and did not have proper configuration files, hence wasn't documented. Since now bullseye is released as a stable version, it's support can be documented.	2021-09-09 11:05:16 -07:00
Anders Kaseorg	02582c6956	upgrade-zulip-from-git: Run git fetch with --prune. This prevents upgrading to an obsolete version of a branch that has been deleted or renamed. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-01 05:34:57 -07:00
Anders Kaseorg	3cb66d59ac	install: Remove /dev/null redirect for zulip-puppet-apply. The usual output from this command looks like Notice: Compiled catalog for localhost in environment production in 2.33 seconds Notice: /Stage[main]/Zulip::Apt_repository/Exec[setup_apt_repo]/returns: current_value 'notrun', should be ['0'] (noop) Notice: Class[Zulip::Apt_repository]: Would have triggered 'refresh' from 1 event Notice: Stage[main]: Would have triggered 'refresh' from 1 event Notice: Applied catalog in 1.20 seconds which doesn’t seem abnormally alarming, and hiding it makes failures harder to diagnose. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-31 16:30:53 -07:00
Anders Kaseorg	7b2e585213	install-yarn: Upgrade Yarn from 1.22.10 to 1.22.11. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-23 12:33:27 -07:00
Anders Kaseorg	ebb8e9109c	install-node: Upgrade Node.js from 14.17.3 to 14.17.5. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-23 12:29:04 -07:00
Anders Kaseorg	4206e5f00b	python: Remove locally dead code. These changes are all independent of each other; I just didn’t feel like making dozens of commits for them. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-19 01:51:37 -07:00
Anders Kaseorg	5483ebae37	python: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	ad5f0c05b5	python: Remove default "utf8" argument for encode(), decode(). Partially generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	1760897a8c	python: Remove default "r" mode for open(). Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	3665deb93a	python: Remove unnecessary intermediate lists. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
manavdesai27	572cef9a0f	provision: Add support for Fedora 34.	2021-07-20 12:10:41 -07:00
Anders Kaseorg	47897c76a2	scripts: Use curl -f (--fail). This makes curl exit with nonzero status on HTTP 4xx/5xx errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-13 16:47:49 -07:00
Alex Vandiver	c94bdd8534	zulip_tools: Find missing processes/groups in list_supervisor_processes. Nonexistent processes and groups passed to `supervisortctl status` are printed to STDOUT as follows: ``` $ supervisorctl status zulip-django nonexistent-process nonexistent-group:* nonexistent-process: ERROR (no such process) nonexistent-group: ERROR (no such group) zulip-django RUNNING pid 16043, uptime 17:31:31 ``` On supervisor 4 and above, this exits with an exit code of 4; previously, it returned exit code 0. Ubuntu 18.04 has version 3.3.1, and Ubuntu 20.04 has version 4.1.0. Skip any lines with `ERROR (no such ...)`, and accept exit code 4 from `supervisorctl status`.	2021-07-09 10:04:53 -07:00
Alex Vandiver	85a9c0982a	zulip_tools: Extract out `list_supervisor_processes`.	2021-07-09 10:04:53 -07:00
Anders Kaseorg	d83c91526b	install-node: Upgrade Node.js from 14.17.0 to 14.17.3. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-05 14:51:24 -07:00
Anders Kaseorg	0ba9114c22	install-yarn: Rewrite Yarn installer. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-27 16:58:17 -07:00
Anders Kaseorg	91bfebca7d	install: Replace wget with curl. curl uses Happy Eyeballs to avoid long timeouts on systems with broken IPv6. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-25 09:05:07 -07:00
Anders Kaseorg	3b60b25446	ci: Remove bullseye hack. base-files 11.1 marked bullseye as Debian 11 in /etc/os-release. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-24 14:35:51 -07:00
Anders Kaseorg	bf361e9951	ci: Remove uses of VERSION_CODENAME. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-24 14:35:51 -07:00
Tim Abbott	28d49edee3	script: Add --no-headings option to purge-old-deployments. This parameter is somewhat useful, and adding this also fixes a regression where purge-old-deployments would crash since the changes around `c5580607a7` because of inconsistent supported args lists.	2021-06-17 15:49:23 -07:00
Mateusz Mandera	06c0a29e47	email-mirror-postfix: Choose scheme based on http_only config. Fixes #16659. If the server is behind a reverse proxy with http_only=True, the requests made by email-mirror-postfix need to use http, as https doesn't work.	2021-06-17 09:06:09 -07:00
Alex Vandiver	d51272cc3d	puppet: Remove zulip_deliver_scheduled_* from zulip-workers:. Staging and other hosts that are `zulip::app_frontend_base` but not `zulip::app_frontend_once` do not have a /etc/supervisor/conf.d/zulip/zulip-once.conf and as such do not have `zulip_deliver_scheduled_emails` or `zulip_deliver_scheduled_messages` and thus supervisor will fail to reload. Making the contents of `zulip-workers` contingent on if the server is _also_ a `-once` server is complicated, and would involve using Concat fragments, which severely limit readability. Instead, expel those two from `zulip-workers`; this is somewhat reasonable, since they are use an entirely different codepath from zulip_events_, using the database rather than RabbitMQ for their queuing.	2021-06-14 17:12:59 -07:00
Riken Shah	45af71e33b	clean_unused_caches: Allow the main function to accept `Namespace` args. This commit will allow us to pass the arguments in the 'clean...' functions when calling the `main` function (in `provision`). It also changes args parsing function location to `if __name__ == "__main__"` block as we wouldn't need it to parse args when we call the function.	2021-06-12 07:28:16 -07:00
Riken Shah	4f54e15993	refactor: Convert `clean-unused-caches` to`clean_unused_caches.py`. We convert the `clean-unused-caches` script to a python file so we can run it in provision by importing it instead of running the script, hence saving some time.	2021-06-12 07:28:16 -07:00
Anders Kaseorg	d8cb418586	zulip_tools: Flush ‘set -x’-style messages in run. Otherwise they often get buffered until after the command actually runs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-09 14:05:31 -07:00
Anders Kaseorg	342834ee9c	python: Simplify stdio flushing using print(…, flush=True). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-09 14:05:31 -07:00
Anders Kaseorg	bc169d63a7	install-node: Upgrade Node.js from 14.16.1 to 14.17.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-06-08 16:02:12 -07:00
Alex Vandiver	e080a05b05	node_cache: Serialize to structured data before hashing. Appending data back-to-back without serializing it loses the information about where the breaks between them lie, which can lead to different inputs having the same hash.	2021-05-27 22:47:56 -07:00
Alex Vandiver	87a109e3e0	puppet: Pull in pinned puppet modules. Using puppet modules from the puppet forge judiciously will allow us to simplify the configuration somewhat; this specifically pulls in the stdlib module, which we were already using parts of.	2021-05-27 21:14:48 -07:00
Alex Vandiver	f3eea72c2a	setup: Merge multiple setup-apt-repo scripts into one. This moves the `.asc` files into subdirectories, and writes out the according `.list` files into them. It moves from templates to written-out `.list` files for clarity and ease of implementation (Debian and Ubuntu need different templates for `zulip`), and as a way of making explicit which releases are supported for each list. For the special-case of the PGroonga signing key, we source an additional file within the directory. This simplifies the process for adding another class of `.list` file.	2021-05-26 14:42:29 -07:00
Adam Birds	4539899cae	installer: Add support for custom database user and dbname. Add support for custom database names and database users, which can be set with the `--postgresql-database-name` and `--postgresql-database-user` install script options. If these parameters aren't provided, then the defaults remain "zulip". Fixes #17662. Co-authored-by: Alex Vandiver <alexmv@zulip.com>	2021-05-25 13:56:05 -07:00
Alex Vandiver	7ff3c9f966	upgrade-zulip: Support arbitrary database user and dbname. Co-authored-by: Adam Birds <adam.birds@adbwebdesigns.co.uk>	2021-05-25 13:56:05 -07:00
Adam Birds	21cc186105	installer: Add run_psql_as_postgres function zulip_tools.py. Add a helper `run_psql_as_postgres` function in `scripts/lib/zulip_tools.py`. This is preparatory refactoring for the work to add custom database and user names.	2021-05-24 16:58:11 -07:00
Alex Vandiver	81644f110e	install: $ZULIP_ADMINISTRATOR may be unset for non-frontend hosts.	2021-05-23 13:29:23 -07:00
Anders Kaseorg	09f6ba1971	install: Run git config commands from a known readable cwd. Fixes this error when running the installer from a directory that isn’t world-readable: + su zulip -c 'git config --global user.email anders@zulip.com' fatal: cannot come back to cwd: Permission denied Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 22:00:29 -07:00
Tim Abbott	de47feab43	scripts: Fix check for services running when upgrading. When upgrading from a pre-4.0 release, scripts/stop-server logic would check whether supervisord configuration files were present to determine what it needed to restart, but only considered paths to those files that are introduced in Zulip 4.0. Fixed #18493.	2021-05-13 18:57:19 -07:00
Anders Kaseorg	3f83b843c2	upgrade-zulip-from-git: Create deployment directories with git worktree. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-13 13:42:05 -07:00
Tim Abbott	03420831b0	upgrade-zulip-from-git: Fetch tags from upstream repository. This ensures that the `git describe` queries that we run for caching Zulip's Git version are guaranteed to include recent releases. This change ensures that we have accurate output even if we're pointed at a fork of Zulip that never updates its tags. Additionally, it will make it possible to record the `git merge-base upstream/master` in future commits. Note that because we run this code before unpacking the new version, the pre-upgrade version of this code runs. As a result, we cannot assume that the upstream repository exists.	2021-05-13 11:17:25 -07:00
Alex Vandiver	3ccb77da74	install: Tell NVM to not change $PATH earlier. This removes a possible window where an installer error could leave `nvm` in a state where it had prepended the full path to the newly-installed `npm` to `$PATH`; we would like to avoid `nvm` fiddling with path whenever possible (ref `ebe930ab2c`).	2021-05-11 11:25:34 -10:00
Anders Kaseorg	9ba48c4ed3	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 22:42:39 -07:00
Anders Kaseorg	d0c6f4f400	python: Strip leading and trailing spaces from docstrings. This is enforced by Black ≥ 21.4b0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 22:42:39 -07:00
Robert Imschweiler	534d78232c	scripts: Add {start,stop,restart}-server support for postgresql role. During the upgrade process of a postgresql-only Zulip installation, (`puppet_classes = zulip::profile::postgresql` in `/etc/zulip/zulip.conf`) either `scripts/start-server` or `scripts/stop-server` fail because they try to handle supervisor services that are not available (e.g. Tornado) since only `/etc/supervisor/conf.d/zulip/zulip_db.conf` is present and not `/etc/supervisor/conf.d/zulip/zulip.conf`. While this wasn't previously supported, it's a pretty reasonable thing to do, and can be readily supported by just adding a few conditionals.	2021-05-07 09:41:05 -07:00
Anders Kaseorg	405bc8dabf	requirements: Remove Thumbor. Thumbor and tc-aws have been dragging their feet on Python 3 support for years, and even the alphas and unofficial forks we’ve been running don’t seem to be maintained anymore. Depending on these projects is no longer viable for us. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-06 20:07:32 -07:00
Alex Vandiver	eda9ce2364	locale: Use `C.UTF-8` rather than `en_US.UTF-8`. The `en_US.UTF-8` locale may not be configured or generated on all installs; it also requires that the `locales` package be installed. If users generate the `en_US.UTF-8` locale without adding it to the permanent set of system locales, the generated `en_US.UTF-8` stops working when the `locales` package is updated. Switch to using `C.UTF-8` in all cases, which is guaranteed to be installed. Fixes #15819.	2021-05-04 08:51:46 -07:00
Mateusz Mandera	dd7f3a1dce	upgrade: Use restart-server unless --skip-puppet is used. In some cases, puppet can end up restarting supervisord services - which will use code from the old deployment, because when puppet runs, /home/zulip/deployments/current still points there. Thus restart-server needs to be used in favor of start-server, unless we know that puppet has been skipped.	2021-05-03 08:12:54 -07:00
Alex Vandiver	ebe930ab2c	upgrade: Set an explicit value for PATH. Previous versions of zulip used `nvm alias default ...` to have `nvm` prepend the full path to the latest `node` install to the `PATH` in root's shell. Unfortunately, this means that `update-prod-static`, when called from `upgrade-zulip-stage-2` after an upgrade of node in `install-node`, would still have the full path to the _old_ `node` at the start of its PATH, because the PATH of `upgrade-zulip-stage-2` would still be unchanged. Bootstrap out of this by setting a known-reasonable PATH during upgrade, and remove the problematic `nvm alias default` behaviour. Fixes #18258.	2021-05-01 07:16:45 -07:00
Alex Vandiver	49144247dd	install: Set explicit value for PATH. In Debian, becoming root as `su` does not alter the `$PATH`; this can lead to the root user not having `/usr/sbin` in its path, and thus the `useradd zulip` step of the installer fails. Fixes #17441.	2021-05-01 07:16:45 -07:00
Alex Vandiver	ae2c377d13	postgresql: Switch to defaulting to PostgreSQL 13.	2021-04-27 16:55:04 -07:00
Robert Imschweiler	ba25580b19	clean-unused-caches: Handle non-existent yarn cache.	2021-04-27 10:02:49 -07:00
Riken Shah	1288dcbaaf	clean-unused-caches: Add script to remove redundant yarn cache. This commit removes redundant yarn cache by removing the old version directories, i.e. All the directory under `~/.cache/yarn` except `~/.cache/yarn/v6` (current version directory). Fixes #15964.	2021-04-26 16:28:08 -07:00
Alex Vandiver	6db454b252	upgrade: Call start-server rather than restart-server if we stopped it. This saves a little time, and thus causes a shorter outage window, since we will not try to stop the services; we know they are already down.	2021-04-21 10:28:30 -07:00
Alex Vandiver	16650ba239	upgrade: Call ./scripts/stop-server rather than duplicate the logic.	2021-04-21 10:28:30 -07:00
Alex Vandiver	0de8357820	scripts: Fix path to additional Zulip supervisor files. The path which contains all of the Zulip supervisor files changed in `3ab9b31d2f` to make it easier to purge now-unwanted supervisor configuration files. However, the paths that the zulip upgrade process, and restart-server, look at were not adjusted. Fix the supervisor configuration file paths.	2021-04-21 10:24:08 -07:00
Alex Vandiver	de41a10d38	upgrade: Install python3-yaml as needed. `3314fefaec` started needing `python3-yaml`, but incorrectly claimed that it was always an indirect dependency; it is a dependency of `ubuntu-minimal` on 20.04, but not required on 18.04 or Debian. We cannot install it in puppet because then is definitionally too late; it is needed at load time by `zulip-puppet-apply`. Install `python3-yaml`, but guarded by a simple check so as to not further slow most installs. Fixes #18179.	2021-04-21 09:52:56 -07:00
Alex Vandiver	4c8502f7fd	upgrade: Show fewer stacktraces. The stacktraces here are seldom useful -- for the calls to upgrade-stage-2, we know precisely what was run. For the `run` wrapper, the output contains the command that failed, which is sufficient to identify where in the upgrade process it was. Showing more stacktrace below the actual error merely confuses users and scrolls the real error off of the screen.	2021-04-21 09:51:40 -07:00
Siddharth Asthana	d2706fa246	install: Create a .gitconfig file for the zulip user. For installs which use the `upgrade-zulip-from-git` process, the deployment directory is a git checkout. This means that an administrator can, as an emergency tool, run `git revert` and similar commands -- assuming there is a `~/.gitconfig` set up for the zulip user. Add commands to `scripts/lib/install` to create a `~/.gitconfig` file at installation time. The `user.name` and `user.email` fields are set to the hostname and passed-in `--email` value, respectively. Fixes #18039.	2021-04-20 22:47:20 -07:00
Gaurav Pandey	feb720b463	install: Add beta support for debian bullseye for production. This won't work on a real bullseye system until Bullseye actually officially releases. Fixes part of #17863.	2021-04-15 21:38:31 -07:00
Gaurav Pandey	78524d4f87	provision: Add support for debian bullseye. Fixes part of #17863.	2021-04-15 21:38:31 -07:00
Anders Kaseorg	b6b117274c	install-node: Upgrade Node.js to 14.16.1 and nvm to 0.38.0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-07 21:05:01 -07:00

1 2 3 4 5 ...

920 Commits