zulip

Commit Graph

Author	SHA1	Message	Date
Tim Abbott	9821dfa9fc	puppet: The letsencrypt package is debian is now certbot. It was an alias starting with Ubuntu Xenial, and will eventually be removed.	2020-04-16 17:30:01 -07:00
Tim Abbott	8e5a866122	puppet: Update tuning for load average monitoring.	2020-04-16 16:47:05 -07:00
Anders Kaseorg	c734bbd95d	python: Modernize legacy Python 2 syntax with pyupgrade. Generated by `pyupgrade --py3-plus --keep-percent-format` on all our Python code except `zthumbor` and `zulip-ec2-configure-interfaces`, followed by manual indentation fixes. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-09 16:43:22 -07:00
Vishnu KS	449f7e2d4b	team: Generate team page data using cron job. This eliminates the contributors data as a possible source of flakiness when installing Zulip from Git. Fixes #14351.	2020-04-08 12:52:31 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
Anders Kaseorg	7ff9b22500	docs: Convert many http URLs to https. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-03-26 21:35:32 -07:00
Anders Kaseorg	687553a661	setup_path_on_import: Replace with setup_path function. isort 5 knows not to reorder imports across function calls, so this will stop isort from breaking our code. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-02-25 15:40:21 -08:00
Mateusz Mandera	4c5a8e6f0c	queue: Remove missedmessage_email_senders.	2020-01-31 12:13:51 -08:00
Tim Abbott	dd969b5339	install: Remove references to "Zulip Voyager". "Zulip Voyager" was a name invented during the Hack Week to open source Zulip for what a single-system Zulip server might be called, as a Star Trek pun on the code it was based on, "Zulip Enterprise". At the time, we just needed a name quickly, but it was never a good name, just a placeholder. This removes that placeholder name from much of the codebase. A bit more work will be required to transition the `zulip::voyager` Puppet class, as that has some migration work involved.	2020-01-30 12:40:41 -08:00
Tim Abbott	d70e799466	bots: Remove FEEDBACK_BOT implementation. This legacy cross-realm bot hasn't been used in several years, as far as I know. If we wanted to re-introduce it, I'd want to implement it as an embedded bot using those common APIs, rather than the totally custom hacky code used for it that involves unnecessary queue workers and similar details. Fixes #13533.	2020-01-25 22:41:39 -08:00
Anders Kaseorg	ea6934c26d	dependencies: Remove WebSockets system for sending messages. Zulip has had a small use of WebSockets (specifically, for the code path of sending messages, via the webapp only) since ~2013. We originally added this use of WebSockets in the hope that the latency benefits of doing so would allow us to avoid implementing a markdown local echo; they were not. Further, HTTP/2 may have eliminated the latency difference we hoped to exploit by using WebSockets in any case. While we’d originally imagined using WebSockets for other endpoints, there was never a good justification for moving more components to the WebSockets system. This WebSockets code path had a lot of downsides/complexity, including: * The messy hack involving constructing an emulated request object to hook into doing Django requests. * The `message_senders` queue processor system, which increases RAM needs and must be provisioned independently from the rest of the server). * A duplicate check_send_receive_time Nagios test specific to WebSockets. * The requirement for users to have their firewalls/NATs allow WebSocket connections, and a setting to disable them for networks where WebSockets don’t work. * Dependencies on the SockJS family of libraries, which has at times been poorly maintained, and periodically throws random JavaScript exceptions in our production environments without a deep enough traceback to effectively investigate. * A total of about 1600 lines of our code related to the feature. * Increased load on the Tornado system, especially around a Zulip server restart, and especially for large installations like zulipchat.com, resulting in extra delay before messages can be sent again. As detailed in https://github.com/zulip/zulip/pull/12862#issuecomment-536152397, it appears that removing WebSockets moderately increases the time it takes for the `send_message` API query to return from the server, but does not significantly change the time between when a message is sent and when it is received by clients. We don’t understand the reason for that change (suggesting the possibility of a measurement error), and even if it is a real change, we consider that potential small latency regression to be acceptable. If we later want WebSockets, we’ll likely want to just use Django Channels. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-01-14 22:34:00 -08:00
Tim Abbott	f84c037225	puppet: Tune check_postgres_locks parameters. This has been a spurious alert for a long time. It's unclear that this check is useful at all, but if it spikes dramatically above what's normal, there's perhaps still utility in being alerted.	2019-10-23 15:04:38 -07:00
Tim Abbott	e4dee9532c	nagios: Update configuration for user_activity worker change. Since LoopQueueProcessingWorker jobs cannot be monitored by checking for connected consumers (since they poll, rather than consuming as events arrive), they can't be monitored with check_consumers. It's OK, because that monitoring was redundant with monitoring for potential growth in their queue that we have as well. Also clean up the block comments for the two other similar queue procesors.	2019-09-23 11:49:46 -07:00
Anders Kaseorg	0962393933	cleanup: Delete trailing newlines. Delete trailing newlines from all files, except tools/ci/success-http-headers.txt and tools/setup/dev-motd, where they are significant, and static/third, where we want to stay close to upstream. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-06 23:29:11 -07:00
Anders Kaseorg	becef760bf	cleanup: Delete leading newlines. Previous cleanups (mostly the removals of Python __future__ imports) were done in a way that introduced leading newlines. Delete leading newlines from all files, except static/assets/zulip-emoji/NOTICE, which is a verbatim copy of the Apache 2.0 license. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-06 23:29:11 -07:00
Wyatt Hoodes	a109508e34	typing: Remove now-unnecessary conditional import. As a result of dropping support for trusty, we can remove our old pattern of putting `if False` before importing the typing module, which was essential for Python 3.4 support, but not required and maybe harmful on newer versions. cron_file_helper check_rabbitmq_consumers hash_reqs check_zephyr_mirror check_personal_zephyr_mirrors check_cron_file zulip_tools check_postgres_replication_lag api_test_helpers purge-old-deployments setup_venv node_cache clean_venv_cache clean_node_cache clean_emoji_cache pg_backup_and_purge restore-backup generate_secrets zulip-ec2-configure-interfaces diagnose check_user_zephyr_mirror_liveness	2019-07-29 15:18:22 -07:00
Wyatt Hoodes	e331a758c3	python: Migrate open statements to use with. This is low priority, but it's nice to be consistently using the best practice pattern. Fixes: #12419.	2019-07-20 15:48:52 -07:00
Tim Abbott	271319fb13	puppet: Fix hacky release test for whether we're in EC2. The result is still a bit hacky, but guaranteed to be correct if we adjust the OS version of our systems, which we of course will do over time.	2019-06-25 22:19:04 -07:00
Tim Abbott	8d8cfb314b	puppet: Remove zulip_ops configuration for trusty. There are no longer any zulip_ops systems using trusty.	2019-06-25 22:09:06 -07:00
Tim Abbott	b41c2d93d1	puppet: Exclude squashfs filesystems from nagios disk checks. These generally aren't being written to.	2019-06-16 16:22:23 -07:00
Tim Abbott	0ec1b4e82c	puppet: Move check_send_receive_time to the _once ruleset. We don't actually want to run this bundle of message-sending Nagios checks to run on every single server.	2019-06-16 15:48:35 -07:00
Tim Abbott	df83979c76	zulip_ops: Extract a prod_app_frontend_once ruleset.	2019-06-16 15:48:35 -07:00
Tim Abbott	738cfe54c3	puppet: Move app_frontend_once out of prod configuration. That logic made it inconvenient to run multiple prod servers with the same top-level puppet configuration.	2019-06-16 15:24:20 -07:00
Tim Abbott	e85250941d	puppet: Fix quoting of commented-out python3-boto. This will avoid a linter error if/when we uncomment it.	2019-06-13 14:39:24 -07:00
Tim Abbott	337efe0fb7	puppet: Remove puppet-el, which no longer exists. This package was only every available on Ubuntu Xenial.	2019-06-13 14:39:24 -07:00
Vishnu Ks	ecdd3bea43	billing: Add cron job to run invoice_plans once a day. Fixes #11960	2019-04-29 11:23:17 -07:00
Anders Kaseorg	643bd18b9f	lint: Fix code that evaded our lint checks for string % non-tuple. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-23 15:21:37 -07:00
Anders Kaseorg	9f7c0b7e65	postgres_master.pp: Fix wacky su command line. The construction `su postgres -c -- bash -c 'psql …'` didn’t behave the way it reads, and only worked by accident: 1. `-c --` sets the command to `--`. 2. `bash` sets the first argument to `bash`. 3. `-c 'psql …'` replaces the command with `psql …`. Thus, `su` ended up executing `<shell> -c 'psql …' bash`, where `<shell>` is the `postgres` user’s login shell, usually also `bash`, which then executed 'psql …' and ignored the extra `bash`. Unconfuse this construction. Note from tabbott: The old code didn't even work by accident, it was just broken. The right fix is to move the quoting around properly. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-12 17:27:23 -07:00
Anders Kaseorg	649235cfec	python: Remove unused imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-22 16:54:36 -08:00
Anders Kaseorg	c109690cf8	puppet: Remove unused Python imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:02:12 -08:00
rht	9ee2ee046a	puppet: Use systemctl instead of pg_ctlcluster on CentOS.	2019-01-05 15:49:03 -08:00
Tim Abbott	047817b6b0	puppet: Disable log2zulip cron job. It hasn't been working for years, but more importantly, it spams up root's mail queue so that one can't find important things in there (e.g. the fact that the long-term-idle cron job was failing).	2019-01-05 10:56:44 -08:00
Tim Abbott	2558f101af	docs: Add documentation for `if False` mypy pattern in scripts. This should help make it clear what's going on with these scripts.	2018-12-17 11:12:53 -08:00
rht	d2aa81858c	puppet/zulip_ops: Replace apt::source with setup-apt-repo-debathena. Tweaked by tabbott to use a clearer name.	2018-12-11 13:02:56 -08:00
Tim Abbott	b218c2a70e	loadbalancer: Use same certbot cert for zulipstaging.com. This is a simple configuration improvement.	2018-12-07 13:43:21 -08:00
Tim Abbott	467694c1fa	nginx: Enable http2 in external nginx configuration. This should be a nice performance improvement for browsers that support it. We can't yet enabled this in the Zulip on-premise nginx configuration, because that still has to support Trusty.	2018-12-07 13:43:02 -08:00
Tim Abbott	5abf4dee92	nagios: Add new host groups for Tornado processes. We also move all the existing Tornado monitoring rules to the singletornado_frontends rule.	2018-11-06 16:33:18 -08:00
Tim Abbott	5f3b79c9e7	nagios: Fix tab-based whitespace.	2018-11-06 16:30:29 -08:00
Tim Abbott	dc7d44a245	puppet: Don't run calculate-first-visible-message-id on most systems. This should only be run on systems that are running zilencer, because the cron job is part of the zilencer project.	2018-10-30 11:40:24 -07:00
Tim Abbott	2c7f9ce0fc	puppet: Fix puppet-lint warnings in various manifests. Apparently, `puppet-lint` on Ubuntu trusty throws warnings for certain quoting patterns that are OK in modern `puppet-lint`. I believe the old Zulip code was actually correct (i.e. the old `puppet-lint` implementation was the problem), but it seems worth changing anyway to suppress the warnings. We also exclude more of puppet-apt from linting, since it's third-party code.	2018-08-28 13:46:31 -07:00
Tim Abbott	b53a712856	nginx: Update configuration for using certbot certs everywhere.	2018-08-22 11:59:15 -07:00
Tim Abbott	90828297e4	puppet-lint: Enforce double_quoted_strings check. This makes our puppet codebase more consistent by using single-quoted strings consistently.	2018-08-13 12:31:19 -07:00
Tim Abbott	d0b51b70f4	puppet-lint: Enforce 2sp_soft_tables puppet-lint check. This cleans up the puppet codebase's whitespace formatting to be more consistent.	2018-08-13 12:31:16 -07:00
Tim Abbott	b26e0a957d	puppet-lint: Enforce arrow_alignment check. This fixes all exceptions in our puppet codebase to this lint rule.	2018-08-13 12:30:57 -07:00
Aditya Bansal	710d4507de	puppet-lint: Fix lines longer than 140 characters lint warnings. We fix these by adding ignore statements in a bunch of files where this error popped up. We target only specific lines using the ignore statements and not the entire files.	2018-08-07 10:03:40 -07:00
Anders Kaseorg	edfd5ef992	setup_disks.sh: Fix shellcheck warnings. In puppet/zulip_ops/files/postgresql/setup_disks.sh line 15: array_name=$(mdadm --examine --scan \| sed 's/.*name=//') ^-- SC2034: array_name appears unused. Verify use (or export if used externally). Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2018-08-03 09:15:26 -07:00
Anders Kaseorg	5a0fecc2d5	munin_plugins: Fix shellcheck warnings. In puppet/zulip_ops/files/munin-plugins/rabbitmq_connections line 66: echo "connections.value $(HOME=$HOME rabbitmqctl list_connections \| grep -v "^Listing" \| grep -v "done.$" \| wc -l)" ^-- SC2126: Consider using grep -c instead of grep\|wc -l. In puppet/zulip_ops/files/munin-plugins/rabbitmq_consumers line 32: VHOST=${vhost:-"/"} ^-- SC2034: VHOST appears unused. Verify use (or export if used externally). In puppet/zulip_ops/files/munin-plugins/rabbitmq_messages line 32: VHOST=${vhost:-"/"} ^-- SC2034: VHOST appears unused. Verify use (or export if used externally). In puppet/zulip_ops/files/munin-plugins/rabbitmq_messages_unacknowledged line 32: VHOST=${vhost:-"/"} ^-- SC2034: VHOST appears unused. Verify use (or export if used externally). In puppet/zulip_ops/files/munin-plugins/rabbitmq_messages_uncommitted line 32: VHOST=${vhost:-"/"} ^-- SC2034: VHOST appears unused. Verify use (or export if used externally). In puppet/zulip_ops/files/munin-plugins/rabbitmq_queue_memory line 32: VHOST=${vhost:-"/"} ^-- SC2034: VHOST appears unused. Verify use (or export if used externally). Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2018-08-03 09:15:08 -07:00
Tim Abbott	02ae71f27f	api: Stop using API keys for Django->Tornado authentication. As part of our effort to change the data model away from each user having a single API key, we're eliminating the couple requests that were made from Django to Tornado (as part of a /register or home request) where we used the user's API key grabbed from the database for authentication. Instead, we use the (already existing) internal_notify_view authentication mechanism, which uses the SHARED_SECRET setting for security, for these requests, and just fetch the user object using get_user_profile_by_id directly. Tweaked by Yago to include the new /api/v1/events/internal endpoint in the exempt_patterns list in test_helpers, since it's an endpoint we call through Tornado. Also added a couple missing return type annotations.	2018-07-30 12:28:31 -07:00
Tim Abbott	07af59d4cc	tornado: Split get_events_backend into two functions. The lower-layer function, now called get_events_backend, is intended to be called by multiple code paths (including the upcoming get_events_internal).	2018-07-30 12:28:31 -07:00
Tim Abbott	63fe39e381	zulip_ops: Disable Ubuntu's built-in update-motd.d files. We can't really do this in the zulip manifests (since it's sorta a sysadmin policy decision), but these scripts can cause significant load when Nagios logs into a server (because many of them take 50ms or more of work to run). So we just get rid of them.	2018-05-06 18:47:40 -07:00
Tim Abbott	4e8487c886	nagios: Bump maximum processes limits. These seemed to be flapping for no good reason.	2018-05-02 11:12:47 -07:00
Tim Abbott	718492638b	puppet: Fix name for dhcpcd5 package. Apparently the name dhcpcd isn't installable.	2018-04-23 11:32:07 -07:00
Tim Abbott	35aa4f0377	puppet: Sort ensure attributes to be always first. This inconsistency was flagged by puppet-lint.	2018-04-22 23:41:49 -07:00
Tim Abbott	e103c2ff2d	puppet: Switch to modern quoted, octal file modes. This is one of the prerequisite tasks for Puppet 4 support. Constructed using puppet-lint.	2018-04-22 23:30:48 -07:00
Tim Abbott	62b12e0c34	zulip_ops: Add missing dependency on dhcpcd.	2018-04-19 14:27:48 -07:00
neiljp (Neil Pilgrim)	090b47ed19	mypy: Add explicit Optional for default=None parameters in various files.	2018-03-28 12:31:51 -07:00
neiljp (Neil Pilgrim)	f32f3cbf72	mypy: Amend zulip-ec2-configure-interfaces to avoid None.	2018-03-23 11:39:54 -07:00
Tim Abbott	d98be2f19f	puppet: Only run analytics Nagios checks on machine running cron. Running this on additional machines would be redundant; additionally, the FillState checker cron job runs only on cron systems, so this will crash on other app frontends.	2018-03-06 13:38:27 -08:00
Tim Abbott	8e8faab006	puppet: Move clearsessions cron job to app_frontend_once. While this is a different system than I'd written up in #8004, I think this is a better solution to the general problem of cron jobs to run on just one server. Fixes #8004.	2018-03-06 13:35:51 -08:00
Tim Abbott	3ae645ed12	puppet: Rename analytics.pp to app_frontend_once.pp.	2018-03-06 13:35:51 -08:00
Tim Abbott	24b6106c9c	puppet: Dsiable checking for evictions in memcached nagios. Zulip's caching model for message history is such that it is normal and healthy for there to eventually be a nontrivial volume of evictions.	2018-03-06 13:34:02 -08:00
Greg Price	4475950ddf	queue: Restore prematurely-cut upgrade path. Revert `c8f034e9a` "queue: Remove missedmessage_email_senders code." As the comment in the code says, it ensures a smooth upgrade path from 1.7.x; we can delete it in master after 1.8.0 is released. The removal commit was merged early due to a communication failure.	2018-02-28 11:15:53 -08:00
Umair Khan	c8f034e9a0	queue: Remove missedmessage_email_senders code. After `68513952fb`, all emails are sent through email_senders queue. This commit removes code related to the legacy queue.	2018-02-21 16:43:56 -08:00
Tim Abbott	005b0fb566	puppet: Clean up ssh authorized_keys configuration rules.	2018-02-09 16:37:03 -08:00
Tim Abbott	aca25b6f0a	puppet: Move ssh configuration to use notify. This handles more correctly the case where we're using the upstream sshd_config file.	2018-02-09 16:37:03 -08:00
Tim Abbott	486de8abfc	puppet: Edit some rules to support chat.zulip.org. This should make it possible to use the zulip_ops base rules successfully on chat.zulip.org. Many of the changes in this commit are hacks and probably can be cleaned up later, but given that we plan to drop trusty support soon, it's likely that most of them will simply be deleted then.	2018-02-09 16:37:03 -08:00
Rishi Gupta	1d581a9c6e	nagios: Add nagios check for analytics state. This should help us detect issues where the analytics cron jobs aren't running properly. The cron/nagios part of the implementation done by tabbott.	2018-02-09 16:36:05 -08:00
Tim Abbott	9ed2a94b8c	nagios: Add configuration designed for full-stack servers. This doesn't yet pass all Nagios checks correctly, and still has a few flaws: * The ideal setup code for the `nagios` user in the database isn't included. * Some of the other details are a bit off; we need to split some host roles. But it's better than nothing, and we can iterate from here.	2018-01-24 14:16:03 -08:00
Tim Abbott	2365b13b68	puppet: Move postgres Nagios plugin to main postgres-common. This plugins package is required in order to use Nagios checks to verify the Zulip postgres database, and thus belongs in the default package set.	2018-01-23 10:31:48 -08:00
Umair Khan	68513952fb	email-worker: Create EmailSendingWorker. This commit just copies all the code from MissedMessageSendingWorker class to a new EmailSendingWorker class. All the logic to send an email through a queue was already there. This commit only makes the logic generic. It does so by creating a special purpose queue called 'email_senders' to send any type of email. To make MissedMessageSendingWorker still work we derive it from EmailSendingWorker. All the tests that were testing MissedMessageSendingWorker now run against EmailSendingWorker.	2017-12-20 19:36:27 -08:00
Vishnu Ks	766511e519	actions: Mark all messages as read when user unsubscribes from stream. This fixes a bug where, when a user is unsubscribed from a stream, they might have unread messages on that stream leak. While it might seem to be a minor problem, it can cause significant problems for computing the `unread_msgs` data structures, since it means we need to add an extra filter for whether the user is still subscribed, either in the backend or in the UI. Fixes #7095.	2017-11-21 20:09:17 -08:00
Tim Abbott	94554c65da	certbot: Modify nginx configuration to support automated renewal.	2017-11-08 12:32:26 -08:00
Tim Abbott	62bb465896	puppet: Modify lb0 nginx configuration.	2017-11-08 12:32:26 -08:00
rht	549a26860f	refactor: Remove six.moves.range import.	2017-11-07 10:46:42 -08:00
Tim Abbott	0d1194811f	mypy: Remove ignores for a few typeshed bugs fixed upstream.	2017-10-27 17:09:00 -07:00
Tim Abbott	540cae19a8	puppet: Remove obsolete sparkle configuration. Sparkle was the auto-update system used by the legacy desktop app. We haven't been capable of using it for auto-update in years, so there's no reason to keep around the configuration. The new Electron app uses a different system anyway.	2017-10-19 16:35:55 -07:00
rht	b57289aacd	py3: Remove all `from __future__ import print_function. Except for these files: - tools/linter_lib/* - tools/lib - tools/lister.py	2017-10-18 12:07:19 -07:00
rht	2f3ae84e5a	py3: Remove all `__future__ import division`.	2017-10-17 23:09:12 -07:00
Tim Abbott	6a5cb0e48c	puppet: Make problems with Zephyr mirroring pageable. Generally this indicates sending messages is completely broken.	2017-10-12 00:16:32 -07:00
rht	de30400fc5	pg_backup_and_purge.py: Remove .py extension.	2017-10-08 15:32:43 -07:00
Tim Abbott	47c5aae5b2	log2zulip: Enforce using python 3 in cron job. We aren't guaranteed to have the Zulip dependencies installed on Python 2.	2017-10-06 16:37:17 -07:00
Tim Abbott	f2055397c1	nagios: Update apache configuration to be generated. Since this is basically just stock Apache configuration for Nagios with a hostname put in, we can just fetch the hostname from our configuration.	2017-10-05 21:51:29 -07:00
Tim Abbott	3af01bed85	puppet: Simplify zulip_ops nginx configuration. Whatever dist/ functionality this had in 2014 is now served by zulip.org, and since this serves as a sample, it should be as simple as possible. Previously, this was more cluttered than it needed to be.	2017-10-05 21:17:57 -07:00
Tim Abbott	e6e7bcf6e1	nagios: Move camo_check_url into configuration.	2017-10-05 21:09:24 -07:00
Tim Abbott	1c453fdf2a	puppet: Add redis_password file for Nagios. This allows the Nagios user to access redis without having full access to the redis system. Ideally, this would eventually use a password that only has statistics read access, but I'm not sure redis supports that.	2017-10-05 20:42:07 -07:00
Tim Abbott	13a36d9af3	puppet: Make old redis_tunnel configuration usable. This old puppet configuration was never really used, and regardless hardcoded an ancient zulip.net hostname. We fix this to use the zulipconf system to get the host domain (though not, at present, the hostname).	2017-10-05 20:40:22 -07:00
Tim Abbott	96c3014da0	nagios: Automate configuration of outgoing email with msmtp. Now we no longer need to check in a bunch of hostnames in order to configure Nagios.	2017-10-05 20:29:47 -07:00
Tim Abbott	5b4c260c3f	puppet: Add munin apache auth configuration. This is completely stock configuration, and seems to be required for munin to run properly.	2017-10-05 20:17:12 -07:00
Tim Abbott	ba7be4102e	puppet: Update munin tunnels configuration to use zulipconf. This eliminates another old hardcoding of zulip.net.	2017-10-05 20:14:43 -07:00
Tim Abbott	162eaf8917	nagios: Modify check for swap to allow no swap. If a machine is configured with no swap intentationally, that shouldn't be a Nagios problem. This alert is intended to flag machines which are swapping.	2017-10-05 20:07:44 -07:00
Tim Abbott	80a16bf873	nagios: Fix path to source zulip_nagios.cfg. Arguably, we should make this a symlink, but it's probably a good idea to have every change in the production Nagios configuration go through the zulip-puppet-apply diff experience.	2017-10-05 20:06:50 -07:00
Tim Abbott	886a8853ac	nagios: Move server-specific config into hostgroups. These new hostgroups exist so we can eliminate explicit references to individual hosts in services.cfg.	2017-10-05 20:06:48 -07:00
Tim Abbott	b6ce9583a9	nagios: Fetch list of hosts from zulip.conf. This makes this much more configurable and much less hardcoded.	2017-10-05 20:06:30 -07:00
Tim Abbott	5193936bc3	nagios: Add Memcached and Redis monitoring. These are standard Nagios plugins that might be sometimes helpful.	2017-10-05 20:06:16 -07:00
Tim Abbott	f7d554d533	nagios: Rename zmirror2 to zmirrorp in configuration. The "p" stands for "personals", aka zephyr private messages, which is what this host manages.	2017-10-05 20:06:08 -07:00
Tim Abbott	062d280914	puppet: Clean up unnecessary pagerduty_nagios.cfg.	2017-10-05 19:23:33 -07:00
Tim Abbott	7e328ba865	nagios: Move email addresses for contacts into variables.	2017-10-05 19:23:33 -07:00
Tim Abbott	6017d3dec5	puppet: Move contacts.cfg to be a template.	2017-10-05 19:23:33 -07:00
Tim Abbott	09aec3e467	puppet: Move hosts.cfg to be managed by a template.	2017-10-05 19:23:33 -07:00
Tim Abbott	692f4b77d1	puppet: Remove messy Nagios crontab.	2017-10-05 19:23:33 -07:00
Tim Abbott	26982ff55f	puppet: Remove pageduty_nagios.pl. This hasn't been used in like 4 years, and clutters the repo.	2017-10-05 18:46:09 -07:00
Tim Abbott	5a80c029a2	nagios: Update path to sync_public_streams to match new config.	2017-10-05 13:34:27 -07:00
Tim Abbott	fdd021fd6a	zephyr-mirror: Update supervisor configuration for repository split. This now points to the path of the integration in the new package.	2017-10-05 13:18:37 -07:00
Tim Abbott	1eff717146	zephyr-mirror: Update cron job to use python-zulip-api. This is a deferred follow-up project to the repository split.	2017-10-05 13:07:45 -07:00
Greg Price	d02101a401	APNs: Rip out the existing, broken implementation. This code empirically doesn't work. It's not entirely clear why, even having done quite a bit of debugging; partly because the code is quite convoluted, and because it shows the symptoms of people making changes over time without really understanding how it was supposed to work. Moreover, this code targets an old version of the APNs provider API. Apple deprecated that in 2015, in favor of a shiny new one which uses HTTP/2 to meet the same needs for concurrency and scale that the old one had to do a bunch of ad-hoc protocol design for. So, rip this code out. We'll build a pathway to the new API from scratch; it's not that complicated.	2017-08-26 14:16:05 -07:00
Greg Price	a099e698e2	py3: Switch almost all shebang lines to use `python3`. This causes `upgrade-zulip-from-git`, as well as a no-option run of `tools/build-release-tarball`, to produce a Zulip install running Python 3, rather than Python 2. In particular this means that the virtualenv we create, in which all application code runs, is Python 3. One shebang line, on `zulip-ec2-configure-interfaces`, explicitly keeps Python 2, and at least one external ops script, `wal-e`, also still runs on Python 2. See discussion on the respective previous commits that made those explicit. There may also be some other third-party scripts we use, outside of this source tree and running outside our virtualenv, that still run on Python 2.	2017-08-16 17:54:43 -07:00
Greg Price	e4d1d22e9f	py3: Explicitly keep our wal-e PostgreSQL replication on Python 2. On `trusty` there is no package for `boto` or `gevent` on Python 3, both of which are dependencies of `wal-e` (at the version we've pinned.) This is something used only on database servers and only in a replication scenario, and it doesn't involve any of our code outside the wal-e repo, so the Python version it uses is quite independent of the Zulip application server itself and the rest of our code. For now, keep it explicitly on Python 2 while we move forward for most everything else.	2017-08-15 17:30:31 -07:00
Greg Price	2a4d851a7c	py3: Explicitly keep one boto-using ops script on Python 2. This script in `zulip_ops` is handy for managing EC2 instances. It uses `boto`, which isn't available in `trusty` for Python 3. The use of `boto` here isn't particularly deep, so we could replace it with some more manual HTTP calls if it comes to that. For now, just mark it to stay on Python 2 while we move the app and all the rest of the ops code (except this and another straggler or two) to Python 3. Also make a comment on this package in the Puppet manifest clearer about what it specifically refers to.	2017-08-15 17:30:31 -07:00
Greg Price	61666a9262	zulip_ops: Delete the long-disused `stats1.zulip.net` config and its dependencies. This consists of the `zulip_ops::stats` Puppet class, which has apparently not been used since 2014, and a number of files that I believe were only used for that. Also a couple of tiny loose ends in other files.	2017-08-15 17:30:31 -07:00
Greg Price	0042fc0c19	py3: Move `python-gevent` dependency to narrow its scope. This is only actually used in our `wal-e` setup, which is in zulip_ops::postgres_common. (In fact the only mentions of `gevent` in our whole Git history are for `wal-e`.) So remove where we mention it on the broader zulip::postgres_common module, and move it where it's needed. This follows up on `98cef0ab4` by eliminating the only dependency outside of the `zulip_ops` Puppet tree on a system Python-library package which isn't available in `trusty` for Python 3.	2017-08-15 17:30:31 -07:00
Greg Price	98cef0ab48	py3: Augment all mentions of system Python packages to include Python 3. In some of these contexts, we may still be using the Python 2 version, but at least this should eliminate running into `ImportError`s one by one in scripts that run outside a virtualenv, as we update their shebangs to refer to Python 3. Several Python libraries we use don't come in Python 3 versions on trusty: gevent, boto, twisted, django, django-tagging, whisper. The latter two don't come in Python 3 versions even on xenial. So some work required before we can actually switch the code that relies on those libraries to run as Python 3 -- probably the best solution will be to backport them all in our apt repo. (All but `whisper` are packaged in zesty; `whisper` upstream just grew Python 3 support this year.)	2017-08-09 14:07:05 -07:00
Greg Price	b8089bdd1c	api: Update log2zulip cron job to find the script at its new path.	2017-07-31 21:24:02 -07:00
Greg Price	c127630dcf	Delete some obsolete usage-stats tools. These are no longer useful, with our spiffy new analytics framework, and we haven't in fact been using them for some time, while the `active-user-stats` cron job does cause regular mail from cron. Just delete them.	2017-07-31 17:06:15 -07:00
neiljp (Neil Pilgrim)	8cabce9f5e	mypy: For EC2, Ensure to_configure is passed a not-None argument.	2017-07-17 16:57:42 -07:00
neiljp (Neil Pilgrim)	ba51958c40	mypy: For EC2, pre-assign address & gateway to enable assertion.	2017-07-17 16:57:42 -07:00
neiljp (Neil Pilgrim)	fd941e8f88	mypy: For EC2, make guess_gateway return None if address is None.	2017-07-17 16:57:42 -07:00
Aditya Bansal	5989c88545	pep8: Make compliant zulip-ec2-configure-interfaces with rule E261.	2017-05-31 17:07:15 -07:00
Aditya Bansal	6b8e85e065	pep8: Make compliant check_zephyr_mirror with rule E261.	2017-05-31 17:07:15 -07:00
Aditya Bansal	49ae51f23a	pep8: Make compliant check_user_zephyr_mirror_liveness with rule E261.	2017-05-31 17:07:15 -07:00
Aditya Bansal	6d0927ed0b	pep8: Add compliance with rule E261 to check_personal_zephyr_mirrors.	2017-05-31 17:07:15 -07:00
Tim Abbott	c5bc1265f3	bots: Move log2zulip into api/integrations.	2017-05-26 15:15:56 -07:00
Tim Abbott	55f69c677a	puppet: Remove obsolete zuliprc.nagios file. This hasn't done anything for years.	2017-05-26 15:14:12 -07:00
Reid Barton	ccb4c5c26f	bots: Move zephyr-related files to api/integrations/zephyr/.	2017-05-26 15:07:02 -07:00
Elliott Jin	0ec9e54954	bots: Add queue and QueueProcessingWorker for embedded bots.	2017-05-25 15:00:51 -07:00
Tim Abbott	1d1b0894a3	zulip_ops: Add logrotate configuration from main zulip.	2017-05-15 21:54:35 -07:00
Tim Abbott	e049ea01b1	puppet: Update munin configuration to work with modern munin.	2017-05-15 21:49:53 -07:00
Tim Abbott	6871c227cd	puppet: Add missing restarts of Nagios on config updates.	2017-05-15 21:49:53 -07:00
vaibhav	8881b5eb9f	Outgoing Webhook System: Check for @-mentioned outgoing webhook bots. Also puts them into a processing queue, though the queue processor does nothing. Rewritten by tabbott to avoid unnecessary database queries in do_send_messages.	2017-05-02 09:22:04 -07:00
hackerkid	b2504084ab	Replace timezone.now with timezone_now.	2017-04-16 12:28:56 -07:00
Tim Abbott	6a5e98b77e	puppet: Increase MaxStartups SSH configuration.	2017-03-08 22:28:16 -08:00
K.Kanakhin	6a801db1c2	missed-emails-sending: Move email sending to separate queue worker. - Add new 'missedmessage_email_senders' queue for sending missed messages emails. - Add the new worker to process 'missedmessage_email_senders' queue. - Split aggregation missed messages and sending missed messages email to separate queue workers. - Adapt tests for sending missed emails to the new logic. Fixes #2607	2017-03-07 20:08:40 -08:00
Raghav Jajodia	a3a03bd6a5	mypy: Added Dict, List and Set imports. Fixed mypy errors associated with the upgrade.	2017-03-04 14:33:44 -08:00
Rishi Gupta	3d07ac0c49	Change timezone-naive datetimes to use timezone.now() where safe to do so. Change timezone-naive datetimes to use timezone.now() in cases where there is no change in behavior.	2017-03-01 22:54:28 -08:00
Tim Abbott	fa8045a484	puppet: Add websockets Nagios test to configuration. Since browser clients send messages via websockets and not the API, this is an important element in making sure mission-critical Zulip functionality is working.	2017-02-08 11:13:19 -08:00
Tim Abbott	ba5f454be5	puppet: Extract zulip::analytics. I'm not altogether happy with this (a better solution would be database-level locking), but I think it solves the immediate problem of folks with 2 servers being very likely to run analytics on both of them.	2017-02-07 12:29:15 -08:00
Tim Abbott	36d54cf5ff	Replace references to zulip.com/dist with zulip.org/dist. Now that zulip.org has all the files to distribute, there's no reason to still point to the soon-to-be-decommissioned zulip.com/dist.	2017-01-28 17:56:25 -08:00
Tim Abbott	4e171ce787	lint: Clean up E126 PEP-8 rule.	2017-01-23 22:06:13 -08:00
Tim Abbott	d6e38e2a5c	lint: Clean up E123 PEP-8 rule.	2017-01-23 21:34:26 -08:00
JefftheBest1	9de75f5167	Fixed typos with separate	2017-01-12 04:52:05 -08:00
JefftheBest1	ff8639f9db	Fixed typos with threshold.	2017-01-12 04:50:20 -08:00
JefftheBest1	5008f45112	Fixed typo in munin.conf.erb	2017-01-12 04:49:19 -08:00
Tim Abbott	3e32102016	nagios: Fix various critical issues not tagged as pageable.	2017-01-06 21:49:20 -08:00
Tim Abbott	edebf7619b	puppet: Add PAM common_session disabling systemd-login. This fixes a weird problem with systemd where logging into a server via ssh frequently has a 15s+ lag.	2017-01-06 21:49:15 -08:00
Tim Abbott	93c2c19775	nagios: Increase process count limits.	2017-01-06 21:49:15 -08:00
Tim Abbott	2c6cb37385	munin: Add default munin configuration template.	2017-01-06 21:44:57 -08:00
Tim Abbott	9ab8e7ba34	nagios: Disable swap checks for servers with no swap.	2017-01-06 21:39:07 -08:00
Tim Abbott	3e01ed1f73	nagios: Increase NTP max_check_attempts. NTP often suffers from brief interruptions of service that lead to spurious Nagios alerts; it makes sense to suppress these.	2017-01-06 21:32:43 -08:00
Tim Abbott	e4420b08d2	zulip_ops: Disable unattended upgrades of security packages. Since Zulip does not handle e.g. postgres server restarts gracefully, it's best for a system administrator to manually trigger security updates.	2017-01-06 21:30:56 -08:00
Tim Abbott	6f9c73d0e5	zmirror: Update Debathena release in configuration. The zulip_ops configuration is now for xenial, not obsolete wheezy.	2017-01-06 21:30:41 -08:00
Tim Abbott	bd9176d1d9	nagios: Remove some default files. Nagios ships with a bunch of default configuration files that one needs to delete in order to configure it.	2017-01-06 21:25:12 -08:00

1 2 3 4 5 ...

303 Commits