zulip

Commit Graph

Author	SHA1	Message	Date
Alex Vandiver	359f37389a	puppet: Remove in-nagios auth restrictions. `51b985b40d` made nagios only accessible from localhost, or as proxied via teleport. Remove the HTTP-level auth requirements.	2021-06-07 16:17:45 -07:00
Alex Vandiver	51b985b40d	puppet: Move nagios to behind teleport. This makes the server only accessible via localhost, by way of the Teleport application service.	2021-06-02 18:38:38 -07:00
Alex Vandiver	c9141785fd	puppet: Use concat fragments to place port allows next to services. This means that services will only open their ports if they are actually run, without having to clutter rules.v4 with a log of `if` statements. This does not go as far as using `puppetlabs/firewall`[1] because that would represent an additional DSL to learn; raw IPtables sections can easily be inserted into the generated iptables file via `concat::fragment` (either inline, or as a separate file), but config can be centralized next to the appropriate service. [1] https://forge.puppet.com/modules/puppetlabs/firewall	2021-05-27 21:14:48 -07:00
Alex Vandiver	9ea86c861b	puppet: Add a nagios alert configuration for smokescreen. This verifies that the proxy is working by accessing a highly-available website through it. Since failure of this equates to failures of Sentry notifications and Android mobile push notifications, this is a paging service.	2021-03-18 10:11:15 -07:00
Alex Vandiver	a215c83c2d	puppet: Switch to more explicit variable rather than reuse a nagios one. Redis is not nagios, and this only leads to confusion as to why there is a nagios domain setting on frontend servers; it also leaves the `redis0` part of the name buried in the template. Switch to an explicit variable for the redis hostname.	2021-03-10 11:44:54 -08:00
Alex Vandiver	d938dd9d4a	puppet: Document smokescreen installation, and move to puppet/zulip/. This is more broadly useful than for just Kandra; provide documentation and means to install Smokescreen for stand-alone servers, and motivate its use somewhat more.	2021-03-02 17:16:38 -08:00
Alex Vandiver	32149c6a1c	puppet: Add ksplice uptrack for kernel hotpatches.	2021-02-25 18:05:47 -08:00
Alex Vandiver	0b736ef4cf	puppet: Remove puppet_ops configuration for separate loadbalancer host.	2021-02-22 16:05:13 -08:00
Alex Vandiver	e30b524896	iptables: Limit smokescreen port 4750, add camo port. Limit incoming connections to port 4750 to only the smokescreen host, and also allow access to the Camo server on that host, on port 9292.	2021-02-17 13:52:38 -08:00
Alex Vandiver	29f60bad20	smokescreen: Put the version into the supervisorctl command. This makes it reload correctly if the version is changed.	2021-02-16 08:12:31 -08:00
Alex Vandiver	45f6c79c4a	puppet: Rename postgres_ variables to postgresql_.	2020-10-28 11:51:52 -07:00
Alex Vandiver	e124324050	puppet: Rename postgres_appdb in nagios to postgresql.	2020-10-28 11:51:52 -07:00
Alex Vandiver	78b92a51cc	puppet: Allow access to smokescreen port via iptables.	2020-10-15 15:18:35 -07:00
Alex Vandiver	0d5356969e	puppet: Reformat ipv4 iptables rules comments.	2020-10-15 15:18:35 -07:00
Alex Vandiver	24383a5082	puppet: Rename hosts_domain so hosts_prefix can be grepped for.	2020-07-10 00:14:09 -07:00
Alex Vandiver	f8fc3a16eb	puppet: Use "primary" / "replica" consistently in comments. The style guide for Zulip is to always use "primary" and "replica" when describing database replication. Adjust a few comments under `puppet/` that do not adhere to this. Unfortunately, some references still remain to the insensitive and inaccurate "master" / "slave" terminology. However, these are only in files which we are attempting to preserve as close to the upstream versions they are derived from (e.g. postgresql.conf, postfix/master.cf).	2020-06-15 16:18:07 -07:00
Alex Vandiver	7d4a370a57	puppet: Move monitoring of pg replication to the pg hosts. Instead of SSH'ing around to them, run directly on the database hosts. This means that the replicas do not know how many bytes behind they are in _receiving_ the wall logs; thus, the monitoring also extends to the primary database, which knows that information for each replica. This also allows for detecting when there are too few active replicas.	2020-06-15 16:18:07 -07:00
Alex Vandiver	8b1d49dbc7	puppet: Rename "wiki" realm to "monitoring". This is vestigial. It requires manually altering the `htdigest` file (not stored in this repo) to change the digest realm from `wiki` to `monitoring`, and will re-prompt users for their passwords if the browsers currently store them.	2020-05-30 12:26:21 -07:00
Tim Abbott	cfbb617f5c	puppet: Update nagios configuration for checking local disk.	2020-04-16 17:48:36 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
Anders Kaseorg	becef760bf	cleanup: Delete leading newlines. Previous cleanups (mostly the removals of Python __future__ imports) were done in a way that introduced leading newlines. Delete leading newlines from all files, except static/assets/zulip-emoji/NOTICE, which is a verbatim copy of the Apache 2.0 license. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-06 23:29:11 -07:00
Tim Abbott	5abf4dee92	nagios: Add new host groups for Tornado processes. We also move all the existing Tornado monitoring rules to the singletornado_frontends rule.	2018-11-06 16:33:18 -08:00
Tim Abbott	4e8487c886	nagios: Bump maximum processes limits. These seemed to be flapping for no good reason.	2018-05-02 11:12:47 -07:00
Tim Abbott	9ed2a94b8c	nagios: Add configuration designed for full-stack servers. This doesn't yet pass all Nagios checks correctly, and still has a few flaws: * The ideal setup code for the `nagios` user in the database isn't included. * Some of the other details are a bit off; we need to split some host roles. But it's better than nothing, and we can iterate from here.	2018-01-24 14:16:03 -08:00
Tim Abbott	f2055397c1	nagios: Update apache configuration to be generated. Since this is basically just stock Apache configuration for Nagios with a hostname put in, we can just fetch the hostname from our configuration.	2017-10-05 21:51:29 -07:00
Tim Abbott	e6e7bcf6e1	nagios: Move camo_check_url into configuration.	2017-10-05 21:09:24 -07:00
Tim Abbott	13a36d9af3	puppet: Make old redis_tunnel configuration usable. This old puppet configuration was never really used, and regardless hardcoded an ancient zulip.net hostname. We fix this to use the zulipconf system to get the host domain (though not, at present, the hostname).	2017-10-05 20:40:22 -07:00
Tim Abbott	96c3014da0	nagios: Automate configuration of outgoing email with msmtp. Now we no longer need to check in a bunch of hostnames in order to configure Nagios.	2017-10-05 20:29:47 -07:00
Tim Abbott	ba7be4102e	puppet: Update munin tunnels configuration to use zulipconf. This eliminates another old hardcoding of zulip.net.	2017-10-05 20:14:43 -07:00
Tim Abbott	886a8853ac	nagios: Move server-specific config into hostgroups. These new hostgroups exist so we can eliminate explicit references to individual hosts in services.cfg.	2017-10-05 20:06:48 -07:00
Tim Abbott	b6ce9583a9	nagios: Fetch list of hosts from zulip.conf. This makes this much more configurable and much less hardcoded.	2017-10-05 20:06:30 -07:00
Tim Abbott	f7d554d533	nagios: Rename zmirror2 to zmirrorp in configuration. The "p" stands for "personals", aka zephyr private messages, which is what this host manages.	2017-10-05 20:06:08 -07:00
Tim Abbott	062d280914	puppet: Clean up unnecessary pagerduty_nagios.cfg.	2017-10-05 19:23:33 -07:00
Tim Abbott	7e328ba865	nagios: Move email addresses for contacts into variables.	2017-10-05 19:23:33 -07:00
Tim Abbott	6017d3dec5	puppet: Move contacts.cfg to be a template.	2017-10-05 19:23:33 -07:00
Tim Abbott	09aec3e467	puppet: Move hosts.cfg to be managed by a template.	2017-10-05 19:23:33 -07:00
Tim Abbott	e049ea01b1	puppet: Update munin configuration to work with modern munin.	2017-05-15 21:49:53 -07:00
JefftheBest1	5008f45112	Fixed typo in munin.conf.erb	2017-01-12 04:49:19 -08:00
Tim Abbott	2c6cb37385	munin: Add default munin configuration template.	2017-01-06 21:44:57 -08:00
Tim Abbott	4fbe201187	puppet: Automate autossh process monitoring maintenance. Previously, the Zulip Nagios configuration effectively hardcoded the count for how many system should have autossh connections.	2016-10-26 00:49:03 -07:00
Tim Abbott	0a5a2c4eda	nagios: Automate authorized users list maintenance.	2016-10-26 00:37:29 -07:00
Tim Abbott	36e336edc3	puppet: Rename zulip_internal to zulip_ops. The old "zulip_internal" name was from back when Zulip, Inc. had two distributions of Zulip, the enterprise distribution in puppet/zulip/ and the "internal" SAAS distribution in puppet/zulip_internal. I think the name is a bit confusing in the new fully open-source Zulip work, so we're replacing it with "zulip_ops". I don't think the new name is perfect, but it's better. In the following commits, we'll delete a bunch of pieces of Zulip, Inc.'s infrastructure that don't exist anymore and thus are no longer useful (e.g. the old Trac configuration), with the goal of cleaning the repository of as much unnecessary content as possible.	2016-10-16 19:23:27 -07:00

42 Commits