zulip/docs/subsystems/queuing.md

# Queue processors

Zulip uses RabbitMQ to manage a system of internal queues. These are
used for a variety of purposes:

- Asynchronously doing expensive operations like sending email
  notifications which can take seconds per email and thus would
  otherwise time out when 100s are triggered at once (E.g. inviting a
  lot of new users to a realm).

- Asynchronously doing non-time-critical somewhat expensive operations
  like updating analytics tables (e.g. UserActivityInternal) which
  don't have any immediate runtime effect.

- Communicating events to push to clients (browsers, etc.) from the
  main Zulip Django application process to the Tornado-based events
  system. Example events might be that a new message was sent, a user
  has changed their subscriptions, etc.

- Processing mobile push notifications and email mirroring system
  messages.

- Processing various errors, frontend tracebacks, and slow database
  queries in a batched fashion.

Needless to say, the RabbitMQ-based queuing system is an important
part of the overall Zulip architecture, since it's in critical code
paths for everything from signing up for account, to rendering
messages, to delivering updates to clients.

We use the `pika` library to interface with RabbitMQ, using a simple
custom integration defined in `zerver/lib/queue.py`.

### Adding a new queue processor

To add a new queue processor:

- Define the processor in `zerver/worker/` using the `@assign_queue` decorator;
  it's pretty easy to get the template for an existing similar queue
  processor. This suffices to test your queue worker in the Zulip development
  environment (`tools/run-dev` will automatically restart the queue processors
  and start running your new queue processor code). You can also run a single
  queue processor manually using e.g. `./manage.py process_queue --queue=user_activity`.

- So that supervisord will know to run the queue processor in
  production, you will need to add to the `queues` variable in
  `puppet/zulip/manifests/app_frontend_base.pp`; the list there is
  used to generate `/etc/supervisor/conf.d/zulip.conf`.

The queue will automatically be added to the list of queues tracked by
`scripts/nagios/check-rabbitmq-consumers`, so Nagios can properly
check whether a queue processor is running for your queue. You still
need to update the sample Nagios configuration in `puppet/kandra`
manually.

### Publishing events into a queue

You can publish events to a RabbitMQ queue using the
`queue_json_publish` function defined in `zerver/lib/queue.py`.

An interesting challenge with queue processors is what should happen
when queued events in Zulip's backend tests. Our current solution is
that in the tests, `queue_json_publish` will (by default) simple call
the `consume` method for the relevant queue processor. However,
`queue_json_publish` also supports being passed a function that should
be called in the tests instead of the queue processor's `consume`
method. Where possible, we prefer the model of calling `consume` in
tests since that's more predictable and automatically covers the queue
processor's code path, but it isn't always possible.

### Clearing a RabbitMQ queue

If you need to clear a queue (delete all the events in it), run
`./manage.py purge_queue <queue_name>`, for example:

```bash
./manage.py purge_queue user_activity
```

You can also use the amqp tools directly. Install `amqp-tools` from
apt and then run:

```bash
amqp-delete-queue --username=zulip --password='...' --server=localhost \
   --queue=user_presence
```

with the RabbitMQ password from `/etc/zulip/zulip-secrets.conf`.
docs: Improve several headings. 2016-06-26 18:49:35 +02:00			`# Queue processors`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			`Zulip uses RabbitMQ to manage a system of internal queues. These are`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`used for a variety of purposes:`

docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- Asynchronously doing expensive operations like sending email`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`notifications which can take seconds per email and thus would`
docs: Add missing space to compound verbs “log in”, “set up”, etc. Noun: backup, checkout, cleanup, login, logout, setup, shutdown, signup, timeout. Verb: back up, check out, clean up, log in, log out, set up, shut down, sign up, time out. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-04-25 23:05:38 +02:00			`otherwise time out when 100s are triggered at once (E.g. inviting a`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`lot of new users to a realm).`

docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- Asynchronously doing non-time-critical somewhat expensive operations`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`like updating analytics tables (e.g. UserActivityInternal) which`
			`don't have any immediate runtime effect.`

docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- Communicating events to push to clients (browsers, etc.) from the`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`main Zulip Django application process to the Tornado-based events`
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			`system. Example events might be that a new message was sent, a user`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`has changed their subscriptions, etc.`

docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- Processing mobile push notifications and email mirroring system`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`messages.`

docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- Processing various errors, frontend tracebacks, and slow database`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`queries in a batched fashion.`

			`Needless to say, the RabbitMQ-based queuing system is an important`
			`part of the overall Zulip architecture, since it's in critical code`
			`paths for everything from signing up for account, to rendering`
			`messages, to delivering updates to clients.`

			We use the `pika` library to interface with RabbitMQ, using a simple
			custom integration defined in `zerver/lib/queue.py`.

			`### Adding a new queue processor`

			`To add a new queue processor:`

worker: Split into separate files. This makes each worker faster to start up. 2024-04-16 20:49:37 +02:00			- Define the processor in `zerver/worker/` using the `@assign_queue` decorator;
			`it's pretty easy to get the template for an existing similar queue`
			`processor. This suffices to test your queue worker in the Zulip development`
			environment (`tools/run-dev` will automatically restart the queue processors
			`and start running your new queue processor code). You can also run a single`
			queue processor manually using e.g. `./manage.py process_queue --queue=user_activity`.
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00
docs: Apply bullet style changes from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:45:39 +02:00			`- So that supervisord will know to run the queue processor in`
puppet: Move normal_queues to the one place that uses it. 2020-10-20 09:28:17 +02:00			production, you will need to add to the `queues` variable in
			`puppet/zulip/manifests/app_frontend_base.pp`; the list there is
			used to generate `/etc/supervisor/conf.d/zulip.conf`.
docs: Update queuing documentation for new templates. 2017-02-19 06:36:57 +01:00
queues: Add new system for managing rabbitmq per-queue work. Our lists of rabbitmq queues was likely to end up out of date, since there was nothing enforcing that the various lists of queues were correct or the same as each other. 2017-02-17 07:16:43 +01:00			`The queue will automatically be added to the list of queues tracked by`
			`scripts/nagios/check-rabbitmq-consumers`, so Nagios can properly
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			`check whether a queue processor is running for your queue. You still`
puppet: Rename puppet/zulip_ops to puppet/kandra. This makes for easier tab-completion, and also is a bit more explicit about the expected consumer. 2024-02-06 21:40:19 +01:00			need to update the sample Nagios configuration in `puppet/kandra`
queues: Add new system for managing rabbitmq per-queue work. Our lists of rabbitmq queues was likely to end up out of date, since there was nothing enforcing that the various lists of queues were correct or the same as each other. 2017-02-17 07:16:43 +01:00			`manually.`
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00
			`### Publishing events into a queue`

			`You can publish events to a RabbitMQ queue using the`
			`queue_json_publish` function defined in `zerver/lib/queue.py`.

docs: Document the new queue_json_publish model in our unit tests. 2017-11-26 20:49:42 +01:00			`An interesting challenge with queue processors is what should happen`
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			`when queued events in Zulip's backend tests. Our current solution is`
docs: Document the new queue_json_publish model in our unit tests. 2017-11-26 20:49:42 +01:00			that in the tests, `queue_json_publish` will (by default) simple call
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			the `consume` method for the relevant queue processor. However,
docs: Document the new queue_json_publish model in our unit tests. 2017-11-26 20:49:42 +01:00			`queue_json_publish` also supports being passed a function that should
			be called in the tests instead of the queue processor's `consume`
docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			method. Where possible, we prefer the model of calling `consume` in
docs: Document the new queue_json_publish model in our unit tests. 2017-11-26 20:49:42 +01:00			`tests since that's more predictable and automatically covers the queue`
			`processor's code path, but it isn't always possible.`

Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`### Clearing a RabbitMQ queue`

			`If you need to clear a queue (delete all the events in it), run`
			`./manage.py purge_queue <queue_name>`, for example:

docs: Add syntax highlighting languages to code blocks. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 07:09:04 +02:00			```bash
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`./manage.py purge_queue user_activity`
			```

docs: Apply sentence single-spacing from Prettier. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 21:53:28 +02:00			You can also use the amqp tools directly. Install `amqp-tools` from
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`apt and then run:`

docs: Add syntax highlighting languages to code blocks. Signed-off-by: Anders Kaseorg <anders@zulip.com> 2021-08-20 07:09:04 +02:00			```bash
Add documentation on the Zulip RabbitMQ queues. 2016-04-01 07:23:05 +02:00			`amqp-delete-queue --username=zulip --password='...' --server=localhost \`
			`--queue=user_presence`
			```

			with the RabbitMQ password from `/etc/zulip/zulip-secrets.conf`.