# Schema Migrations

Zulip uses the [standard Django system for doing schema
migrations](https://docs.djangoproject.com/en/1.10/topics/migrations/).
There is some example usage in the [new feature
tutorial](../tutorials/new-feature-tutorial.md).

This page documents some important issues related to writing schema
migrations.

* If your database migration is just to reflect new fields in
  `models.py`, you'll typically want to just:
  * Rebase your branch before you start (this may save work later).
  * Update the model class definitions in `zerver/models.py`.
  * Run `./manage.py makemigrations` to generate a migration file.
  * Rename the migration file to have a descriptive name if Django
    generated a date-based name like `0089_auto_20170710_1353.py`
    (which happens when the changes are to multiple models and Django
    cannot derive a descriptive name on its own).
  * `git add` the new migration file.
  * Run `tools/provision` to apply the migrations to your local
    database.
  * Commit your changes.
* For more complicated migrations where you need to run custom Python
  code as part of the migration, it's best to read past migrations to
  understand how to write them well.  `git grep RunPython
  zerver/migrations/02*` will find many good examples.  Before writing
  migrations of this form, you should read Django's docs and the
  sections below.
* **Numbering conflicts across branches**: If you've done your schema
  change in a branch, and meanwhile another schema change has taken
  place, Django will now have two migrations with the same
  number. There are two easy ways to fix this:
  * If your migrations were automatically generated using `manage.py
    makemigrations`, a good option is to just remove your migration
    and rerun the command after rebasing.  If you have a multi-commit
    branch, remember to use `git rebase` to make this change in the
    commit that changed `models.py`.
  * If you wrote code as part of preparing your migrations, or prefer
    this workflow, you can run `./tools/renumber-migrations`,
    which renumbers your migration(s) and fixes up the "dependencies"
    entries in your migration(s).  The tool currently prompts more
    often than necessary, but it updates the working tree for
    you automatically (you still need to do all the `git add`
    commands, etc.).

* **Large tables**: For our very largest tables (e.g. Message and
  UserMessage), we often need to take precautions when adding columns
  to the table, performing data backfills, or building indexes. We
  have a `zerver/lib/migrate.py` library to help with adding columns
  and backfilling data.  Indexes on these tables should be built
  using SQL with PostgreSQL's `CONCURRENTLY` keyword.

* **Atomicity**.  By default, each Django migration is run atomically
  inside a transaction.  This can be problematic if one wants to do
  something in a migration that touches a lot of data and would best
  be done in batches of e.g. 1000 objects (e.g. a `Message` or
  `UserMessage` table change).  There is a [useful Django
  feature][migrations-non-atomic] that makes it possible to add
  `atomic=False` at the top of a `Migration` class and thus not have
  the entire migration in a transaction.  This should make it possible
  to use the batch update tools in `zerver/lib/migrate.py` (originally
  written to work with South) for doing larger database migrations.

* **Accessing code and models in RunPython migrations**. When writing
  a migration that includes custom Python code (aka `RunPython`), you
  almost never want to import code from `zerver` or anywhere else in
  the codebase. If you imagine the process of upgrading a Zulip
  server, it goes as follows: first a server admin checks out a recent
  version of the code, and then runs any migrations that were added
  between the last time they upgraded and the current checkout. Note
  that this means each migration is run using the code in the server
  admin's checkout, and not the code that was there at the
  time the migration was written. This can be a difference of
  thousands of commits for installations that are only upgraded
  occasionally. It is hard to reason about the effect of a code change
  on a migration that imported it so long ago, so we recommend just
  copying any code you're tempted to import into the migration file
  directly, and we have a linter rule enforcing this.

  There is one special case where this doesn't work: you can't copy
  the definition of a model (like `Realm`) into a migration, and you
  can't import it from `zerver.models` for the reasons above. In this
  situation you should use Django's `apps.get_model` to get access to
  a model as it is at the time of a migration. Note that this will
  work for doing something like `Realm.objects.filter(..)`, but
  shouldn't be used for accessing properties like `Realm.subdomain` or
  anything not related to the Django ORM.

* **Making large migrations work**.  Major migrations should have a
  few properties:

  * **Unit tests**.  You'll want to carefully test these, so you might
    as well write some unit tests to verify the migration works
    correctly, rather than doing everything by hand.  This often saves
    a lot of time in re-testing the migration process as we make
    adjustments to the plan.
  * **Run in batches**.  Updating more than 1K-10K rows (depending on
    type) in a single transaction can lock up a database.  It's best
    to do lots of small batches, potentially with a brief sleep in
    between, so that we don't block other operations from finishing.
  * **Rerunnability/idempotency**.  Good migrations are ones where if
    operational concerns (e.g. it taking down the Zulip server for
    users) interfere with it finishing, it's easy to restart the
    migration without doing a bunch of hand investigation.  Ideally,
    the migration can even continue where it left off, without needing
    to redo work.
  * **Multi-step migrations**.  For really big migrations, one wants
    to split the transition into several commits that are each
    individually correct, and can each be deployed independently:

    1. First, do a migration to add the new column to the Message table
       and start writing to that column (but don't use it for anything).
    2. Second, do a migration to copy values from the old column to
       the new column, to ensure that the two data stores agree.
    3. Third, a commit that stops writing to the old field.
    4. Any cleanup work, e.g. if the old field were a column, we'd do
       a migration to remove it entirely here.

    This multi-step process is how most migrations on large database
    tables are done in large-scale systems, since it ensures that the
    system can continue running happily during the migration.
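
The batching and rerunnability properties above can be illustrated
with a minimal sketch.  It uses Python's standard `sqlite3` module
rather than Django or `zerver/lib/migrate.py` so that it runs on its
own; the `message` table, its `old_value` and `new_value` columns, and
the `backfill_in_batches` helper are all hypothetical names made up
for this illustration.

```python
import sqlite3
import time


def backfill_in_batches(
    conn: sqlite3.Connection, batch_size: int = 1000, sleep_seconds: float = 0.0
) -> int:
    """Copy old_value into new_value in small transactions.

    Returns the number of rows updated by this call.  Assumes old_value
    is NOT NULL, so a NULL new_value always means "not yet backfilled".
    """
    total = 0
    while True:
        # Select only rows that still need work, so rerunning after an
        # interruption continues where the previous run stopped.
        ids = [
            row[0]
            for row in conn.execute(
                "SELECT id FROM message WHERE new_value IS NULL "
                "ORDER BY id LIMIT ?",
                (batch_size,),
            )
        ]
        if not ids:
            return total
        with conn:  # one short transaction per batch
            cursor = conn.execute(
                "UPDATE message SET new_value = old_value "
                "WHERE id BETWEEN ? AND ? AND new_value IS NULL",
                (ids[0], ids[-1]),
            )
        total += cursor.rowcount
        # Brief pause so other operations get a chance to run.
        time.sleep(sleep_seconds)


# Hypothetical demo table standing in for a large table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE message ("
    "id INTEGER PRIMARY KEY, old_value TEXT NOT NULL, new_value TEXT)"
)
conn.executemany(
    "INSERT INTO message (old_value) VALUES (?)",
    [(f"m{i}",) for i in range(2500)],
)
conn.commit()

print(backfill_in_batches(conn, batch_size=1000))  # 2500
print(backfill_in_batches(conn, batch_size=1000))  # 0: nothing left to redo
```

Because each batch commits separately and the query skips rows that
are already done, stopping the process midway loses at most one batch
of work.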

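The advice above on accessing models from `RunPython` code can be
sketched as follows.  This is only an illustration, not code from the
Zulip repository: the function name, the `description` field, and the
default value are hypothetical.  Only the function body is shown, so
the block has no Django import and can be read or exercised on its
own.

```python
from typing import Any

# Copied into the migration file rather than imported from zerver, so
# later changes to the codebase cannot alter what this migration does.
DEFAULT_DESCRIPTION = "No description yet."  # hypothetical value


def fill_empty_descriptions(apps: Any, schema_editor: Any) -> None:
    # apps.get_model returns the model as it exists at this point in the
    # migration history, not the current class from zerver.models.
    Realm = apps.get_model("zerver", "Realm")
    # Stick to ORM operations; custom properties and methods defined on
    # the real model class are not available on this historical model.
    Realm.objects.filter(description="").update(description=DEFAULT_DESCRIPTION)
```

In a real migration file, a function like this would be listed in the
`operations` of the `Migration` class as
`migrations.RunPython(fill_empty_descriptions, reverse_code=migrations.RunPython.noop)`,
with `atomic = False` set on the class if the work is done in batches.
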
## Automated testing for migrations

Zulip has support for writing automated tests for your database
migrations, using the `MigrationsTestCase` test class.  This system is
inspired by [a great blog post][django-migration-test-blog-post] on
the subject.

We have integrated this system with our test framework so that if you
use the `use_db_models` decorator, you can use some helper methods
from `test_classes.py` and friends from inside the tests (which is
normally not possible in Django's migrations framework).

If you find yourself writing logic in a `RunPython` migration, we
highly recommend adding a test using this framework.  We may end up
deleting the test later (these tests can get slow once they are many
migrations away from current), but it can help prevent a disaster where
an incorrect migration messes up a database in a way that's impossible
to undo without going to backups.

[django-migration-test-blog-post]: https://www.caktusgroup.com/blog/2016/02/02/writing-unit-tests-django-migrations/
[migrations-non-atomic]: https://docs.djangoproject.com/en/1.10/howto/writing-migrations/#non-atomic-migrations

## Schema and initial data changes

If you follow the processes described above, `tools/provision` and
`tools/test-backend` should detect any changes to the declared
migrations and run migrations (`./manage.py migrate`) or rebuild
the relevant database automatically as appropriate.

While developing migrations, you may accidentally corrupt
your databases while debugging your new code.
You can always rebuild these databases from scratch.

Use `tools/rebuild-test-database` to rebuild the database
used for `test-backend` and other automated tests.

Use `tools/rebuild-dev-database` to rebuild the database
used in [manual testing](../development/using.md).