zulip

Commit Graph

Author	SHA1	Message	Date
Zixuan James Li	7fd8d77ce0	typing: Import ValuesQuerySet alias from django_stubs_ext. This saves us from using a conditional import. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-10-05 16:15:56 -07:00
Zixuan James Li	f5f94b9cad	stream_subscription: Tighten function signatures with generic QuerySet. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-07-07 11:28:13 -07:00
Zixuan James Li	ab1bbdda65	typing: Broaden type annotations for QuerySet compatibility. To explain the rationale of this change, for example, there is `get_user_activity_summary` which accepts either a `Collection[UserActivity]`, where `QuerySet[T]` is not strictly `Sequence[T]` because its slicing behavior is different from the `Protocol`, making `Collection` necessary. Similarily, we should have `Iterable[T]` instead of `List[T]` so that `QuerySet[T]` will also be an acceptable subtype, or `Sequence[T]` when we also expect it to be indexed. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-07-07 11:27:42 -07:00
Zixuan James Li	fd9a0f4274	typing: Apply trivial none-checks with assertions as necessary. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-06-23 19:25:48 -07:00
Steve Howell	fe3295d395	performance: Avoid monster query for existing subs. Part of our codepath for subscribing users involves fetching the users' existing subscriptions to make sure we can do things like properly report to the clients that the users were already subscribed. This codepath used to be coupled to code that helped users maintain unique stream colors. Suppose you are creating a new stream, and you are importing users from an older stream with 15k subscribers, and each of your users is subscribed to about 20 streams. The prior code, instead of filtering on recipient_id, would literally look at every subscription for every user, which was kind of crazy if you didn't understand the pick-stream-color complications. Before this commit, we would fetch 300k rows with 15 columns each (granted, all but one of the columns are bool/int). That's a total of 4.5 million tiny objects that we had to glom into Django ORM objects and slice and dice. After this commit, we would fetch exactly zero rows for the are-they-already-subscribed logic. Yes, ZERO. If we were to mistakenly try to re-add the same 15k subscribers to the new stream (under the new code), we will now fetch 15k Sub rows instead of 300k. It is worth looking at the prior commit. We go through great pains to ensure that users get new stream colors when we invite them to a stream, and we still fetch a bunch of data for that. Instead of 4.5 million cells, it's more like 600k cells (2 columns per row), and it's less than that insofar as some users may only have 24 distinct colors among their many streams. It's a lot of work.	2021-12-28 12:15:02 -08:00
Steve Howell	f638fd6f72	performance: Get used stream colors in separate trip. This commit sets us up for the next commit, which will save us a very expensive query. If you are adding 15k users to a stream, and each user has about 20 existing streams, then we need to retrieve 300k rows from the database to figure out which stream colors they already have. We don't need all the extra fields from Subscription, so now we get just the two values we need for making a color map. In the next commit we'll eliminate the other use case for the big query, and I will explain in greater depth how splitting out the color-picking code can be a huge win. It is possible that some product decisions could make this codepath easier. We could also do some engineering specific to stream colors, such as caching which colors users have already used. This does cost us an extra round trip to the database.	2021-12-28 12:15:02 -08:00
Tim Abbott	0bfef96543	bulk_access_messages: Bulk fetch Subscription details. This completes the effort to make it possible to use bulk_access_message in contexts where there are more than a handful of messages without creating performance issues.	2021-05-12 16:23:22 -07:00
Steve Howell	b4470ac8e1	performance: Add get_subscriptions_for_send_message. This new function optimizes how we fetch subscriptions for streams. Basically, it excludes most long-term-idle users from the query. With 8k users, of which all but 400 are long term idle, this speeds up get_recipient_info from about 150ms to 50ms. Overall this change appears to save a factor of 2-3 in the backend processing time for sending or editing a message in large, public streams in chat.zulip.org (at 18K users today).	2021-05-12 08:10:57 -07:00
Mateusz Mandera	1a8ad796f8	models: Replace __id syntax with _id where possible. model__id syntax implies needing a JOIN on the model table to fetch the id. That's usually redundant, because the first table in the query simply has a 'model_id' column, so the id can be fetched directly. Django is actually smart enough to not do those redundant joins, but we should still avoid this misguided syntax. The exceptions are ManytoMany fields and queries doing a backward relationship lookup. If "streams" is a many-to-many relationship, then streams_id is invalid - streams__id syntax is needed. If "y" is a foreign fields from X to Y: class X: y = models.ForeignKey(Y) then object x of class X has the field x.y_id, but y of class Y doesn't have y.x_id. Thus Y queries need to be done like Y.objects.filter(x__id__in=some_list)	2021-04-22 14:53:00 -07:00
Tim Abbott	6346b9d3eb	models: Replace user_profile__is_active queries with is_user_active. This saves a couple database queries by using the recently added denormalization for Subscription objects.	2021-04-19 18:30:31 -07:00
Mateusz Mandera	ccfcc186ad	subs: Fix subscriber_..._history_access to not exclude subbed guests. Guests are supposed to have stream history access to public streams they're subscribed to.	2021-04-19 10:10:51 -07:00
Mateusz Mandera	50bfbb588e	subs: Allow filtering by is_user_active in get_active_subscriptions. get_active_subscriptions_for_stream_id should allow specifying whether subscriptions of deactivated users should be included in the result. Active subs of deactivated users are a subtlety that's easy to miss when writing relevant code, so we make include_deactivated_users a mandatory kwarg - this will force callers to definitely give thought to whether such subs should be included or not. This commit is just a refactoring, we keep original behavior everywhere - there are places where subs of deactivates users should probably be excluded but aren't - we don't fix that here, it'll be addressed in follow-up commits.	2021-04-19 10:10:51 -07:00
Tim Abbott	9f57961e5f	stream_subscription: Remove opaque reference to guest role.	2021-04-13 21:49:57 -07:00
LoopThrough-i-j	277fbb3f02	stream_subscription: Add subscribe_ids_with_stream_history_access. This new function returns the set of `user_ids` with access to the stream's full history, for use in send_event calls.	2021-04-05 13:23:11 -07:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Steve Howell	7bbcc2ac96	refactor: Compute peers for public streams later. This saves us a query for edge cases like when you try to unsubscribe from a public stream that you have already unsubscribed from. But this is mostly to prep for upcoming optimizations.	2020-10-20 11:31:22 -07:00
Steve Howell	6d1f9de7d3	performance: Use SubInfo when removing subscribers. We get two speedups: * The query to get existing subscribers only gets the two fields we need. We no longer need all the overhead of user_profile and recipient data being returned in the query. * We avoid Django making extra hops to the database to get user info.	2020-10-16 12:58:11 -07:00
Steve Howell	73982f6cc9	refactor: Move SubInfo to stream_subscription.py.	2020-10-16 12:58:11 -07:00
Steve Howell	b4346d0276	performance: Extract subscribers/peers in bulk. We replace get_peer_user_ids_for_stream_change with two bulk functions to get peers and/or subscribers. Note that we have three codepaths that care about peers: subscribing existing users: we need to tell peers about new subscribers we need to tell subscribed user about old subscribers unsubscribing existing users: we only need to tell peers who unsubscribed subscribing new user: we only need to tell peers about the new user (right now we generate send_event calls to tell the new user about existing subscribers, but this is a waste of effort that we will fix soon) The two bulk functions are this: bulk_get_subscriber_peer_info bulk_get_peers They have some overlap in the implementation, but there are some nuanced differences that are described in the comments. Looking up peers/subscribers in bulk leads to some nice optimizations. We will save some memchached traffic if you are subscribing to multiple public streams. We will save a query in the remove-subscriber case if you are only dealing with private streams.	2020-10-15 15:12:01 -07:00
Steve Howell	b894597fa3	refactor: Use sets of stream_ids for helper args.	2020-10-15 15:12:01 -07:00
Aman Agrawal	190f481f49	stream_subscription: Mark notifications disabled for web public users. Users without an account can't get notifications, so we might as well ensure any UI displays them appropriately.	2020-10-01 14:40:48 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Anders Kaseorg	fead14951c	python: Convert assignment type annotations to Python 3.6 style. This commit was split by tabbott; this piece covers the vast majority of files in Zulip, but excludes scripts/, tools/, and puppet/ to help ensure we at least show the right error messages for Xenial systems. We can likely further refine the remaining pieces with some testing. Generated by com2ann, with whitespace fixes and various manual fixes for runtime issues: - invoiced_through: Optional[LicenseLedger] = models.ForeignKey( + invoiced_through: Optional["LicenseLedger"] = models.ForeignKey( -_apns_client: Optional[APNsClient] = None +_apns_client: Optional["APNsClient"] = None - notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - signup_notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + signup_notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - author: Optional[UserProfile] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) + author: Optional["UserProfile"] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) - bot_owner: Optional[UserProfile] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) + bot_owner: Optional["UserProfile"] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) - default_sending_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) - default_events_register_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_sending_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_events_register_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) -descriptors_by_handler_id: Dict[int, ClientDescriptor] = {} +descriptors_by_handler_id: Dict[int, "ClientDescriptor"] = {} -worker_classes: Dict[str, Type[QueueProcessingWorker]] = {} -queues: Dict[str, Dict[str, Type[QueueProcessingWorker]]] = {} +worker_classes: Dict[str, Type["QueueProcessingWorker"]] = {} +queues: Dict[str, Dict[str, Type["QueueProcessingWorker"]]] = {} -AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional[LDAPSearch] = None +AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional["LDAPSearch"] = None Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-22 11:02:32 -07:00
Steve Howell	49b8218463	perf: Extract get_subscribed_stream_ids_for_user. This new method prevents us from getting fat objects from the database. Instead, now we just get ids from the database to build our subqueries. Note that we could also technically eliminate the `set(...)` wrappers in this code to have Django make a subquery and save a round trip. I am postponing that for another commit (since it's still somewhat coupled to some other complexity in `do_get_streams` that I am trying to cut through, plus it's not the main point of this commit.) BEFORE: # old, still in use for other codepaths def get_stream_subscriptions_for_user(user_profile: UserProfile) -> QuerySet: # TODO: Change return type to QuerySet[Subscription] return Subscription.objects.filter( user_profile=user_profile, recipient__type=Recipient.STREAM, ) user_subs = get_stream_subscriptions_for_user(user_profile).filter( active=True, ).select_related('recipient') recipient_check = Q(id__in=[sub.recipient.type_id for sub in user_subs]) AFTER: # newly added def get_subscribed_stream_ids_for_user(user_profile: UserProfile) -> QuerySet: return Subscription.objects.filter( user_profile_id=user_profile, recipient__type=Recipient.STREAM, active=True, ).values_list('recipient__type_id', flat=True) subscribed_stream_ids = get_subscribed_stream_ids_for_user(user_profile) recipient_check = Q(id__in=set(subscribed_stream_ids))	2020-03-01 22:38:03 -08:00
Hashir Sarwar	dcbd3e486f	stream_subscription: Remove unused TypedDict `SubInfo`.	2020-02-10 14:04:22 -08:00
Anders Kaseorg	68dd8e4ec8	mypy: Migrate from mypy_extensions to typing_extensions. This gives us access to typing_extensions.Deque, which was not added to typing until 3.5.4. (PROVISION_VERSION is not bumped because the transitive dependency set in dev.txt hasn’t changed.) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-05 17:24:09 -07:00
Yashashvi Dave	8e269b4651	models: Rename notification to `enable_stream_audible_notifications`. Rename notification property `enable_stream_sounds` to `enable_stream_audible_notifications` to match with other notification property patterns. Fixes part of #12304	2019-06-12 16:24:51 -07:00
Harshit Bansal	b553507412	subscriptions: Migrate notification setting defaults model. This commit migrates the Subscription's notification fields from a BooleanField to a NullBooleanField where a value of None means to inherit the value from user's profile. Also includes a migrations to set the corresponding settings to None if they match the user profile's values. This migration helps us in getting rid of the weird "Apply to all" widget that we offered on subscription settings page. The mobile apps can't handle None appearing as the stream-level notification settings, so for backwards-compatibility we arrange to only send True/False to the mobile apps by applying those defaults server-side. We introduce a notification_settings_null value within a client_capabilities structure that newer versions of the mobile apps can use to request the new model. This mobile compatibility code is pretty effectively tested by the existing test_events tests for the subscriptions subsystem.	2019-05-08 17:45:10 -07:00
neiljp (Neil Pilgrim)	a7bfb09067	Mypy: Use models.py QuerySet annotation approach in stream_subscription.py. Namely, annotate as best as possible, and add notes to indicate preference, if QuerySet develops generic typing. Note that the return values of functions with annotations changed in this commit are used elsewhere as QuerySets, so the Sequence[T] approach used for some functions in models.py is not applicable.	2018-03-15 12:54:43 -07:00
rht	33b1a541d7	zerver/lib: Use python 3 syntax for typing. With tweaks by tabbott to fix line spacing.	2017-11-18 16:09:04 -08:00
Steve Howell	1ac2360d2e	mypy: Fix QuerySet -> QuerySet[Subscription].	2017-10-30 16:33:51 -07:00
Steve Howell	faba34dae4	Simplify bulk_remove_subscriptions(). We extract get_bulk_stream_subscriber_info() from this function to remove some of the complexity. Also, in that new function we avoid a hop to the database by querying on stream ids instead of recipient ids. The query that gets changed here does require a join to the recipient table (to get the stream id), so it's a little bit of a tradeoff.	2017-10-30 16:33:50 -07:00
Steve Howell	08ad26f913	refactor: Extract get_stream_subscriptions_for_users().	2017-10-29 18:36:35 -07:00
Steve Howell	b3192d17ab	refactor: Extract get_stream_subscriptions_for_user().	2017-10-29 18:36:35 -07:00
Steve Howell	8e0b417bd9	Extract get_active_subscriptions_for_stream_ids().	2017-10-29 18:36:35 -07:00
Steve Howell	126e14d1de	Add zerver/lib/stream_subscription.py. The first method we extract to this library is get_active_subscriptions_for_stream_id(). We also move num_subscribers_for_stream_id() to here, which is slightly annoying (having the method on Stream was nice) but avoids some circular dependency issues.	2017-10-29 18:36:35 -07:00

38 Commits