zulip

Commit Graph

Author	SHA1	Message	Date
shanukun	4b00e5da72	refactor: Make acting_user a mandatory kwarg for do_deactivate_realm.	2021-04-02 14:44:41 -07:00
shanukun	626cf52723	refactor: Make acting_user a mandatory kwarg for do_set_realm_signup_notifications_stream.	2021-04-02 14:44:41 -07:00
shanukun	00d998b955	refactor: Make acting_user a mandatory kwarg for do_set_realm_notifications_stream.	2021-04-02 14:44:41 -07:00
WookieMonkeys	1b6f68bb59	stream: Add entropy to deactivated streams. Adding an additional `!` to the stream name each time a stream is deactivated, to a maximum of 21 times, effectively limits number of times a stream with a given name can be deactivated. This is unlikely to come up in common usage, but may be confusing when testing. Change what we prepend to deactivated stream names to something with more entropy than just `!`, by instead prepending a substring of hash of the stream's ID. `!`s. Using 128 bits of the hash means that it will require more than 10^18th renames to have a 1% chance of collision. Because too-long stream names are also truncated at 60 characters, having this entropy in the beginning of the name also helps address potential issues from stream names that differed only in, e.g. the 60th character. Fixes #17016.	2021-04-01 17:16:35 -07:00
Tim Abbott	08116a17b0	typing: Move to parameter validation to view code.	2021-04-01 08:30:47 -07:00
Tim Abbott	2a8e9db8f1	typing: Remove obsolete block comment. The legacy feature described here was removed in `d5cc29755e`.	2021-04-01 08:13:23 -07:00
Dinesh	ddca602123	typing_notifications: Do op validation in view function. Instead of validating `op` value later, this commit does that in `REQ`. Also helps avoiding duplication of this validation when stream typing notifications feature is added.	2021-04-01 07:50:02 -07:00
Mateusz Mandera	353e1a2016	migrations: Subscription.is_user_active denormalization - final step. With the previous two commits deployed, we're ready to use the denormalization to optimize the query. With dev environment db prepared using ./manage.py populate_db --extra-users=2000 --extra-streams=400 this takes the execution time of the query in bulk_get_subscriber_user_ids from 1.5-1.6s to 0.4-0.5s on my machine.	2021-03-30 09:29:36 -07:00
Tim Abbott	53ed759fc1	users: Fix ordering issue with deactivating bots. The new comment explains the issue in some detail, but basically if we deactivate the bots first, then an error partway through is corrected by a retry; if we deactivate the user first, then we may leak undeactivated bots if a failure occurs.	2021-03-30 09:21:41 -07:00
Mateusz Mandera	f329878376	migrations: Subscription.is_user_active denormalization - step one. This adds the is_user_active with the appropriate code for setting the value correctly in the future. In the following commit a migration to backfill the value for existing Subscriptions will be added. To ensure correct user_profile.is_active handling also in tests, we replace all direct .is_active mutation with calls to appropriate functions.	2021-03-30 09:19:03 -07:00
Mateusz Mandera	d236d3f738	users: Improve db transaction structure in user (de)activation process. These procedures should be done atomically overall, with the exception of the code that sends events to avoid block if there's a delay communicating with Tornado. We add the savepoint=False on underlying function that already executes inside an atomic context - to avoid the overhead of creating savepoints where they aren't needed.	2021-03-30 09:15:24 -07:00
Mateusz Mandera	0e6d230804	users: Fix do_deactivate_user to save is_mirror_user. This was a bug - is_mirror_user was not listed in update_fields despite being changed.	2021-03-30 09:15:24 -07:00
shanukun	f8ef7d56b9	refactor: Make acting_user a mandatory kwarg for do_set_realm_message_editing.	2021-03-29 15:51:45 -07:00
shanukun	4dc62f962b	refactor: Make acting_user a mandatory kwarg for do_set_realm_authentication_methods.	2021-03-29 15:51:45 -07:00
shanukun	c95061e9b9	refactor: Make acting_user a mandatory kwarg for do_deactivate_user.	2021-03-29 15:51:45 -07:00
shanukun	8f3ae715c0	refactor: Make acting_user a mandatory kwarg for do_reactivate_user.	2021-03-29 15:51:45 -07:00
shanukun	3c3d805dd1	refactor: Make acting_user a mandatory kwarg for do_change_user_role	2021-03-29 15:51:45 -07:00
shanukun	459710a897	refactor: Make acting_user a mandatory kwarg for do_set_realm_property.	2021-03-29 15:51:45 -07:00
shanukun	bc2d58ad4a	custom_profile_fields: Remove op field for the event. * `op` (operation) field, added in `f6fb88549f`, was never intended for `custom_profile_fields` event. This commit removes the `op` as it doesn't have any use in the code. * As a part of cleanup, this also eliminates the schema check warnings for `custom_profile_fields` event, mentioned in #17568.	2021-03-26 16:28:33 -07:00
Mateusz Mandera	f147c42f9d	actions: Change caching of create_mirror_user_if_needed. Emails are not unique, so we can only sensibly cache using keys formed with both email and realm. This requires adding a new cache key function for caching by delivery email - user_profile_delivery_email_cache_key.	2021-03-25 00:47:42 -07:00
tushar912	b220d29fed	custom profile fields: Rename "CHOICE" to "SELECT" in backend. Rename the "CHOICE" field to "SELECT" in backend. This is done to improve readability as a prep for the upcoming "SELECT_MULTIPLE" field.	2021-03-24 12:54:51 -07:00
shanukun	cfe0fa3788	event_schema: Add schema check for realm/deactivated event. This add the schema checker, openapi schema, and also a test for realm/deactivated event. With several block comments by tabbott explaining the logic behind our behavior here. Part of #17568.	2021-03-23 12:16:16 -07:00
Abhijeet Prasad Bodas	a30ca8490d	refactor: Allow custom msg strs in send_message_moved_breadcrumbs. This is a prep commit which modifies the `send_message_moved_breadcrumbs` function to take message strings as input. This is done to reuse the function in other places like the /digress command.	2021-03-18 16:13:00 -07:00
Tim Abbott	e42354c917	do_create_realm: Require passing kwargs by name.	2021-03-14 08:50:02 -07:00
Mateusz Mandera	d91d3a05b9	tests: Use do_create_realm where possible. Using do_create_realm should be preferred over manual creation where possible, as it creates more realistic data.	2021-03-14 08:50:02 -07:00
sahil839	9a432b0c3b	events: Remove name field from update subscription events. This commit removes name field from update subscription events, as it is not used by any of the clients, and use stream_id in the events code instead.	2021-03-07 22:03:24 -08:00
sahil839	b53c773987	events: Remove email field from update subscription events. This commit removes email field from update subscription events, as email field is of no use in this case.	2021-03-01 14:52:06 -08:00
Mateusz Mandera	d91d1cba96	actions: Simplify the conditionals in revoke_preregistration_users. This is a refactor to make the ifs easier to reason through.	2021-02-26 08:26:43 -08:00
Mateusz Mandera	22ac0f152e	actions: Change prereg_user.status in revoke_preregistration_users. It's clearer to have all the logic adjusting PreregistrationUser statuses in one place rather than scattered.	2021-02-26 08:26:43 -08:00
Mateusz Mandera	c651bed0d4	actions: Extract revoke_preregistration_users function.	2021-02-26 08:26:43 -08:00
Mateusz Mandera	4b903c5dcd	invites: Fix bug revoking user invites in other realms than intended. Fixes #17238. In process_new_human user, the queries were wrong, revoking all invites sent to the email address, even in other realms than the one where the new account just got created.	2021-02-26 08:26:43 -08:00
shanukun	fafe1a31d7	refactor: Make acting_user a mandatory kwarg for do_activate_user.	2021-02-25 17:58:00 -08:00
shanukun	4b67946605	refactor: Make acting_user a mandatory kwarg for do_create_user.	2021-02-25 17:58:00 -08:00
Mateusz Mandera	51d7f24d20	actions: Remove realm argument to internal_send_stream_message. The argument is redundant.	2021-02-23 15:26:47 -08:00
Mateusz Mandera	09fc79f911	actions: Remove realm argument to internal_send_private_message. The argument is redundant.	2021-02-23 15:26:47 -08:00
sahil839	d71afc5a26	actions: Include ROLE_MODERATOR in realm_user_count_by_role. This commmit includes ROLE_MODERATOR in realm_user_count_by_role. We also update test_change_role in test_audit_log.py to include changes for moderator role as well.	2021-02-23 15:01:14 -08:00
Abhijeet Prasad Bodas	fc0488fdb1	actions: Rename notify_topic_moved_streams function. This is a minor refactor which renames the notify_topic_moved_streams function to send_message_moved_breadcrumbs. This is done because this function will be also used for other things in the future, when moving streams or when using the /digress command, for example.	2021-02-16 17:28:59 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Steve Howell	d0ba3cadcf	minor: Clean up code formatting for do_create_user. This makes the code easier to visually scan.	2021-02-08 09:07:04 -05:00
Anders Kaseorg	d13a039b54	actions: Sort available_notification_sounds. os.listdir uses an arbitrary filesystem-dependent order. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-07 06:33:55 -05:00
Aman Agrawal	b26727ed16	invite-new-users: Specify that the limit spans for the whole day.	2021-01-29 09:51:11 -08:00
Alex Vandiver	3381fad258	registration: Stop enqueueing to the signups queue. `c2526844e9` removed the `signups` queue worker, and the command-line tool that enqueues to it -- but not the automated process that enqueues during signups itself. Remove the signup, since it is no longer in use.	2021-01-24 09:42:55 -08:00
Steve Howell	f2586d2f9b	refactor: Introduce SubscriptionInfo dataclass. We use this as the return type for gather_subscriptions_helper and get_web_public_subs, instead of tuples.	2021-01-21 15:04:07 -08:00
Steve Howell	768117f0ff	refactor: Unify include_subscribers logic.	2021-01-21 15:04:07 -08:00
Steve Howell	e735ce3f01	refactor: Move subscribers logic up to caller. The gather_subscriptions_helper function now updates subscribers instead of delegating.	2021-01-21 15:04:07 -08:00
Steve Howell	d9740045a5	refactor: Eliminate checks in build_stream_dict_for_sub. We eliminate some redundant checks. We also consistently provide a `subscribers` field in our stream data with `[]`, even if our users can't access subscribers. We therefore bump the API version and tweak the docs. (See further down for a detailed justification of the change.) Even though it is sometimes fine to have redundant code that is defensive in nature, some upcoming changes are gonna move subscriber-related logic out of build_stream_dict_for_sub for certain codepaths as part of our effort to streamline the payload for subscribers within page_params. So we can't rely on the code that I removed here inside of build_stream_dict_for_sub. Anyway, it makes more sense to do these checks explicitly in the validate function. The code in build_stream_dict_for_sub was almost effectively a noop, since the validation function was already preventing us from getting subscriber info. The only difference it made was sometimes converting `[]` to `None`, and then subsequently omitting the subscribers field. Neither ZT nor the webapp make any distinction between `[]` or <missing key> for the `subscribers` data in `page_params`. The webapp has had this code for a long time (and now equivalent code elsewhere in this PR): if (!Object.prototype.hasOwnProperty.call(sub, "subscribers")) { sub.subscribers = new LazySet([]); } The webapp calculates access based on booleans, anyway: sub.can_access_subscribers = page_params.is_admin \|\| sub.subscribed \|\| (!page_params.is_guest && !sub.invite_only); And ZT would choke if `subscribers` were missing, except that it never gets to the relevant code due to other checks: def get_other_subscribers_in_stream(<snip>): assert stream_id is not None or stream_name is not None if stream_id: assert self.is_user_subscribed_to_stream(stream_id) return [sub for sub in self.stream_dict[stream_id]['subscribers'] if sub != self.user_id] else: return [sub for _, stream in self.stream_dict.items() for sub in stream['subscribers'] if stream['name'] == stream_name if sub != self.user_id] You could make a semantic argument that we should prefer <missing key> to `[]` when subscribers aren't even available, but we have precedent from the way that `bulk_get_subscriber_user_ids` has traditionally populated its result: result: Dict[int, List[int]] = {stream["id"]: [] for stream in stream_dicts} If we changed `stream_dicts` to `target_stream_dicts` we would faciliate a move toward `None`, but it would just cause headaches for other server code as well as the frontends (which, to reiterate, already prefer the empty array for convenience).	2021-01-21 15:04:07 -08:00
Steve Howell	40b0c36d21	minor: Update comment for guest subscription access. As my comment indicates, I would prefer to handle this explicitly by raising JsonableError in an else statement here, but it's not a big deal. This function can probably be simplified with a bit of work, mostly on the testing side to make sure we are covering all edge cases, but that is out of the scope of my current PR.	2021-01-21 15:04:07 -08:00
Mateusz Mandera	b15dd9147d	create_user: Remove redundant argument of get_display_email_address.	2021-01-21 13:04:38 -08:00
Steve Howell	36b1794c1d	user_status: Fix bug with resetting away status. The fix is pretty simple here--if the client doesn't send an away status, then don't change it. I improved the tests to cover this case. Fixes #17071	2021-01-20 13:59:35 -05:00
Mateusz Mandera	3623681d30	message_edit: Don't rely on .recipient_id change not affecting recipient. The codepath for moving a topic changes the message.recipient_id to the id of the new recipient, but later, in update_messages_for_topic_edit, it uses message.recipient when querying for messages with the matching topic in the old stream (because those are the other messages that need to be moved). This is a bug which happens to work fine, because in Django 2, if message.recipient gets fetched first and then message.recipient_id is mutated, message.recipient will not be altered and thus will retain the outdated, previously fetched value. In Django 3 changing .recipient_id causes .recipient to be updated to the new Recipient objects, which is the Recipient of the new stream. That will cause the bug to manifest. This is a bugfix preparing for the upgrade to Django 3.	2021-01-17 10:39:46 -08:00
Siddharth Asthana	6c888977a6	change_subdomain: Create a deactivated realm on updating subdomain. When changing the subdomain of a realm, create a deactivated realm with the old subdomain of the realm, and set its deactivated_redirect to the new subdomain. Doing this will help us to do the following: - When a user visits the old subdomain of a realm, we can tell the user that the realm has been moved. - During the registration process, we can assure that the old subdomain of the realm is not used to create a new realm. If the subdomain is changed multiple times, the deactivated_redirect fields of all the deactivated realms are updated to point to the new uri.	2021-01-07 14:15:22 -08:00
Aman Agrawal	e566e985e4	topic_edit: Store edit history in all the message affected. Instead of just storing the edit history in the message which triggered the topic edit, we store the edit history in all the messages that changed. This helps users track the edit history of a message more reliably.	2021-01-04 18:18:05 -08:00
Mateusz Mandera	160cc5120a	api: Require can_create_users permission to create users via API. Allowing any admins to create arbitrary users is not ideal because it can lead to abuse issues. We should require something stronger that requires the server operator's approval and thus we add a new can_create_users permission.	2020-12-21 13:20:21 -08:00
Mateusz Mandera	d0dc04a093	models: Rename is_api_super_user to can_forge_sender,	2020-12-21 13:15:39 -08:00
sahil839	2fa33be683	actions: Refactor check_message to change return dataclass instead of Dict. We change the return type of check_message to be dataclass instead of Dict[str, Any]. This refactoring helps us to understand the context of the data structure returned by check_message clearly which was not possible when using Dict. SendMessageRequest class is added in zerver/lib/message.py inspite of it not being used in that file itself just to maintain consistency as other TypedDicts and dataclasses are defined in that file and to avoid circular dependency as SendMessageRequest is being used in lib/widget.py as well. We also rename local variable to 'send_request' for accessing SendMessageRequest objects.	2020-12-21 12:55:30 -08:00
Anders Kaseorg	a054f57af6	message: Bundle message stripping, validation, and truncation. We always want to do these at the same time. Previously, message editing did too much stripping (fixes #16837) and failed to check for NUL bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-18 17:44:13 -08:00
sahil839	37c8505435	message: Raise exception when trying to mirror an already sent message. Previously we were just returning a dict containing a message id when trying to mirror a already sent message in 'zephyr_mirror' cases. This commit changes this behaviour to raise an exception when trying to mirror an already sent message by adding a new exception class ZephyrMessageAlreadySentException and then the caller returns the message_id directly, instead of calling do_send_messages which also returns a list of size one containing the message_id only. This is a prep commit for changing the return type of check_message to be a dataclass instead of a Dict as now we have only single output for check_message.	2020-12-18 16:40:11 -08:00
sahil839	db85b8a236	actions: Change type of wildcard_mention_user_ids in message_dict to set. The message_dict['wildcard_mention_user_ids'] should be empty set instead of empty list when there are no wildcard mentions similar to the case when there are wildcard mentions, where it is equal to set of user ids and not list of user ids.	2020-12-18 16:17:26 -08:00
Siddharth Asthana	82f5759299	Realm: Add a deactivated_redirect URLField to Realm object. We export a realm's data, and disable the realm, because the user is moving from Zulip Cloud (e.g. https://example.zulipchat.com/) to self-hosting or another platform (e.g. https://zulip.example.com/) which we do not control. This commit adds a field in the realm object called deactivated_redirect to store the url to which the realm has moved.	2020-12-14 21:04:52 -08:00
Mateusz Mandera	47228f3a95	actions: Implement do_delete_user. To have a reasonable way of creating the dummy user without duplicating code, we need change create_user to have the optional force_id argument.	2020-11-09 11:58:02 -08:00
Anders Kaseorg	41f509170b	users: Canonicalize the timezone identifier. While working on shifting toward native browser time zone APIs (#16451), it was found that all but very recent Chrome and Node versions reject certain legacy timezone aliases like US/Pacific (https://crbug.com/364374). For now, we only canonicalize the timezone property returned in user objects and not the timezone setting itself. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Tim Abbott	6d7cd351a3	events: Optimize creating streams for new users. During the new user creation code path, there can be no existing active clients for the user being created, so we can skip the code to send events to that user's clients. The tests here reflect that we need to send fewer events, and do fewer queries that would have been spent computing data for these.. Fixes #16503, combined with the long series of recent changes by Steve Howell to fix super-linear behavior in this code path.	2020-10-26 12:47:15 -07:00
Steve Howell	88a7a1b002	events: Optimize peer_add/peer_remove for public streams. We no bulk up peer_add/peer_remove events by user if the same user has subscribed to multiple streams (and just that single user). This mostly optimizes the new-user codepath, but the algorithm is a bit more general in nature.	2020-10-26 12:33:28 -07:00
sahil839	d0f5537fb2	actions: Modify check_message for handling wildcard_mention_policy setting. This commit adds enforcement for sending messages containing wildcard mentions according to wildcard_mention_policy.	2020-10-22 14:46:32 -07:00
Steve Howell	7ff3859136	subscriber events: Change schema for peer_add/peer_remove. We now can send an implied matrix of user/stream tuples for peer_add and peer_remove events. The client code basically does this: for stream_id in event['stream_ids']: for user_id in event['user_ids']: update_sub(stream_id, user_id) We used to send individual events, which gets real expensive when you are creating new streams. For the case of copy-to-stream case, we should see events go from U to 1, where U is the number of users added. Note that we don't yet fully optimize the potential of this schema. For adding a new user with lots of default streams, we still send S peer_add events. And if you subscribe a bunch of users to a bunch of private streams, we only go from U * S to S; we can't optimize it down to one event easily.	2020-10-22 11:19:53 -07:00
Steve Howell	85ed6f332a	performance: Avoid Recipient lookup for stream messages. All the fields of a stream's recipient object can be inferred from the Stream, so we just make a local object. Django will create a Message object without checking that the child Recipient object has been saved. If that behavior changes in some upgrade, we should see some pretty obvious symptom, including query counts changing. Tweaked by tabbott to add a longer explanatory comment, and delete a useless old comment.	2020-10-20 11:47:23 -07:00
Steve Howell	7bbcc2ac96	refactor: Compute peers for public streams later. This saves us a query for edge cases like when you try to unsubscribe from a public stream that you have already unsubscribed from. But this is mostly to prep for upcoming optimizations.	2020-10-20 11:31:22 -07:00
Steve Howell	363e5d31a6	refactor: Split out public/private logic for peer events. This doesn't change anything yet, but the goal is to eventually optimize events for the case where one user (typically a new user) gets subscribed to multiple public streams.	2020-10-20 11:31:22 -07:00
Steve Howell	3961e69381	refactor: Extract send_peer_subscriber_events. We now use the same basic code to send peer_add and peer_remove events.	2020-10-20 11:31:22 -07:00
Steve Howell	e6f6f8d45f	refactor: Avoid "stream_id" on sub. There was no need to put "stream_id" on the sub dictionary here. It's kinda annoying to introduce the little helper here, but I feel that's better than crufting up the sub data structure.	2020-10-18 14:27:31 -07:00
Steve Howell	628a826aa2	minor: Move code and add comments about three lists.	2020-10-18 14:27:31 -07:00
Steve Howell	ffee129a35	refactor: Clean up is_web_public flag. The is_web_public flag is already in Stream.API_FIELDS, so there is no reason for all this complicated logic. There's no reason to hack it on to the subscription object.	2020-10-18 14:27:31 -07:00
Steve Howell	4dce34ab8b	refactor: Simplify call to bulk_get_subscriber_user_ids. The way we were computing the dictionary was very convoluted--all we need is a set of subscribed user ids.	2020-10-18 14:27:31 -07:00
Steve Howell	b58152abda	refactor: Introduce all_streams_map. We replace all_streams_id with a map. We also use it to populate never_subscribed_streams. And all_streams_map is a superset of stream_hash, which we will soon kill off as well.	2020-10-18 14:27:31 -07:00
Steve Howell	78384ebf1b	minor: Remove confusing parens. Apparently I put these parens in the code as part of `73c30774cb` during 2017. It looks like I extracted is_public during the middle of my change and forgot to remove the unnecessary parens. (The code was correct, but it makes it look like a tuple if you're skimming it too quickly.)	2020-10-18 14:27:31 -07:00
Steve Howell	d60dd94168	refactor: Extract funcs from gather_subscriptions_helper. This is a pure code move, apart from a little bit of quote cleanup and renames: user_profile -> user stream_dict -> result	2020-10-18 14:27:31 -07:00
Steve Howell	79fcf78143	refactor: Exclude "active" from API_FIELDS. We just need to make sure the relevant queries get it for the triage process.	2020-10-18 14:27:31 -07:00
Steve Howell	c5769d31f2	minor: Move code for web_public_stream_ids.	2020-10-18 14:27:31 -07:00
Steve Howell	0ca07ffd3c	peformance: Eliminate StreamRecipientMap. That class is an artifact of when Stream didn't have recipient_id. Now it's simpler to deal with stream subscriptions. We also save a query during page load (and other places where we get subscriber info).	2020-10-18 14:27:31 -07:00
Steve Howell	1951d75796	performance: Avoid select_related("realm"). We also move this query up in the function for some future refactorings.	2020-10-18 14:27:31 -07:00
Steve Howell	3685fcc701	refactor: Remove recipient arg for do_mute_topic.	2020-10-16 12:58:11 -07:00
Steve Howell	378062cc83	performance: Avoid call to access_stream_by_id. We already trust ids that are put on our queue for deferred work. For example, see the code for "mark_stream_messages_as_read_for_everyone" We now pass stream_recipient_id when we queue up work for do_mark_stream_messages_as_read. This generally saves about 3 queries per user when we unsubscribe them from a stream.	2020-10-16 12:58:11 -07:00
Steve Howell	31eb97ddde	performance: Fix do_mark_stream_messages_as_read. This function no longer asks for data that it doesn't need.	2020-10-16 12:58:11 -07:00
Steve Howell	6d1f9de7d3	performance: Use SubInfo when removing subscribers. We get two speedups: * The query to get existing subscribers only gets the two fields we need. We no longer need all the overhead of user_profile and recipient data being returned in the query. * We avoid Django making extra hops to the database to get user info.	2020-10-16 12:58:11 -07:00
Steve Howell	73982f6cc9	refactor: Move SubInfo to stream_subscription.py.	2020-10-16 12:58:11 -07:00
Tim Abbott	caa939d2d5	actions: Use transaction.atomic properly when removing subscriptions. Previously, the transaction.atomic() was not properly scoped to ensure that RealmAuditLog entries were created in the same transaction, making it possible for state changes to not be properly recorded in RealmAuditLog.	2020-10-15 15:12:05 -07:00
Steve Howell	0b91526f28	events: Remove "occupied" semantics for "streams". When apps like mobile register for "streams", we will now just use active streams as our baseline, rather than "occupied" streams. This means we will send a stream that is active, even if it happens to have zero occupants. It's actually pretty rare that a stream has zero occupants, and it's not exactly clear that we want to exclude a non-occupied but otherwise active stream from our list of streams. It also happens to be fairly expensive to compute whether a stream is occupied. This change only affects API clients (including possibly our mobile app). The main webapp never used the data from this codepath.	2020-10-15 15:12:01 -07:00
Steve Howell	b4346d0276	performance: Extract subscribers/peers in bulk. We replace get_peer_user_ids_for_stream_change with two bulk functions to get peers and/or subscribers. Note that we have three codepaths that care about peers: subscribing existing users: we need to tell peers about new subscribers we need to tell subscribed user about old subscribers unsubscribing existing users: we only need to tell peers who unsubscribed subscribing new user: we only need to tell peers about the new user (right now we generate send_event calls to tell the new user about existing subscribers, but this is a waste of effort that we will fix soon) The two bulk functions are this: bulk_get_subscriber_peer_info bulk_get_peers They have some overlap in the implementation, but there are some nuanced differences that are described in the comments. Looking up peers/subscribers in bulk leads to some nice optimizations. We will save some memchached traffic if you are subscribing to multiple public streams. We will save a query in the remove-subscriber case if you are only dealing with private streams.	2020-10-15 15:12:01 -07:00
Steve Howell	94e41c71f9	refactor: Use set of ids for altered users.	2020-10-15 15:12:01 -07:00
Steve Howell	b894597fa3	refactor: Use sets of stream_ids for helper args.	2020-10-15 15:12:01 -07:00
Steve Howell	3889554977	refactor: Extract send_peer_remove_events.	2020-10-15 15:12:01 -07:00
Tim Abbott	bf66e9c4ab	actions: Add transaction.atomic to bulk_add_subs_to_db_with_logging. This will ensure that we always fully execute the database part of modifying subscription objects. In particular, this should prevent invariant failures like #16347 where Subscription objects were created without corresponding RealmAuditLog entries. Fixes #16347.	2020-10-14 11:06:00 -07:00
Steve Howell	5728149e94	performance: Streamline query to add subscribers. We don't need the select_related('user_profile') optimization any more, because we just keep track of user info in our own data structures. In this codepath we are never actually modifying users; we just occasionally need their ids or emails. This can be a pretty substantive improvement if you are adding a bunch of users to a stream who each have a bunch of their own subscriptions. We could also limit the number of full rows in this query by adding an extra hop to the DB just to get colors (using values_list), and then only get full sub info for the streams that we're adding, rather than getting every single subscription, in full, for each user. Apart from finding what colors the user has already used, the only other reason we need all the columns in Subscription here is to handle streams that need to be reactivated. Otherwise we could do only("id", "active", "recipient_id", "user_profile_id") or similar. Fortunately, Subscription isn't an overly wide table; it's mostly bool fields. But by far the biggest thing to avoid is bringing in all the extra user_profile data. We have pretty good coverage on query counts here, so I think this fix is pretty low risk.	2020-10-14 11:03:07 -07:00
Steve Howell	116a441bc5	refactor: Introduce SubInfo class. This class removes a lot of the annoying tuples we were passing around. Also, by including the user everywhere, which is easily available to us when we make instances of SubInfo, it sets the stage to remove select_related('user_profile').	2020-10-14 10:53:10 -07:00
Steve Howell	febef45e38	minor: Add comments to do_get_streams.	2020-10-14 10:53:10 -07:00
Steve Howell	a9356508ca	events: Stop sending occupy/vacate events. We used to send occupy/vacate events when either the first person entered a stream or the last person exited. It appears that our two main apps have never looked at these events. Instead, it's generally the case that clients handle events related to stream creation/deactivation and subscribe/unsubscribe. Note that we removed the apply_events code related to these events. This doesn't affect the webapp, because the webapp doesn't care about the "streams" field in do_events_register. There is a theoretical situation where a third party client could be the victim of a race where the "streams" data includes a stream where the last subscriber has left. I suspect in most of those situations it will be harmless, or possibly even helpful to the extent that they'll learn about streams that are in a "quasi" state where they're activated but not occupied. We could try to patch apply_event to detect when subscriptions get added or removed. Or we could just make the "streams" piece of do_events_register not care about occupy/vacate semantics. I favor the latter, since it might actually be what users what, and it will also simplify the code and improve performance.	2020-10-14 10:53:10 -07:00
Steve Howell	598601e8fc	stream events: Prevent spurious events. If a user asks to be subscribed to a stream that they are already subscribed to, then that stream won't be in new_stream_user_ids, and we won't need to send an event for it. This change makes that happen more automatically.	2020-10-13 11:28:17 -07:00
Steve Howell	18771099e4	performance: Introduce new_stream_user_ids. Let U = number of users to subscribe S = number of streams to subscribe We were technically doing N^3 amount of work when we sent certain events, or to be more precise, U * S * S amount of work. For each stream, we were looping through a list of tuples of size U * S to find the users for the stream. In practice either U or S is usually 1, so the performance gains here are probably negligible, especially since the constant factors here were just slinging around Python data. But the code is actually more readable now, so it's a double win.	2020-10-13 11:28:17 -07:00
Steve Howell	ebb605319b	refactor: Rename stream_map to recipient_id_to_stream. I want to make a new dict called stream_id_to_stream, and stream_map would be confusing.	2020-10-13 11:28:17 -07:00

1 2 3 4 5 ...

1745 Commits