This change removes a lot of complexity, with a very likely
positive overall impact on system performance and
negligible downside.
We already cache display recipients at a per-user
level, so there's no need for another cache layer on
top of that which keys them by recipient id.
This also avoids the strange situation where Alice/Bob
and Bob/Charlie get put into the top-layer cache and we
still take a cache miss on Alice/Charlie, even though
the lower-level cache can serve per-user lookups.
This change does introduce an extra database round trip
if any of our messages have a huddle, but the query is
extremely cheap, and we can always try to cache that
function more directly or try to re-use some of our
other huddle-based caches.
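
To make the layering concrete, here is a minimal, self-contained sketch of why per-user cache keys avoid the Alice/Charlie miss described above (the names and the in-memory cache stand-in are illustrative, not Zulip's actual cache API):

from typing import Any, Dict, Iterable, List

# Stand-in for the real cache backend; illustrative only.
_cache: Dict[str, Dict[str, Any]] = {}

def user_display_recipient_key(user_id: int) -> str:
    # Keyed per user, not per huddle: fetching Alice/Bob and then
    # Bob/Charlie leaves Alice, Bob, and Charlie individually cached,
    # so a later Alice/Charlie lookup is served entirely from cache.
    return f"user_display_recipient:{user_id}"

def fetch_user_rows(user_ids: List[int]) -> List[Dict[str, Any]]:
    # Stand-in for a thin database query restricted to the needed columns.
    return [{"id": user_id, "full_name": f"user{user_id}"} for user_id in user_ids]

def bulk_fetch_user_display_recipients(user_ids: Iterable[int]) -> Dict[int, Dict[str, Any]]:
    result: Dict[int, Dict[str, Any]] = {}
    missing: List[int] = []
    for user_id in user_ids:
        row = _cache.get(user_display_recipient_key(user_id))
        if row is None:
            missing.append(user_id)
        else:
            result[user_id] = row
    for row in fetch_user_rows(missing):  # at most one extra query
        _cache[user_display_recipient_key(row["id"])] = row
        result[row["id"]] = row
    return result
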
As part of this, we clean up the names for the
lower-level per-user cache of display recipients, and
we simplify the cache keys.
We also stop passing a full Recipient object to the
`bulk_get_huddle_user_ids` function.
The local impact of this change should be easy to
measure (at least approximately), since we use this
function every time a user gets messages via the
/messages endpoint.
The only overlap between how we fetched streams and
users was that they shared some rather complicated
data structures.
We can also short-circuit some logic if a message
batch is either all-stream or all-DM.
We restrict the columns, avoid quadratic looping,
and don't bother with order_by.
We also return the user ids (per recipient) as
sets, since that's how the only caller uses the
info (albeit implicitly via set.union accepting
a list).
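
As an illustrative sketch of the query shape described above (using the Subscription model to link users to huddle recipients; this is not the exact code):

from collections import defaultdict
from typing import Dict, List, Set

from zerver.models import Subscription

def bulk_get_huddle_user_ids(recipient_ids: List[int]) -> Dict[int, Set[int]]:
    if not recipient_ids:
        return {}
    # Restrict the columns to the two ids we need, skip order_by, and
    # build the per-recipient sets in a single linear pass.
    rows = Subscription.objects.filter(recipient_id__in=recipient_ids).values_list(
        "recipient_id", "user_profile_id"
    )
    result: Dict[int, Set[int]] = defaultdict(set)
    for recipient_id, user_profile_id in rows:
        result[recipient_id].add(user_profile_id)
    return result
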
This demonstrates a way to resolve the long-standing issue
of typing higher-order identity functions without using
`cast` and in a type-safe manner for decorators in `cache.py`.
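
One way to express such an identity-typed decorator is with ParamSpec; a minimal sketch (the decorator name here is hypothetical, and the real decorators in cache.py carry actual caching logic):

from functools import wraps
from typing import Callable, TypeVar

from typing_extensions import ParamSpec  # typing.ParamSpec on Python 3.10+

ParamT = ParamSpec("ParamT")
ReturnT = TypeVar("ReturnT")

def example_identity_decorator(func: Callable[ParamT, ReturnT]) -> Callable[ParamT, ReturnT]:
    # The wrapper has exactly the wrapped function's signature, so no
    # cast() is needed and mypy checks call sites against the real types.
    @wraps(func)
    def wrapper(*args: ParamT.args, **kwargs: ParamT.kwargs) -> ReturnT:
        return func(*args, **kwargs)

    return wrapper
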
Signed-off-by: Zixuan James Li <359101898@qq.com>
This was hiding an actual type error in test_cache: a mismatch between
the object ID type, which is str, and the default id_fetcher, which
returns int.
Mypy’s insufficient support for default generic arguments basically
means we can’t use them without a lot of overloading, and there are
not enough callers here to justify that.
https://github.com/python/mypy/issues/3737
To keep the call sites from getting messy, we add some
less generic wrappers for generic_bulk_cached_fetch.
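
As a hypothetical illustration of what such a wrapper can look like (the wrapper name, and the exact keyword arguments of generic_bulk_cached_fetch, are assumptions here rather than the real signatures):

from typing import Callable, Dict, List, TypeVar

from zerver.lib.cache import generic_bulk_cached_fetch

ItemT = TypeVar("ItemT")

def bulk_cached_fetch_by_id(
    cache_key_function: Callable[[int], str],
    query_function: Callable[[List[int]], List[ItemT]],
    object_ids: List[int],
    id_fetcher: Callable[[ItemT], int],
) -> Dict[int, ItemT]:
    # Pin the "cache the item as-is, keyed by an int id" defaults once,
    # so call sites don't repeat the fully generic plumbing.
    return generic_bulk_cached_fetch(
        cache_key_function,
        query_function,
        object_ids,
        extractor=lambda obj: obj,
        setter=lambda obj: obj,
        id_fetcher=id_fetcher,
        cache_transformer=lambda obj: obj,
    )
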
Signed-off-by: Anders Kaseorg <anders@zulip.com>
Fixes #2665.
Regenerated by tabbott with `lint --fix` after a rebase and change in
parameters.
Note from tabbott: In a few cases, this converts technical debt in the
form of unsorted imports into different technical debt in the form of
our largest files having very long, ugly import sequences at the
start. I expect this change will increase pressure for us to split
those files, which isn't a bad thing.
Signed-off-by: Anders Kaseorg <anders@zulip.com>
Automatically generated by the following script, based on the output
of lint with flake8-comma:

import re
import sys

last_filename = None
last_row = None
lines = []

for msg in sys.stdin:
    # Match colorized lint output lines of the form "flake8 | file:row:col: code ...".
    m = re.match(
        r"\x1b\[35mflake8 \|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg
    )
    if m:
        filename, row_str, col_str, err = m.groups()
        row, col = int(row_str), int(col_str)
        if filename == last_filename:
            assert last_row != row
        else:
            # Starting a new file: write back the previous one and load this one.
            if last_filename is not None:
                with open(last_filename, "w") as f:
                    f.writelines(lines)
            with open(filename) as f:
                lines = f.readlines()
            last_filename = filename
        last_row = row
        line = lines[row - 1]
        if err in ["C812", "C815"]:
            # Missing trailing comma: insert one at the reported column.
            lines[row - 1] = line[: col - 1] + "," + line[col - 1 :]
        elif err in ["C819"]:
            # Prohibited trailing comma: remove it (and any following spaces).
            assert line[col - 2] == ","
            lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ")

if last_filename is not None:
    with open(last_filename, "w") as f:
        f.writelines(lines)
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
Generated by `pyupgrade --py3-plus --keep-percent-format` on all our
Python code except `zthumbor` and `zulip-ec2-configure-interfaces`,
followed by manual indentation fixes.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
When we are fetching messages, we need to hydrate
stream names into the messages for legacy reasons.
(Ideally, we could skip this step for the webapp
and modern mobile clients, since they really only
need stream_ids, but we're not there yet.)
We keep a recipient cache that maps recipient ids
to stream names.
When we populate that cache, we now use `values(...)`
to avoid fat objects and extra DB work.
Note that we are already using a similar technique
for hydrating PM/huddle recipients.
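
As a rough sketch of that cache-population step (the helper name and the exact wiring into the cache are illustrative):

from typing import Dict

from zerver.models import Stream

def fetch_stream_names_for_recipients(recipient_id_to_stream_id: Dict[int, int]) -> Dict[int, str]:
    # Map recipient_id -> stream name with a thin .values() query, so we
    # never instantiate full Stream objects just to read their names.
    stream_id_to_recipient_id = {
        stream_id: recipient_id
        for recipient_id, stream_id in recipient_id_to_stream_id.items()
    }
    rows = Stream.objects.filter(id__in=stream_id_to_recipient_id).values("id", "name")
    return {stream_id_to_recipient_id[row["id"]]: row["name"] for row in rows}
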
There is no need to fetch entire Stream or UserProfile objects, as
only a few fields are needed. We use Django's .values() method to
fetch just those fields.
For UserProfiles, this means the queries already return dictionaries
in the display_recipient form (the UserDisplayRecipient type), so we
can remove the user_profile_to_display_recipient_dict function, as
there's no need for the UserProfile -> UserDisplayRecipient conversion
anymore.
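
For example, the user-side fetch can plausibly be as thin as the following (illustrative; the real query's filter may differ), with each row already in the UserDisplayRecipient shape:

from zerver.models import UserProfile

user_ids = {1, 2, 3}  # example input: the user ids we need display data for
user_display_recipients = list(
    UserProfile.objects.filter(id__in=user_ids).values(
        "id", "email", "full_name", "short_name", "is_mirror_dummy"
    )
)
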
Instead of writing out the rather unclear type Union[str,
List[UserDisplayRecipient]] wherever the display_recipient of message
dicts is involved, we use DisplayRecipientT (renamed from
DisplayRecipientCacheT, since there wasn't much reason to have the word
Cache in there), which makes the actual nature of these objects clearer
and avoids repeating a pretty big type declaration.
Since the display_recipient dictionaries corresponding to users always
have the keys email, full_name, short_name, id, and is_mirror_dummy,
we can define a UserDisplayRecipient type using an appropriate
TypedDict, instead of the overly general Dict[str, Any].
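
Concretely, the definitions described above look roughly like this (the exact formatting in types.py may differ):

from typing import List, Union

from typing_extensions import TypedDict  # typing.TypedDict on Python 3.8+

class UserDisplayRecipient(TypedDict):
    email: str
    full_name: str
    short_name: str
    id: int
    is_mirror_dummy: bool

# A stream's display_recipient is just the stream name; a PM/huddle's is
# the list of per-user dicts above.
DisplayRecipientT = Union[str, List[UserDisplayRecipient]]
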
The type definitions are moved from display_recipient.py to types.py, so
that they can be imported in models.py.
Appropriate type adjustments are made in various places in the code
where we operate on display_recipients.
The user information in display_recipient in cached message_dicts
becomes outdated if the information is changed in any way.
In particular, since we don't have a way to find all the message
objects that might contain PMs after an organization toggles the
setting to hide user email addresses from other users, we had a
situation where clients might see inaccurate cached data from before
the transition, potentially for hours.
We address this by using our generic_bulk_cached_fetch toolchain to
ensure we are always fetching display_recipient data from the database
(and/or a special recipient_id -> display_recipient cache, which we
can flush easily).
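
A rough sketch of the cache shape this relies on (the key format and the flush helper are illustrative, not the exact implementation):

from typing import List

from zerver.lib.cache import cache_delete_many

def display_recipient_cache_key(recipient_id: int) -> str:
    # Keyed only by recipient_id, so entries can be flushed directly when
    # the underlying user data changes (e.g. a realm toggles email
    # visibility), instead of hunting down every cached message_dict.
    return f"display_recipient_dict:{recipient_id}"

def flush_display_recipients(recipient_ids: List[int]) -> None:
    # The next /messages request repopulates these entries via the
    # generic_bulk_cached_fetch path described above.
    cache_delete_many([display_recipient_cache_key(rid) for rid in recipient_ids])
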
Fixes #12818.