zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	8bcdf4ca97	python: Convert TypedDict declarations to Python 3.6 style. A subset of the diff generated by pyupgrade --py36-plus --keep-percent-format. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-26 11:43:40 -07:00
Tim Abbott	a53daa6f8c	message: Fix malformed reaction data. After a message was reset in our caches cache via message editing or adding/removing a reaction, we were sending corrupt data to the cache because build_message_dict (and thus build_dict_from_raw_db_row) was improperly being called before sewing in the reaction data. As a result, we were sending raw database data in the reaction dictionaries, rather than the reformatted version expected by the API. Bug introduced in `2a4c62a326`. Fixing this correctly required moving the rendering_realm_id logic one step higher in the call chain, which is a useful refactoring anyway (since we're no longer passing a `Message` object down)	2020-05-21 14:09:53 -07:00
Aman Agrawal	2a4c62a326	update_to_dict_cache: Use bulk queries when preparing msgs for cache. During events such as stream / topic name edit for a topic, we were running queries to db in loop for each message for reactions, submessages and realm_id. This commit reduces the queries to be done only for realm_id, which is yet to be fixed. This is accomplished by building messages with empty reactions and submessages and then updating them in the messages using bulk queries.	2020-05-19 10:30:03 -07:00
Aman Agrawal	b8fe6245e3	message: Extract method to sew submessages and reactions to msg.	2020-05-19 10:30:03 -07:00
Abhishek-Balaji	052368bd3e	alert_words: Move alert_words from UserProfile to separate model. Previously, alert words were a JSON list of strings stored in a TextField on user_profile. That hacky model reflected the fact that they were an early prototype feature. This commit migrates from that to a separate table, 'AlertWord'. The new AlertWord has user_profile, word, id and realm(denormalization so we can provide a nice index for fetching all the alert words in a realm). This transition requires moving the logic for flushing the Alert Words caches to their own independent feature. Note that this commit should not be cherry-picked without the following commit, which fixes case-sensitivity issues with Alert Words.	2020-04-27 11:29:50 -07:00
rebtung	f7fbe3419f	api: Improve consistency of reactions API. Previously, the message and event APIs represented the user differently for the same reaction data. To make this more consistent, I added a user_id field to the reaction dict for both messages and events. I updated the front end to use the user_id field rather than the user dict. Lastly, I updated front end and back end tests that used user info. I primarily tested this by running my local Zulip build and adding/removing reactions from messages. Fixes #12049.	2020-04-26 10:35:55 -07:00
Anders Kaseorg	fead14951c	python: Convert assignment type annotations to Python 3.6 style. This commit was split by tabbott; this piece covers the vast majority of files in Zulip, but excludes scripts/, tools/, and puppet/ to help ensure we at least show the right error messages for Xenial systems. We can likely further refine the remaining pieces with some testing. Generated by com2ann, with whitespace fixes and various manual fixes for runtime issues: - invoiced_through: Optional[LicenseLedger] = models.ForeignKey( + invoiced_through: Optional["LicenseLedger"] = models.ForeignKey( -_apns_client: Optional[APNsClient] = None +_apns_client: Optional["APNsClient"] = None - notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - signup_notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + signup_notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - author: Optional[UserProfile] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) + author: Optional["UserProfile"] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) - bot_owner: Optional[UserProfile] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) + bot_owner: Optional["UserProfile"] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) - default_sending_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) - default_events_register_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_sending_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_events_register_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) -descriptors_by_handler_id: Dict[int, ClientDescriptor] = {} +descriptors_by_handler_id: Dict[int, "ClientDescriptor"] = {} -worker_classes: Dict[str, Type[QueueProcessingWorker]] = {} -queues: Dict[str, Dict[str, Type[QueueProcessingWorker]]] = {} +worker_classes: Dict[str, Type["QueueProcessingWorker"]] = {} +queues: Dict[str, Dict[str, Type["QueueProcessingWorker"]]] = {} -AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional[LDAPSearch] = None +AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional["LDAPSearch"] = None Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-22 11:02:32 -07:00
Udit107710	db30cf470c	refactor: Making email_mirror independent of actions. Moved truncate_body, truncate_content and truncate_topic to message.py.	2020-04-18 16:58:29 -07:00
Anders Kaseorg	c734bbd95d	python: Modernize legacy Python 2 syntax with pyupgrade. Generated by `pyupgrade --py3-plus --keep-percent-format` on all our Python code except `zthumbor` and `zulip-ec2-configure-interfaces`, followed by manual indentation fixes. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-09 16:43:22 -07:00
Udit107710	ef741bf317	messages: Return shallow copy of message object. When more than one outgoing webhook is configured, the message which is send to the webhook bot passes through finalize_payload function multiple times, which mutated the message dict in a way that many keys were lost from the dict obj. This commit fixes that problem by having `finalize_payload` return a shallow copy of the incoming dict, instead of mutating it. We still mutate dicts inside of `post_process_dicts`, though, for performance reasons. This was slightly modified by @showell to fix the `test_both_codepaths` test that was added concurrently to this work. (I used a slightly verbose style in the tests to emphasize the transformation from `wide_dict` to `narrow_dict`.) I also removed a deepcopy call inside `get_client_payload`, since we now no longer mutate in `finalize_payload`. Finally, I added some comments here and there. For testing, I mostly protect against the root cause of the bug happening again, by adding a line to make sure that `sender_realm_id` does not get wiped out from the "wide" dictionary. A better test would exercise the actual code that exposed the bug here by sending a message to a bot with two or more services attached to it. I will do that in a future commit. Fixes #14384	2020-03-29 15:12:27 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
Steve Howell	ca74cd6e37	bug fix: Fix unread counts for certain API messages. If I send a message from a normal Zulip client, it is considered to be "read" by me. But if I send it via an API program (using my human account), the message is not immediately "read" by me. Now we handle this correctly in `get_raw_unread_data`. The symptom of this was that these messages would get "stuck" in "Private Messages" narrows until the next time you reloaded your app.	2020-03-17 16:26:42 -07:00
Steve Howell	405a529340	server: Sort user_ids in recent PM conversations. This change should prevent test flakes, plus it's more deterministic behavior for clients, who will generally comma-join the ids into a key for their internal data structures. I was able to verify test coverage on this by making the sort reversed, which would cause test_huddle_send_message_events to fail.	2020-01-02 11:59:58 -08:00
Mateusz Mandera	dda3ff41e1	messages: Optimize get_recent_private_conversations. Previously, get_recent_private_messages could take 100ms-1s to run, contributing a substantial portion of the total runtime of `/`. We fix this by taking advantage of the recent denormalization of personal_recipient into the UserProfile model, allowing us to avoid the complex join with Recipient that was previously required. The change that requires additional commentary is the change to the main, big SQL query: 1. We eliminate UserMessage table from the query, because the condition m.recipient_id=%(my_recipient_id)d implies m is a personal message to the user being processed - so joining with usermessage to check for user_profile_id and flags&2048 (which checks the message is private) is redundant. 2. We only need to join the Message table with UserProfile (on sender_id) and get the sender's personal_recipient_id from their UserProfile row. Fixes #13437.	2019-12-09 15:23:10 -08:00
Tim Abbott	54e357e154	avatars: Fix gravatar URLs with EMAIL_ADDRESS_VISIBILITY_ADMINS. Previously, we were using user_profile.email rather than user_profile.delivery_email in all calculations involving Gravatar URLs, which meant that all organizations with the new EMAIL_ADDRESS_VISIBILITY_ADMINS setting enabled had useless gravatars not based on the `user15@host.domain` type fake email addresses we generate for the API to refer to users. The fix is to convert these calculations to use the user's delivery_email. Some refactoring is required to ensure the data is passed through to the parts of the codebase that do the check; fortunately, our automated tests of schemas are effective in verifying that the new `sender_delivery_email` field isn't visible to the API. Fixes #13369.	2019-11-05 12:33:20 -08:00
Tim Abbott	399acea755	outgoing webhooks: Call finalize_payload on payloads. Apparently, the refactor months ago that introduced finalize_payload wasn't applied to the outgoing webhook code path, resulting in message dicts with an unexpected format with no avatar_url and some extra values that were intended to be internal details not relevant to external clients. Because this API is not widely used, we expect there to be little to no impact of converting this back to matching the `get_messages` interface, as it once was and has always been intended to be. The one somewhat tricky detail is that we include both the `content` and `rendered_content` fields, rather than asking the client to pick which they want via the `apply_markdown` flag, because there is no place for the client to configure that setting.	2019-11-05 12:26:33 -08:00
Rohitt Vashishtha	b69213808a	messages: Render topic links in context of stream realm. Priviously, we rendered the topic links using the msg.sender.realm. This resulted in issues with Zulip's internal bots not having access to the realm_filters of the destination stream's realm. For example, sending a message via the email gateway or notification would not linkify any realm filters that a user would expect them to.	2019-10-10 14:33:35 -07:00
Mateusz Mandera	dbe508bb91	models: Migration of Message.pub_date to date_sent, part 2. Fixes #1727. With the server down, apply migrations 0245 and 0246. 0246 will remove the pub_date column, so it's essential that the previous migrations ran correctly to copy data before running this.	2019-10-05 19:01:34 -07:00
Tim Abbott	5e19546c14	messages: Fix unread_msgs accounting of wildcard mentions. Previously, the unread_msgs data structure accounting (used for both the web and mobile apps to determine the "Unread mentions" count displayed in the UI) did not include wildcard mentions at all. We fix this by adding the logic required to include properly that data, with tests. As discussed in #6040, it makes sense to include muted streams and topics for the purpose of this calculation. Fixes part of #6040.	2019-08-25 20:32:12 -07:00
Mateusz Mandera	26f7dd09ce	display_recipient: Eliminate raw_display_recipient from message dicts. This was used as a helper to construct the final display_recipient when fetching messages. With the new mechanism of constructing display_recipient by fetching appropriate users/streams from the database and cache, this shouldn't be needed anymore.	2019-08-20 12:22:52 -07:00
Mateusz Mandera	87ccb4f6c7	types: Consistently use DisplayRecipientT for display_recipient objects. Instead of having the rather unclear type Union[str, List[UserDisplayRecipient]] where display_recipient of message dicts was involved, we use DisplayRecipientT (renamed from DisplayRecipientCacheT - since there wasn't much reason to have the word Cache in there), which makes it clearer what is the actual nature of the objects and gets rid of this pretty big type declaration.	2019-08-20 12:15:30 -07:00
Mateusz Mandera	3ba0a37a92	types: Define UserDisplayRecipient type using TypedDict. Since the display_recipients dictionaries corresponding to users are always dictionaries with keys email, full_name, short_name, id, is_mirror_dummy - instead of using the overly general Dict[str, Any] type, we can define a UserDisplayRecipient type, using an appropriate TypedDict. The type definitions are moved from display_recipient.py to types.py, so that they can be imported in models.py. Appropriate type adjustments are made in various places in the code where we operate on display_recipients.	2019-08-20 12:15:30 -07:00
Mateusz Mandera	c779bb1959	messages: Don't use display_recipient values from cached message_dicts. The user information in display_recipient in cached message_dicts becomes outdated if the information is changed in any way. In particular, since we don't have a way to find all the message objects that might contain PMs after an organization toggles the setting to hide user email addresses from other users, we had a situation where client might see inaccurate cached data from before the transition for a period of up to hours. We address this by using our generic_bulk_cached_fetch toolchain to ensure we always are fetching display_recipient data from the database (and/or a special recipient_id -> display_recipient cache, which we can flush easily). Fixes #12818.	2019-08-20 12:15:30 -07:00
Tim Abbott	27a0e307b6	cache: Fix typing for generic_bulk_cached_fetch. The typing for generic_bulk_cached_fetch is complicated, and was recorded incorrectly previously for the case where a cache_transformer function is required. We fix this by adding the new CacheItemT, and additionally add comments explaining what's going on with these types for future reference. Thanks to Mateusz Mandera for raising this issue.	2019-08-14 11:00:40 -07:00
Anders Kaseorg	becef760bf	cleanup: Delete leading newlines. Previous cleanups (mostly the removals of Python __future__ imports) were done in a way that introduced leading newlines. Delete leading newlines from all files, except static/assets/zulip-emoji/NOTICE, which is a verbatim copy of the Apache 2.0 license. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-06 23:29:11 -07:00
Anders Kaseorg	68dd8e4ec8	mypy: Migrate from mypy_extensions to typing_extensions. This gives us access to typing_extensions.Deque, which was not added to typing until 3.5.4. (PROVISION_VERSION is not bumped because the transitive dependency set in dev.txt hasn’t changed.) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-05 17:24:09 -07:00
Tim Abbott	ba67a9b76a	message: Use RawUnreadMessageResult typing more. This is a cleaner implementation given we have the TypedDict defined.	2019-08-02 17:34:27 -07:00
Tim Abbott	8f7a86f5ec	events: Move remove_message_id_from_unread_mgs to message.py. It belongs there, next to the related code for adding an event.	2019-08-02 17:26:13 -07:00
Roman Godov	a50824e031	models: Rename Subscription.in_home_view field to is_muted. This renames Subscription.in_home_view field to is_muted, for greater clarity as to what it does just from seeing the setting name, without having to look it up. Also disabled an obsolete test_migrations test. Fixes #10042.	2019-05-12 22:08:10 -07:00
Tim Abbott	4c3c669b41	mobile: Add support for recent_private_conversations API. This adds experimental support in /register for sending key statistical data on the last 1000 private messages that the user is a participant in. Because it's experimental, we require developers to request it explicitly in production (we don't use these data yet in the webapp, and it likely carries some perf cost). We expect this to be extremely helpful in initializing the mobile app user experience for showing recent private message conversations. See the code comments, but this has been heavily optimized to be very efficient and do all the filtering work at the database layer so that we minimize network transit with the database. Fixes #11944.	2019-04-04 16:47:43 -07:00
Bennet Sunder	7c5f316cb8	alert_words: Performance improvements in looking for alert_words. This commit leverages the ahocorasick algorithm to build a set of user_ids that have their alert_words present in the message. It runs in linear time of the order of length of the input message as opposed to number of alert_words. This is after building a ahocorasick Automaton which runs in O(number of alert_words in entire realm) which is usually cached.	2019-03-01 15:36:39 -08:00
Greg Price	b0a84cd7ab	message: Add an O(1)-query variant of bulk_access_messages. We'll use this in the push-notifications code, in a context where there should definitely already be UserMessage rows if everything's gone normally... but explicitly checking at the top seems like the right pattern from a secure-coding perspective.	2019-02-26 16:41:54 -08:00
Anders Kaseorg	f0ecb93515	zerver core: Remove unused imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:41:24 -08:00
Anders Kaseorg	4e0a79d146	test_messages: Don’t mock an unused import. Since `8a1794caa3`, we don't do any caching in that system. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:29:53 -08:00
Vishnu Ks	8a1794caa3	message: Store the value of first_visible_message_id in Realm table. This eliminates a bunch of potentially buggy caching code, with no material negative side effects.	2018-12-12 15:11:17 -08:00
Steve Howell	226a2826b5	Remove "subject" from zerver/lib/message.py. We also lock this down file in the linter, so that you can no longer introduce "subject" to `message.py`.	2018-11-08 14:45:16 +00:00
Steve Howell	592dd290b2	Avoid "subject" in MessageDict.	2018-11-08 14:33:44 +00:00
Steve Howell	06f8d0af7e	refactor: Extract do_render_markdown(). For the import-data codepath, we will call the extracted function directly in a subsequent commit. The do_render_markdown() function has more required parameters, which allows for more explicit code and also allows us to flatten out some logic related to alert words. (We just pass in empty sets/dicts as needed).	2018-11-07 10:20:14 -08:00
Steve Howell	35e9e5928f	render: Upstream calculation of translate_emoticons.	2018-11-07 10:20:14 -08:00
Steve Howell	32232377f7	Rename bugdown.subject_links -> topic_links.	2018-11-07 10:03:53 -08:00
Tim Abbott	887d20795f	message flags: Add where_starred helper and use it. The previous query ended up doing a scan of all a user's UserMessage rows, not just the ones tracked in the `starred` index.	2018-08-21 15:28:04 -07:00
Joshua Pan	83d8d662dc	events: Query starred message ids in fetch_initial_state_data.	2018-08-21 13:01:41 -07:00
Rhea Parekh	26300110ca	import: Fix rendered_content in imported messages. After the messages have been imported, set the rendered_content of the messages instead of leaving its value to be 'None'. This is important to ensure that: (1) Performance for users is good after completing the import. (2) The database's full-text indexes have all of the imported messages (which only happens properly when Message rows have their rendered_content field edited). Fixes #9168.	2018-08-09 15:12:53 -07:00
Tim Abbott	f10e006135	message: Bump MAX_UNREAD_MESSAGES to 50000. This doesn't seem to have a huge performance downside (less than 1s extra time for loading / on chat.zulip.org), and it means the possibility of users having so many unreads that we get weird/buggy behavior is much more unlikely to exist. We'll still want a better experience for users who somehow go over this limit, but it can be pretty firmly "you need to go mark some things as read".	2018-08-01 12:02:54 -07:00
Shubham Dhama	c5d8fdf68c	message: Add function to check message access in bulk. This effectively just calls access_message in a loop.	2018-07-28 15:12:55 -07:00
Shubham Dhama	605a6ed3d8	message: Extract function for message access by a user.	2018-07-28 14:58:36 -07:00
Shubham Dhama	9cf67a5e71	models: Add function to get UserMessage from message id.	2018-07-28 14:58:36 -07:00
Tim Abbott	bf9012a559	message: Do future-proofing for public streams with private history. This should have no effect for now, but it'll make things a bit simpler in case we make future changes to support public streams without history public to subscribers (and other organization members).	2018-05-28 12:25:57 -07:00
Shubham Dhama	b410aeb897	message: Limit message access for guest users. Tweaked by tabbott to interact correctly with the history_public_to_subscribers feature.	2018-05-28 12:20:32 -07:00
jkiely	104fdd8bf9	mypy: enable strict optional for lib/message. Add assert in order to pass checks under strict conditions.	2018-05-17 11:12:08 -07:00
Steve Howell	4332fd64f7	Add submessages to message payloads.	2018-05-16 15:13:33 -07:00
Aditya Bansal	a68376e2ba	zerver/lib: Change use of typing.Text to str.	2018-05-12 15:22:39 -07:00
Tim Abbott	53e47e6991	messages: Modify access_message for is_history_public_to_subscribers. This completes the Message side of #2745.	2018-04-04 16:18:47 -07:00
Nikhil Kumar Mishra	3b4ff4f75c	markdown: Clean render_markdown. Remove Message is None and not None case as the render_message_backend passes a fake Message object.	2018-03-19 10:36:26 -07:00
neiljp (Neil Pilgrim)	9e1dbde82d	mypy: Final small migrations to python3.5 annotations in many files.	2018-03-12 11:23:30 -07:00
Vishnu Ks	a11b742984	messages: Calculate value of first visible message ID using cron job. [greg: Fixed buggy time conversion in estimate_recent_messages.]	2018-01-24 17:15:08 -08:00
Vishnu Ks	036dc53d20	messages: Rename last_visible_message_id to first_visible_message_id.	2018-01-22 19:53:44 -08:00
Vishnu Ks	e6d3f8895f	messages: Create function to calculate first visible message id.	2018-01-04 08:52:40 -05:00
Vishnu Ks	277d463d23	messages: Add option to limit users message history.	2018-01-02 13:45:01 -05:00
rht	3f4bf2d22f	zerver/lib: Use python 3 syntax for typing. Extracted from a larger commit by tabbott because these changes will not create significant merge conflicts.	2017-11-21 20:56:40 -08:00
Steve Howell	f54146b6d8	mypy: Add RawUnreadMessagesResult type.	2017-11-10 10:50:57 -08:00
Steve Howell	c799ec0ca6	minor: Inline get_unread_message_ids_per_recipient(). This method was only used in one test.	2017-11-10 10:50:57 -08:00
Steve Howell	8eaeba8615	refactor: Remove dict_with_str_keys(). This function is no longer needed in python3, as it was decoding keys that already came for a JSON-decoded string.	2017-11-09 10:43:20 -08:00
Steve Howell	63024b3a6f	Use encode() in zerver/lib/message.py.	2017-11-09 10:38:58 -08:00
rht	5ee40bf718	Remove usage of six.moves.binary_type.	2017-11-09 10:00:00 -08:00
Harshit Bansal	65838bb825	email_gateway: Disable code block processor for email gateway. Generally emails are not written with markdown in mind and hence sometimes render in strange ways. This commit fixes a particular issue that was causing whitespace before paragraphs to be treated as code block due to which email content was being rendered in a box that scrolls in right direction a lot. Fixes: #7045.	2017-11-09 09:56:35 -08:00
Steve Howell	ae0b27a7ed	Extract messages_for_ids.	2017-11-07 17:48:27 -08:00
Steve Howell	88e1e284bb	Restructure send-message code for gravatars. This refactoring doesn't change behavior, but it sets us up to more easily handle a register setting for `client_gravatar`, which will allow clients to tell us they're going to compute their own gravatar URLs. The `client_gravatar` flag already exists in our code, but it is only used for Django views (users/messages) but not for Zulip events. The main change is to move the call to `set_sender_avatar` into `finalize_payload`, which adds the boolean `client_gravatar` parameter to that function. And then we update various callers to supply that flag. One small performance benefit of this change is that we now lazily compute the client message payloads in `event_queue.process_message_event` now, so this will improve performance if all interested clients have the same value of `apply_markdown`. But the change here is really preparing us for the additional boolean parameter, which will cause us to have four variations of the payload.	2017-11-07 10:36:02 -08:00
rht	e311842a1b	zerver/lib: Remove inheritance from object.	2017-11-06 08:53:48 -08:00
neiljp (Neil Pilgrim)	fed757452c	mypy: Clarify type of lookup_dict in aggregate_message_dict.	2017-11-04 19:47:44 -07:00
rht	fef7d6ba09	zerver/lib: Remove u prefix from strings. License: Apache-2.0 Signed-off-by: rht <rhtbot@protonmail.com>	2017-11-03 15:34:37 -07:00
Umair Khan	636046aec9	user-groups: Add basic backend for UserGroup model. This adds the data model and bugdown support for the new UserGroup mention feature. Before it'll be fully operational, we'll still need: * A backend API for making these. * A UI for interacting with that API. * Typeahead on the frontend. * CSS to make them look pretty and see who's in them.	2017-10-31 15:16:14 -07:00
Steve Howell	b3192d17ab	refactor: Extract get_stream_subscriptions_for_user().	2017-10-29 18:36:35 -07:00
Steve Howell	8ac26dfb9b	refactor: Introduce bugdown.MentionData class. We now have a MentionData class that encapsulates the users who are possibly mentioned in a message. Not that the rendering code may not keep all the mentions, since things like backticks will suppress the mention. We populate this now in do_send_messages, so that we can use the info earlier in the message-sending process. This info now gets passed down the call stack as an optional parameter. Note that bugdown.convert() still populates the data when its callers decline to pass in a MentionData object. This is mostly a preparatory commit, as we don't take advantage of the data yet in do_send_messages.	2017-10-26 22:16:47 -07:00
Tim Abbott	8e2cdedf9a	lint: Fix lines in Python codebase longer than 120 characters.	2017-10-26 17:47:30 -07:00
Steve Howell	635675fe48	Reduce queries needed for sending messages. In do_send_messages, we only produce one dictionary for the event queues, instead of different flavors for text vs. html. This prevents two unnecessary queries to the database. It also means we only put one dictionary on the "message" event queue instead of two, albeit a wider one that has some values that won't be sent to the actual clients. This wider dictionary from MessageDict.wide_dict is also used for the `feedback_messages` queue and service bot queues. Since the extra fields are possibly useful down the road, and they'll just be ignored for now, we don't bother to remove them. Also, those queue processors won't have access to `content_type`, which they shouldn't need. Fixes #6947	2017-10-26 16:35:28 -07:00
Steve Howell	9b6a4d0b16	refactor: Extract MessageDict.finalize_payload().	2017-10-26 16:35:28 -07:00
Steve Howell	df93a99b50	Cache only one row per message. Before this change, we populated two cache entries for each message that we sent. The entries were largely redundant, with the only difference being whether we sent the content as raw markdown or as the rendered HTML. This commit makes it so we only have one cache entry per message, and it includes both content and rendered_content. One legacy source on confusion here is that `content` changes meaning when you're on the front end. Here is the situation going forward: database: content = raw rendered_contented = rendered cache entry: content = raw rendered_contented = rendered payload for the frontend: content = raw (for apply_markdown=False) content = rendered (for apply_markdown=True)	2017-10-26 16:35:28 -07:00
Steve Howell	0e106a2488	Add client_gratavar support to GET /messages. Clients fetching messages can now specify that they are able to compute their avatar, and if they set client_gratavar to True in the request (w/our normal encoding scheme), then the backend will not compute it, and the payload will be smaller. The fix starts with get_messages_backend. The flag gets passed down through these functions: * MessageDict.post_process_dicts. * MessageDict.set_sender_avatar. We also fix up the callers for post_process_dicts to explicitly pass in the client_gravatar path, but for now they all just hard code the value to False.	2017-10-20 15:49:21 -07:00
Steve Howell	6fbaf7e80f	Remove sender-related fields from message cache. This change makes the cache entries smaller for message dictionaries. It also ensures we get valid data put into message dictionaries if, for example, the sender's avatar changes. After this change, all of the attributes for a message sender are only fetched during post-processing with two exceptions: * We get sender_id for "free" from the message, and it's the primary key that we need to figure out which data to fetch in post-processing. * We need sender_realm_id to be able to cache topic links, and a sender's realm id will never change, so it's not a concern for invalidating cache rows. All the other attributes are either likely to change (e.g. sender avatar_version) and/or impact the size of cache entries more severely than the two small id fields above. This change should improve our overall system performance by reducing the amount of memory used by every N message rows we cache, and typically N will be in the thousands or so on a large realm. The other major implication of this change is that when a user changes their avatar, and then later messages that the user sent are fetched, all of the fields that go into computing the avatar url will be pulled from the database, not from cache.	2017-10-16 23:37:10 -07:00
Steve Howell	d909355dc2	refactor: Move methods from models.py -> lib/messages.py. Message.get_raw_db_rows is moved to MessageDict, since its implementation details are highly coupled to other methods in MessageDict. And then sew_messages_and_reactions comes along for the ride. We eventually want to move Reaction.get_raw_db_rows to there as well.	2017-10-16 23:37:10 -07:00
Steve Howell	4919eb4abd	Extract MessageDict.set_sender_avatar(). We now populate the avatar url as part of the post processing step of building message dictionaries, so that the avatar url is no longer in cache. This change makes the cache slimmer, because instead of caching the avatar url (which often includes a long hash), we just cache the smaller fields that are used to compute the url. Note that this commit still has the problem that we're essentially computing the avatar url from cached fields that can be invalid. We will address that a few commits later. An immediate benefit of this change is that how we compute avatar urls (or whether we compute them all) is now decoupled from caching concerns. We will address this later as well. (Some clients will be capable of computing their own gravatar urls, for example.)	2017-10-16 23:37:10 -07:00
Steve Howell	3c6cc3d454	Defer deleting intermediate values in message dictionaries. We're about to have multiple post-processing stages for building message dictionaries. Rather than having individual "hydration" methods remove intermediate values, we just wait until the end. This decouples the hyrdration steps. The potentional problem here is that we may have a field like sender_is_mirror_dummy that isn't part of the final payload, but we need it for calculating display recipients and avatars. We don't want to delete it too early from the objects.	2017-10-16 23:37:10 -07:00
Steve Howell	b0e844c676	refactor: Use get_avatar_field in message.py. This is part of deprecating avatar_url_from_dict and eventually supporting the client_gravatar field in message-related requests from clients.	2017-10-12 14:00:41 -07:00
Steve Howell	fed972d1fb	Fix bug with applying message events to unread counts. The `is_mentioned` flag in message events was buggy. We now look directly at flags. We will kill off `is_mentioned` in a subsequent commit. We also remove some debugging code in the test that was failing before this fix. The test would only fail when `is_mentioned` was wrong, which never happened when you ran a single test, and which would happen randomly when you ran multiple tests.	2017-10-11 16:55:34 -07:00
Steve Howell	7c726a5e77	Remove sender names from the message cache. This removes sender names from the message cache, since they aren't guaranteed to be valid, and they're inexpensive to add. This commit will make the message cache entries smaller by removing sender___full_name and sender__short_name fields. Then we add in the sender fields to the message payloads by doing a query against the unique sender ids of the messages we are processing. This change leads to 2 extra database hops for most of our message-related codepaths. The reason there are 2 hops instead of 1 is that we basically re-calculate way too much data to get a no-markdown dictionary.	2017-10-11 11:37:16 -07:00
Steve Howell	3910448b1d	Extract MessageDict.post_process_dicts(). Introduce MessageDict.post_process_dicts() will allow us the ability to do the following: * use less memory in the cache for repeated data * prevent cache invalidation * format data according to different client needs The first use of this function is pretty inconsequential, but it sets us up for more consequential changes. In this commit we defer the MessageDict.hydrate_recipient_info step until after we pull data out of the cache. This impacts cache size as follows: * streams - negligibly bigger * PMs/huddles - slimmer due to not needing to repeat sender data like email/full_name Again, the main point of this change is to start setting up the infrastructure to do post-processing.	2017-10-11 11:37:16 -07:00
Steve Howell	6bf43e6332	refactor: Extract MessageDict.hydrate_recipient_info(). This is a first step to eventually slimming the message cache, but there are still some moving parts there to be worked through. The more immediate benefit of extracting this function is that we can put tests on it. Also, it isolates some functionality that may go away as our clients gets smarter.	2017-10-11 11:37:16 -07:00
Steve Howell	d6e21b5ca9	Collect sender_ids (by topic) in `unread_msgs`. This will allow the mobile app to say "A, B, and C are talking" in the topic views.	2017-10-05 10:37:15 -07:00
Steve Howell	e56084fcf7	Simplify how we apply events for unread messages. The logic to apply events to page_params['unread_msgs'] was complicated due to the aggregated data structures that we pass down to the client. Now we defer the aggregation logic until after we apply the events. This leads to some simplifications in that codepath, as well as some performance enhancements. The intermediate data structure has sets and dictionaries that generally are keyed by message_id, so most message-related updates are O(1) in nature. Also, by waiting to compute the counts until the end, it's a bit less messy to try to keep track of increments/decrements. Instead, we just update the dictionaries and sets during the event-apply phase. This change also fixes some corner cases: * We now respect mutes when updating counts. * For message updates, instead of bluntly updating the whole topic bucket, we update individual message ids. Unfortunately, this change doesn't seem to address the pesky test that fails sporadically on Travis, related to mention updates. It will change the symptom, slightly, though.	2017-10-05 09:42:20 -07:00
Steve Howell	f55b22e937	Add get_muted_stream_ids(). This function replaces get_muted_recipient_ids(). This will set us up to apply events more easily.	2017-10-05 09:32:16 -07:00
Steve Howell	941b1c781c	Refactor get_unread_message_ids_per_recipient(). We now have two helper functions: * get_raw_unread_data * aggregate_unread_data Separating the concerns is nice. The first function does all the data collection. The second function should be fast, and it only re-organizes the data into an aggregated form that makes the page_params payload smaller and easier for clients to work with. For the first function, we try to return data structures that are easier to manipulate than the end result. This will allow us to apply events more easily, in a subsequent commit.	2017-10-05 09:32:16 -07:00
rht	f43e54d352	zerver/lib: Remove absolute_import.	2017-09-27 10:00:39 -07:00
neiljp (Neil Pilgrim)	133d679feb	mypy: Avoid Message.is_status_message if rendered_content is None.	2017-09-25 16:02:56 -07:00
Steve Howell	ba397b5109	Use user_ids, not full objects, in render path. There is no reason for either render_incoming_message() or render_markdown() to require full UserProfile objects just to triage alert words. By only asking for user_ids, we save extra queries in two callpaths and we make it easier to start using user_ids in do_send_messages().	2017-09-12 04:22:55 -07:00
Steve Howell	848c0803bd	Exclude muted topics from unread count.	2017-09-07 07:06:03 -07:00
Steve Howell	0106add546	mypy: Use TypedDict for UnreadMessageResult.	2017-08-28 14:48:19 -07:00
Tim Abbott	133f005530	markdown: Remove is_me_message UserMessage flags. This never made sense to be a flag on the UserMessage table, since it's not per-user state. And in fact it doesn't need to be in a database at all, since it's easily computed from content anyway. Fixes #1099.	2017-08-27 09:34:24 -07:00
Tim Abbott	1bb09e35d2	message: Add assertions for invalid recipient types.	2017-08-25 00:39:36 -07:00
Steve Howell	ead40d8d08	Exclude muted streams from page_params.unread_msgs.count. This adds one fairly cheap query, and gets the bankruptcy count more in the ballpark of the home unread count. (But we don't account for topics yet.)	2017-08-23 17:39:22 -07:00

1 2 3 4

198 Commits