zulip

Commit Graph

Author	SHA1	Message	Date
Aman Agrawal	961d1d0a68	community_topic_edit: Increase time limit to 3 days. 24hrs is a small time in an asynchronous conversation. Increased time limit of topic editing for non-admins to 3 days.	2021-01-25 14:55:33 -08:00
Anders Kaseorg	a873a68cc7	eslint: Fix unicorn/no-new-array. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-01-25 14:53:19 -08:00
akshatdalton	5f8a10124e	url preview: Update Zulip User-Agent. This commit updates the Zulip User-Agent to 'Mozilla/5.0 (compatible; ZulipURLPreview/{version}; +{external_host})' as the older User-Agent was rendering Markdown YouTube titles as 'YouTube - YouTube'. Fixes #16970.	2021-01-25 14:24:48 -08:00
Alex Vandiver	3381fad258	registration: Stop enqueueing to the signups queue. `c2526844e9` removed the `signups` queue worker, and the command-line tool that enqueues to it -- but not the automated process that enqueues during signups itself. Remove the signup, since it is no longer in use.	2021-01-24 09:42:55 -08:00
Suyash Vardhan Mathur	f4cf5166bb	api docs: Display data type of parameters in API documentation. Previously, the data type of parameters wasn't displayed in the API Documentation, even though that OpenAPI data is carefully validated against the implementation. Here we add a recursive function to render the data types visibly in the API documentation. This only covers the request parameters; we'll want to do something similar for response parameters in a follow-up PR. Fixes part of #15967.	2021-01-21 15:56:07 -08:00
Steve Howell	1498b2ef69	apply_event: Fix broken deepcopy attempt for subs. When we were getting an apply_event call for a subscription/add event, we were trying not to mutate the event itself, but this clumsy code was still mutating the actual event: # Avoid letting 'subscribers' entries end up in the list for i, sub in enumerate(event['subscriptions']): event['subscriptions'][i] = \ copy.deepcopy(event['subscriptions'][i]) del event['subscriptions'][i]['subscribers'] This is only a theoretical bug. The only person who receives a subscription/add event is the current user. And it wouldn't have affected the current user, since the apply_event was correctly updating the state, and we wouldn't actually deliver the event to the client (because the whole point of apply_event is to prevent us from having to piggyback the super-recent events on to our payload or put them into the event queue and possibly race). The new code just cleanly makes a copy of each sub, if necessary, as we add them to state["subscriptions"]. And I updated the event schemas to reflect that subscribers is always present in subscription/add event. Long term we should probably avoid sending subscribers on this event when the clients don't set something like include_subscribers. That's a fairly complicated fix that involves passing in flags to ClientDescriptor. Alternatively, we could just say that our policy is that we never send subscribers there, but we instead use peer_add events. See issue #17089 for more details.	2021-01-21 15:04:07 -08:00
Steve Howell	c6acde9c63	apply_event: Use stream_ids, not names, for add/remove. It's always cleaner to work in id space. It probably would have required a perfect storm to have broken the existing code, but using ids is obviously more robust in theory, and just as simple.	2021-01-21 15:04:07 -08:00
Steve Howell	0519f2d2b9	minor: Move include_subscribers guards in apply_event. This sets us up for a cleaner diff in an upcoming commit.	2021-01-21 15:04:07 -08:00
Steve Howell	3fa595ef85	minor: Clean up args for apply_event. We now require keywords, so that there is no pitfall for mixing up boolean parameters. Positional parameters are basically evil when you have a bunch of bools. I also make user_profile the first argument. Finally, the code is more diff-friendly.	2021-01-21 15:04:07 -08:00
Steve Howell	e42baf9e13	minor: Clean up args for apply_events. I eliminate the defaults, since the existing code was already specificying values for most things. I move all the booleans to the bottom for both parameters and arguments. I require explicit keywords for everything but user_profile (which is now first). And, finally, I format the code in a more diff-friendly manner.	2021-01-21 15:04:07 -08:00
Steve Howell	f2586d2f9b	refactor: Introduce SubscriptionInfo dataclass. We use this as the return type for gather_subscriptions_helper and get_web_public_subs, instead of tuples.	2021-01-21 15:04:07 -08:00
Steve Howell	768117f0ff	refactor: Unify include_subscribers logic.	2021-01-21 15:04:07 -08:00
Steve Howell	e735ce3f01	refactor: Move subscribers logic up to caller. The gather_subscriptions_helper function now updates subscribers instead of delegating.	2021-01-21 15:04:07 -08:00
Steve Howell	d9740045a5	refactor: Eliminate checks in build_stream_dict_for_sub. We eliminate some redundant checks. We also consistently provide a `subscribers` field in our stream data with `[]`, even if our users can't access subscribers. We therefore bump the API version and tweak the docs. (See further down for a detailed justification of the change.) Even though it is sometimes fine to have redundant code that is defensive in nature, some upcoming changes are gonna move subscriber-related logic out of build_stream_dict_for_sub for certain codepaths as part of our effort to streamline the payload for subscribers within page_params. So we can't rely on the code that I removed here inside of build_stream_dict_for_sub. Anyway, it makes more sense to do these checks explicitly in the validate function. The code in build_stream_dict_for_sub was almost effectively a noop, since the validation function was already preventing us from getting subscriber info. The only difference it made was sometimes converting `[]` to `None`, and then subsequently omitting the subscribers field. Neither ZT nor the webapp make any distinction between `[]` or <missing key> for the `subscribers` data in `page_params`. The webapp has had this code for a long time (and now equivalent code elsewhere in this PR): if (!Object.prototype.hasOwnProperty.call(sub, "subscribers")) { sub.subscribers = new LazySet([]); } The webapp calculates access based on booleans, anyway: sub.can_access_subscribers = page_params.is_admin \|\| sub.subscribed \|\| (!page_params.is_guest && !sub.invite_only); And ZT would choke if `subscribers` were missing, except that it never gets to the relevant code due to other checks: def get_other_subscribers_in_stream(<snip>): assert stream_id is not None or stream_name is not None if stream_id: assert self.is_user_subscribed_to_stream(stream_id) return [sub for sub in self.stream_dict[stream_id]['subscribers'] if sub != self.user_id] else: return [sub for _, stream in self.stream_dict.items() for sub in stream['subscribers'] if stream['name'] == stream_name if sub != self.user_id] You could make a semantic argument that we should prefer <missing key> to `[]` when subscribers aren't even available, but we have precedent from the way that `bulk_get_subscriber_user_ids` has traditionally populated its result: result: Dict[int, List[int]] = {stream["id"]: [] for stream in stream_dicts} If we changed `stream_dicts` to `target_stream_dicts` we would faciliate a move toward `None`, but it would just cause headaches for other server code as well as the frontends (which, to reiterate, already prefer the empty array for convenience).	2021-01-21 15:04:07 -08:00
Steve Howell	40b0c36d21	minor: Update comment for guest subscription access. As my comment indicates, I would prefer to handle this explicitly by raising JsonableError in an else statement here, but it's not a big deal. This function can probably be simplified with a bit of work, mostly on the testing side to make sure we are covering all edge cases, but that is out of the scope of my current PR.	2021-01-21 15:04:07 -08:00
Mateusz Mandera	fcc8debc3a	users: Use realm.host in dummy user addresses without email visibility. By moving the relevant logic from realm.get_bot_domain to get_fake_email_domain we will make realm.host be used (if possible) for dummy user addresses. That is, instead of user11@zulipchat.com, the address will become user11@subdomain.zulipchat.com.	2021-01-21 13:04:38 -08:00
Mateusz Mandera	2283aa8a62	bots: Use realm.host for bot email domain if possible. With the change in `d70e1bcdb7`, bots get email like bot@zulip.com with EXTERNAL_HOST="zulip.com", rather than bot@subdomain.zulip.com, which was the old format. That's not desirable, so with this commit, realm.host will be used when possible and only falling back to FAKE_EMAIL_DOMAIN if needed.	2021-01-21 13:04:38 -08:00
Mateusz Mandera	b15dd9147d	create_user: Remove redundant argument of get_display_email_address.	2021-01-21 13:04:38 -08:00
Steve Howell	c693ae8982	event tests: Cover do_update_user_status better. We often send only one field (away or status_text) to be updated. So we have to make our schema support optional keys. As a result of the more flexible schema, we no longer need to exempt the node fixtures from our schema checks.	2021-01-20 13:17:32 -08:00
Steve Howell	36b1794c1d	user_status: Fix bug with resetting away status. The fix is pretty simple here--if the client doesn't send an away status, then don't change it. I improved the tests to cover this case. Fixes #17071	2021-01-20 13:59:35 -05:00
Mateusz Mandera	a9242d6dfc	retention: Eliminate redundant recipient JOIN from cross-realm query. Since recipient_id (id of the PERSONAL Recipient of the user) was denormalized into the UserProfile model, this query can be simplified by getting rid of the zerver_recipient JOIN.	2021-01-18 21:40:37 -08:00
Mateusz Mandera	e3be6db73a	retention: Eliminate redundant userprofile JOIN from cross-realm query.	2021-01-18 21:40:37 -08:00
Tim Abbott	5a02b33f2e	digest: Add a large block comment on correctness.	2021-01-17 11:37:59 -08:00
Steve Howell	1040fb7219	email digests: Remove handle_digest_email shim. The previous commit made it so we only call the shim in tests, so now we completely remove it.	2021-01-17 11:28:30 -08:00
Steve Howell	bfa0bdf3d6	email digests: Process users in chunks of 30. This should make the queue empty more quickly, because we do bulk queries to prevent database hops.	2021-01-17 11:28:30 -08:00
Steve Howell	e0b451730a	email digests: Extract get_new_streams. This makes us more efficient when handling multiple users. We don't have to keep sending the same two queries to the database. Note that as part of this we eliminated a failure mode for the obscure population of users from whom both `user.is_guest` and `user.can_access_public_streams()` returns False. We know this would have only affected Zephyr users (by looking at the code), and we know we don't actually process Zephyr users for email digests (or else we would have raised exceptions in the old code).	2021-01-17 11:28:30 -08:00
Steve Howell	23de94504f	email digests: Query streams for messages up front. This should save us many hops to the database when we process users in bulk.	2021-01-17 11:28:30 -08:00
Steve Howell	3662bf2dcb	minor: Rename stream_map -> user_stream_map.	2021-01-17 11:28:30 -08:00
Steve Howell	11c93aced5	minor: Rename user_profile -> user and avoid shadowing.	2021-01-17 11:28:30 -08:00
Steve Howell	f8bbb7fea9	email digests: Use select_related("realm"). We mostly need realm_id, but when we go to build message lists, we need realm.uri. We could probably be more aggresive about using `only` here, but for now I am just trying to reduce hops to the database.	2021-01-17 11:28:29 -08:00
Steve Howell	bb56f0ec0e	minor: Move get_stream_map to module level. This is a pure code move.	2021-01-17 11:28:29 -08:00
Steve Howell	52e2d5a733	email digests: Avoid long_term_idle check. We want to exclude users with recent subscription activity from emails, regardless of whether the long_term_idle flag is set.	2021-01-17 11:28:29 -08:00
Steve Howell	162b372b93	email digests: Do one query for recent streams. This is another way to limit hops to the database when we process users in bulk.	2021-01-17 11:28:29 -08:00
Alex Vandiver	c2526844e9	worker: Remove SignupWorker and friends. ZULIP_FRIENDS_LIST_ID and MAILCHIMP_API_KEY are not currently used in production. This removes the unused 'signups' queue and worker.	2021-01-17 11:16:35 -08:00
Alex Vandiver	01658e39a9	sentry: Verify version is supported, first. Raven SDK does not send a `title` field.	2021-01-17 11:15:40 -08:00
Alex Vandiver	d688e18de2	errors: Remove references to "deployment", use "host". The `deployment` key was only set in `do_report_error`, which is now only used in one codepath (the queue worker). The logging handlers on staging call notify_server_error directly, which omits the `deployment` key. Remove the odd one-of key, and instead simply do dispatch in `do_report_error`.	2021-01-17 11:08:12 -08:00
Mateusz Mandera	3623681d30	message_edit: Don't rely on .recipient_id change not affecting recipient. The codepath for moving a topic changes the message.recipient_id to the id of the new recipient, but later, in update_messages_for_topic_edit, it uses message.recipient when querying for messages with the matching topic in the old stream (because those are the other messages that need to be moved). This is a bug which happens to work fine, because in Django 2, if message.recipient gets fetched first and then message.recipient_id is mutated, message.recipient will not be altered and thus will retain the outdated, previously fetched value. In Django 3 changing .recipient_id causes .recipient to be updated to the new Recipient objects, which is the Recipient of the new stream. That will cause the bug to manifest. This is a bugfix preparing for the upgrade to Django 3.	2021-01-17 10:39:46 -08:00
Mateusz Mandera	f76202dd59	django3: Save language preference in a cookie rather than the session. Support for saving it in the session is dropped in django3, the cookie is the mechanism that needs to be used. The relevant i18n code doesn't have access to the response objects and thus needs to delegate setting the cookie to LocaleMiddleware. Fixes the LocaleMiddleware point of #16030.	2021-01-17 10:38:58 -08:00
Steve Howell	04b6108e71	minor: Require keywords for verify_action.	2021-01-17 12:31:04 -05:00
Steve Howell	3df507be73	refactor: Clean up args for fetch_initial_state_data. We now require explicit keywords for all arguments to fetch_initial_state_data except user_profile. We provide reasonable defaults to keep the test code concise.	2021-01-17 12:31:04 -05:00
Alex Vandiver	08d716c741	registration: Re-use the redirect_to_email_login_url helper. In the case of reusing a registration link, reuse the redirect_to_email_login_url helper. This does have the side effect of now showing a "you've already registered" note, which did not happen previously, but that seems probably for the best, since the user did just click a "register" link.	2021-01-13 11:28:32 -08:00
Alex Vandiver	ad3d25103b	registration: Pre-fill the email when redirecting to login. `ecfafc05c0` shifted to using a different paramter name to hint that the user had previously signed up -- and in so doing also stopped pre-filling the "email" box. Also send along the email box, to save users time.	2021-01-13 11:28:32 -08:00
Tushar912	c60f48c889	registration: Move "already in realm" check outside of validation. Checking for `validate_email_not_already_in_realm` again (after the form already did so), but only in the case that the form fails to validate, means that we may be spending time pushing totally invalid emails to the DB to check. In the case of emails containing nulls, this can even trigger a 500 error from PostgreSQL. Stop calling `validate_email_not_already_in_realm` in the form validation. The form is currently only used in two places -- in `accounts_home` and in `maybe_send_to_registration`. The latter is only called if the address is known to not currently have an account, so checking in there is unnecessary; and in the former case, we wish different behaviour (the redirect) than just validation failure, which is all the validator can do. Fixes #17015. Co-authored-by: Alex Vandiver <alexmv@zulip.com>	2021-01-13 11:28:32 -08:00
Tushar912	410bb8ad89	imports: Add better checking for subdomains. Add a `--allow-reserved-subdomain` flag which allows creation of reserved keyword domains. This also always enforces that the domain is not in use, which was removed in `0258d7d`. Fixes #16924.	2021-01-12 17:54:01 -08:00
sushant52	6f0e8a9888	auth: Handle the case of invalid subdomain at various points. Fixes #16770.	2021-01-11 22:29:50 -08:00
Siddharth Asthana	6c888977a6	change_subdomain: Create a deactivated realm on updating subdomain. When changing the subdomain of a realm, create a deactivated realm with the old subdomain of the realm, and set its deactivated_redirect to the new subdomain. Doing this will help us to do the following: - When a user visits the old subdomain of a realm, we can tell the user that the realm has been moved. - During the registration process, we can assure that the old subdomain of the realm is not used to create a new realm. If the subdomain is changed multiple times, the deactivated_redirect fields of all the deactivated realms are updated to point to the new uri.	2021-01-07 14:15:22 -08:00
Aman Agrawal	e566e985e4	topic_edit: Store edit history in all the message affected. Instead of just storing the edit history in the message which triggered the topic edit, we store the edit history in all the messages that changed. This helps users track the edit history of a message more reliably.	2021-01-04 18:18:05 -08:00
cozyrohan	16d1ab3d5f	webhooks/github: Fix repeating description for edits and updates. This change updates the GitHub Integration webhook get_opened_or_update_pull_request_body method so that the description is only printed if it actually changes. If the update event is a result of some other attribute update, such as an asignee change, then the description is not included in the message sent to the zulip stream. Fixes #16345	2021-01-04 14:34:17 -08:00
Aman Agrawal	c685d36821	hipchat_import: Remove tool from codebase. Remove functions and scripts used by HipChat import tool and those which will no longer be required in future.	2020-12-23 08:28:49 -08:00
Aman Agrawal	62d721e859	docs: Remove HipChat migration guide. As of Feb 15th 2019, Hipchat Cloud and Stride have reached End Of Life and are no longer supported by Atlassian. Since it is almost 2 years now we can remove the migration guides.	2020-12-23 15:43:13 +05:30
Vishnu KS	9fe39646fa	analytics: Specify exact end_time in realm summary query. Fetchings rows with end_time within the last 25 hours would result in the realmcount queries returning two rows for each realm if the analytics page was opened within an hour since the count stats were updated.	2020-12-22 16:44:31 -08:00
Mateusz Mandera	160cc5120a	api: Require can_create_users permission to create users via API. Allowing any admins to create arbitrary users is not ideal because it can lead to abuse issues. We should require something stronger that requires the server operator's approval and thus we add a new can_create_users permission.	2020-12-21 13:20:21 -08:00
Mateusz Mandera	c9b6d8ddad	models: Remove redundant Meta.permissions on Realm model. This is dead code leftover from the old way of handling admin permissions.	2020-12-21 13:15:40 -08:00
Mateusz Mandera	d0dc04a093	models: Rename is_api_super_user to can_forge_sender,	2020-12-21 13:15:39 -08:00
sahil839	2fa33be683	actions: Refactor check_message to change return dataclass instead of Dict. We change the return type of check_message to be dataclass instead of Dict[str, Any]. This refactoring helps us to understand the context of the data structure returned by check_message clearly which was not possible when using Dict. SendMessageRequest class is added in zerver/lib/message.py inspite of it not being used in that file itself just to maintain consistency as other TypedDicts and dataclasses are defined in that file and to avoid circular dependency as SendMessageRequest is being used in lib/widget.py as well. We also rename local variable to 'send_request' for accessing SendMessageRequest objects.	2020-12-21 12:55:30 -08:00
Tim Abbott	908025bdad	runtornado: Avoid providing a URL for Tornado on startup. The {addr} part isn't directly useful, since connections to Tornado are done on localhost anyway, and made the development environment output a bit more confusing. Also, use the same phrasing for restarts we use for Django.	2020-12-20 12:27:51 -08:00
Tim Abbott	1f036f9bde	tornado: Reduce logging of event queue load/dump. This logging is really only potentially interesting in a development environment when the numbers are nonzero. In production, it seems worth logging for consistency reasons. Probably we'll eventually redo this block by change the log level, but this is good enough to despam the development environment startup output.	2020-12-20 12:14:39 -08:00
Anders Kaseorg	a054f57af6	message: Bundle message stripping, validation, and truncation. We always want to do these at the same time. Previously, message editing did too much stripping (fixes #16837) and failed to check for NUL bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-18 17:44:13 -08:00
sahil839	37c8505435	message: Raise exception when trying to mirror an already sent message. Previously we were just returning a dict containing a message id when trying to mirror a already sent message in 'zephyr_mirror' cases. This commit changes this behaviour to raise an exception when trying to mirror an already sent message by adding a new exception class ZephyrMessageAlreadySentException and then the caller returns the message_id directly, instead of calling do_send_messages which also returns a list of size one containing the message_id only. This is a prep commit for changing the return type of check_message to be a dataclass instead of a Dict as now we have only single output for check_message.	2020-12-18 16:40:11 -08:00
sahil839	4e99ec34a9	widget: Use different variable names for message and submessage content. This commit renames the content variable in do_widget_post_save_actions to message_content and is a prep commit for changing the return type of check_message from Dict to dataclass. This change is required because content variable is used two times in this function - one for message content and other for submessage content, so when we change the return type of check_message to dataclass, the type of content variable is considered as str and then when dict is assigned to content in the submessage case, mypy raises 'Incompatible types in assignment' error. This issue is not faced before the dataclass migration because there is no type checking for the values of dict returned by check_message as the return type of check_message is 'Dict[str, Any]'.	2020-12-18 16:19:35 -08:00
sahil839	db85b8a236	actions: Change type of wildcard_mention_user_ids in message_dict to set. The message_dict['wildcard_mention_user_ids'] should be empty set instead of empty list when there are no wildcard mentions similar to the case when there are wildcard mentions, where it is equal to set of user ids and not list of user ids.	2020-12-18 16:17:26 -08:00
Anders Kaseorg	6b8f4782c4	test_mattermost_importer: Fix test for admins-to-owners change. Commit `ed498e2f8e` forgot to update this test. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-17 18:59:08 -08:00
Tim Abbott	ed498e2f8e	import: Import mattermost admins as Zulip owners. Otherwise, we violate the invariant that all organizations have an owner.	2020-12-17 18:45:45 -08:00
Anders Kaseorg	2ab0b3d4fc	validator: Reject ISO 8601 dates missing leading zeros. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 16:36:50 -08:00
Max Zawisa	0e40cc72af	newrelic: Added owner field and cleaned up code. I reformatted the tests and view to include information about who acknowledged and closed the alert. Only includes the information about the owner if there was an owner. Made a few small changes to the refactored bit as requested in review.	2020-12-15 12:04:46 -08:00
Max Zawisa	57e847ab89	newrelic: refactor of time input handling. Moved time formatting check and conversion to zerver/lib/webhooks/common.py. Updated tests slightly to match new output. Removed duration from the calculation because the difference is less than the precision of output and it complicated the error handling.	2020-12-15 12:04:46 -08:00
Max Zawisa	ec00557962	docs: Updated New Relic documentation. The docs are updated to work with the new webhook and new process on https://one.newrelic.com.	2020-12-15 12:04:46 -08:00
Mateusz Mandera	b652cc786c	django3: Remove remaining postgresql_psycopg2 use. Removed in Django 3.0.	2020-12-15 11:52:32 -08:00
angela s	64becb20b5	logging: Set decorator tests to use assertLogs. Fixes part of #15331.	2020-12-15 11:46:25 -08:00
Alex Vandiver	438d2aa632	digests: Ensure that the teaser_data can be JSON-serialized. Leaving this as a set means that it fails in zerver.lib.send_email when serializing into a ScheduledEmail object.	2020-12-15 11:44:50 -08:00
Alex Vandiver	7c849fa940	slack: Check token access scopes before importing. The Slack API always (even for failed requests) puts the access scopes of the token passed in, into "X-OAuth-Scopes"[1], which can be used to determine if any are missing -- and if so, which. [1] https://api.slack.com/legacy/oauth-scopes#working-with-scopes	2020-12-15 11:33:15 -08:00
Anders Kaseorg	415897f491	api docs: Use normal async/await code in JavaScript examples. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 11:32:18 -08:00
Anders Kaseorg	bf45f921a7	url_preview: Allow Beautiful Soup to get the charset from <meta>. An HTML document sent without a charset in the Content-Type header needs to be scanned for a charset in <meta> tags. We need to pass bytes instead of str to Beautiful Soup to allow it to do this. Fixes #16843. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 11:30:57 -08:00
Siddharth Asthana	daac7536f3	accounts/deactivated: Show deactivated_redirect url if present If a user visits a realm which has been deactivated and it's deactivated_redirect field is set, we should have a message telling the user that the realm has moved to the deactivated_redirect url.	2020-12-14 21:04:52 -08:00
Siddharth Asthana	82f5759299	Realm: Add a deactivated_redirect URLField to Realm object. We export a realm's data, and disable the realm, because the user is moving from Zulip Cloud (e.g. https://example.zulipchat.com/) to self-hosting or another platform (e.g. https://zulip.example.com/) which we do not control. This commit adds a field in the realm object called deactivated_redirect to store the url to which the realm has moved.	2020-12-14 21:04:52 -08:00
Anders Kaseorg	2c5e9f65f8	eslint: Fix new-cap errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-10 19:52:22 -08:00
Puneeth Chaganti	5dc3489166	webhooks/sentry: Fix URL generated in transform_webhook_payload. The URL incorrectly had `event` in the URL path, instead of `events`. Closes #16783	2020-12-02 12:28:45 -08:00
Puneeth Chaganti	b7a08323aa	webhooks/sentry: Use received key when timestamp key is absent.	2020-12-02 12:28:45 -08:00
Sundar Guntnur	cbb7fb8ac0	anchor_value: Fix parsing of large anchor values. This handles the conditions when anchor values are larger than LARGER_THAN_MAX_MESSAGE_ID by clamping them down to it. Also added tests for the function parse_anchor_value. Fixes #16768.	2020-12-02 11:00:22 -08:00
Mateusz Mandera	43a0c60e96	exceptions: Make RateLimited into a subclass of JsonableError. This simplifies the code, as it allows using the mechanism of converting JsonableErrors into a response instead of having separate, but ultimately similar, logic in RateLimitMiddleware. We don't touch tests here because "rate limited" error responses are already verified in test_external.py.	2020-12-01 13:40:56 -08:00
Steve Howell	92ce2d0e31	events: Fix apply_event for streams. In `1bcb8d8ee8` I made it so the webapp doesn't include "streams" in its state from `fetch_initial_state_data`, but I didn't address all the places in apply_event.	2020-12-01 13:01:38 -08:00
Steve Howell	c566ecfb30	minor: Remove dead code in events test.	2020-12-01 13:01:38 -08:00
Vishnu KS	dabbc3445a	webhooks: Properly format the currency amount for refunds. By default all Stripe API amounts are in the currency's smallest unit. It's upto us to convert it to a bigger unit and show it to the end user. And refund event used to show the currency in the smallest unit which makes the output wrong when it comes to most currencies like USD, Europ, INR etc which uses a bigger unit(eg Dollar instead of Cents) as the standard.	2020-11-29 18:11:24 -08:00
Max Zawisa	f05a04e000	webhooks: Update NewRelic webhook for new format. Update the New Relic webhook and tests to match the format specified in the New Relic documentation. The new format sends a json body instead of using url parameters. The old format is no longer supported by New Relic according to their support staff; as a result, the fixtures for the old test cases were removed. Added fixtures for new test cases. Fixes: #16393.	2020-11-18 16:19:08 -08:00
Anders Kaseorg	13e35bfa94	mypy: Use sqlalchemy-stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Anders Kaseorg	8e0240300a	message_fetch: Skip intermediate mutation in limit_query_to_range. This avoids extra mypy annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Anders Kaseorg	d0d8c358b3	lint: Migrate typing.Text check to semgrep. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Steve Howell	99e725cbde	populate_db: Simplify how we create reactions. For 3000 messages and 400 users, this saved about 30 seconds. We only do two queries per batch of messages now, and the algorithm is easier to analyze, as it's just three nested loops.	2020-11-16 17:19:23 -08:00
Vishnu KS	5eb63ddb7a	webhooks: Handle dispute events with object IDs prefixed with du. Sometimes the dispute object IDs are prefixed with `du` instead of `dp`. https://freenode.logbot.info/stripe/20200605#c4059469 The correct long-term fix here would be to stop using object IDs to detect the object type of these events and instead maybe make use of "object" key instead. https://stripe.com/docs/api/disputes/object#dispute_object-object	2020-11-16 17:05:54 -08:00
Steve Howell	e2e0f06b2a	email digests: Call get_recent_topics once per batch. Once we start processing digests in batch, this will let us amortize the expense of the message query over multiple users.	2020-11-16 08:59:29 -08:00
Steve Howell	428f0564a0	minor: Move context code down in the function. This will make a subsequent diff a bit less noisy.	2020-11-16 08:59:29 -08:00
Steve Howell	1d1e45e9ec	digests: Use UserActivityInterval for user activity. Note that we are much more efficient about finding active users here: - we do one query per realm (instead of per-user) - we pass the cutoff date to the database - we get back just a list of distinct ids	2020-11-16 08:59:29 -08:00
Steve Howell	b52f56080e	performance: Just get user_ids to queue digest emails.	2020-11-16 08:59:29 -08:00
Steve Howell	e13e5d104d	refactor: Only require user_id for inactive_since(). This function is going away completely soon. It is querying everybody's entire UserActivity history instead of passing the cutoff date to the database!	2020-11-16 08:59:29 -08:00
Steve Howell	d0260392f7	digests: Get user objects from the database. The query counts increase here for somewhat contrived reasons. The tests before this commit reflected a successful trip to the UserProfile cache, but that's not actually realistic in practice.	2020-11-16 08:59:29 -08:00
Steve Howell	7737413cec	digest tests: Improve gather_new_streams test. We don't need to mock the dates here. We also explicitly clear out all streams first, and then we explicitly test with both the stream being current and the stream being old.	2020-11-16 08:59:28 -08:00
Steve Howell	9538edde06	digest tests: Simplify bots test. We can use the _enqueue_emails_for_realm helper to avoid all the Tuesday-related logic here. We also don't bother to create UserActivity records, since the bot gets excluded by virtue of its being a bot. (Also, the date ranges here were sketchy due to the time mocking.)	2020-11-16 08:59:28 -08:00
Steve Howell	0624833af6	digest tests: Improve Tuesday tests. If we're mocking time, we should do it consistently.	2020-11-16 08:59:28 -08:00
Steve Howell	2f4d7a6171	tests: Fix test_inactive_users_queued_for_digest. We can avoid all the date mocking now for all but a couple tests that exercise the is-it-Tuesday logic. And this test now correctly tests that we exclude recently active users. And this allows us to remove the other test.	2020-11-16 08:59:28 -08:00
Steve Howell	e49a482baf	email digests: Make transactions atomic.	2020-11-16 08:59:28 -08:00
Steve Howell	cf6bcfb84a	digest emails: Exclude users who had recent digests. This code protects us in case we ever need to re-run email digests twice in the same day.	2020-11-16 08:59:28 -08:00
Steve Howell	fb3d4c1618	digest tests: Avoid warnings about naive time.	2020-11-16 08:59:28 -08:00
Steve Howell	4271442fba	email digests: Write RealmAuditLog rows.	2020-11-16 08:59:28 -08:00
Mateusz Mandera	4f47f35cb4	auth: Handle the case of invalid subdomain at /fetch_api_key endpoint.	2020-11-13 16:43:17 -08:00
Anders Kaseorg	1275613812	requirements: Upgrade mypy to 0.790. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-12 15:44:30 -08:00
Anders Kaseorg	8ba95063d5	test_markdown: Construct FencedBlockPreprocessor with a real Markdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
Anders Kaseorg	e7e1fde6ec	fenced_code: Use immutable type for codehilite_conf. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
Anders Kaseorg	fbf8ce0305	markdown: Add types for extra Markdown members. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	b48bdc65b9	markdown: Fix AlertWordNotificationProcessor.run type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	9573f6dc00	markdown: Fix build_block_parser type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	4398eecd2b	markdown: Use immutable type for extension config. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	060036dfd5	markdown: Merge build_engine into Markdown constructor. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	08c64f5cfa	markdown: Fix imports for compatibility with typeshed stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	2a8a59f548	test_queue_worker: Simplify worker_queue_names computation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:46:04 -08:00
Anders Kaseorg	dc84e9696c	mypy: Fix types for redis. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:46:04 -08:00
Anders Kaseorg	3a8cf869db	python: Convert os.open(…, O_EXCL) to open(…, "x"). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-09 14:31:01 -08:00
Puneeth Chaganti	358f1f9ba7	webhooks/sentry: Support integration configured as webhook. Sentry allows adding simple webhooks without going through the process of creating an Internal Integration in Sentry's Integration Platform[1] (which our docs recommend). The payload from sent from such a (simple) webhook integration is slightly different from the payload sent by an Internal Integration webhook. This commit tries to wrangle this payload into a form that is usable by our webhook handler to send a notification message. [1]: https://sentry.io/integration-platform/	2020-11-09 12:02:49 -08:00
Mateusz Mandera	47228f3a95	actions: Implement do_delete_user. To have a reasonable way of creating the dummy user without duplicating code, we need change create_user to have the optional force_id argument.	2020-11-09 11:58:02 -08:00
akshatdalton	806c1a0b8b	markdown: Fix flickering of embedded link inside Italic. This commit fixes a bug in marked.js which caused it to double-escape HTML when rendering messages of the form: [text](url). This fixes a bug introduced in `3bdc8bbaa5`, where an unnecessary escape() call was added for the <em> code path, likely just because it was adjacent to the others that needed it in the file. Fix this, and add tests to verify that things are still being escaped once after removing this extra escape. Fixes #14845.	2020-11-06 10:09:15 -08:00
Steve Howell	5da4332620	minor: Add order-by-id to digest message query. The order-by-id is now explicit, and I add comments to explain the select_related tables.	2020-11-06 10:05:46 -08:00
Steve Howell	936171d258	refactor: Extract DigestTopic class. This gets us away from a lot of dictionary soup.	2020-11-06 10:05:46 -08:00
Steve Howell	e8b6c56322	refactor: Simplify get_hot_topics(). The code we deleted here was no longer doing anything. Maybe the code was always dead, or maybe it was written during a time when topics_by_diversity and topics_by_length actually had different keys. But now it's clearly cruft. If we have 4 or more topics, then the code above it would already have populated the list with 4 elements, and the `if num_convos < 4` condition would evaluate to False. And if we had 3 or fewer topics, then we would have already put all possible topics into our result, and the `topics_by_diversity[num_convos:4]` slice would be empty. It's possible that we should just have a simple heuristic for topic hotness like `10*num_senders + messages`, so we don't have to maintain this fiddly function, and we can just do something like `topics_by_score[:4]`.	2020-11-06 10:05:46 -08:00
Steve Howell	c5dc9d386f	refactor: Use sets of stream_ids for email digests. I now use sets for stream_ids in more of the digest code. As part of this I replaced exclude_subscription_modified_streams with streams_recently_modified_for_user. It's easier for the caller to just ask for ids to delete from its callee than it is to pass in a set/list to mutate. The simpler boundary between the functions makes the tests easier to write--you can see the `filtered_streams` logic goes away in this diff. I also make the tests a bit more thorough by using combinations of Cordelia/Othello and Verona/Denmark to try to find multiple possible flaws. And I make the time intervals longer than 1s to avoid false negatives from slow CI boxes.	2020-11-05 17:42:43 -08:00
Steve Howell	88a57ed4ac	bulk digest: Get stream subscriptions in bulk. If we have multiple users, this reduces the amount of queries we need to do, because we get all subscriptions for all users in a single query to Subscription. For the single-user case, we are introducing an extra query hop, but the database is doing roughly the same work, because we are just breaking up this complex query into two hops: messages = select ... from message where recipient__type_id in ( select stream_id from subscription where ... ) Now it's more like: stream_ids = select stream_id from subscription where ... messages = select ... from message where recipient__type_id in stream_ids	2020-11-05 09:36:59 -08:00
Steve Howell	c83db37161	email digests: Introduce bulk methods for digest. Note that we are not changing anything semantically or algorithmically yet. The only overhead here for the single-user case is boxing and unboxing data into single-item dicts and lists. The interfaces for callers in the view and the queue processor remain the same for now.	2020-11-05 09:36:59 -08:00
Steve Howell	7c89e46731	minor: Clean up some code formatting.	2020-11-05 09:36:59 -08:00
Steve Howell	4bd02eea19	minor: Use user, not user_profile, in some digest code.	2020-11-05 09:36:59 -08:00
Steve Howell	0e2d02b0a2	digest tests: Count cache tries.	2020-11-05 09:36:59 -08:00
Steve Howell	127f4e1291	digest tests: Add more users to bulk digest test.	2020-11-05 09:36:59 -08:00
Steve Howell	89cb3fa841	digest tests: Localize mocks. We didn't need the enough-traffic mock. We also continue to prep for testing multiple users. I also finally remove a comment that is about to be addressed (and which inaccurately refers to huddles).	2020-11-05 09:36:59 -08:00
Steve Howell	1ec16dd1da	digest tests: Prep to test bulk digests. All this does, essentially, is put the logic we used to test for othello inside of a loop. We'll add more users in the next commit.	2020-11-05 09:36:59 -08:00
Steve Howell	e31326c823	refactor: Extract get_digest_context. This eliminates the union type and boolean parameter, and it makes it a bit easier to migrate to a bulk-get approach.	2020-11-05 09:36:59 -08:00
Steve Howell	217967f743	refactor: Extract get_hot_topics. This extraction will make a bit more sense when we start doing bulk operations on a realm to get digests, but even now, it encapsulates the slightly complex way we cherry-pick the top 4 topics for a user.	2020-11-05 09:36:59 -08:00
Steve Howell	5a6d6f81ff	refactor: Extract get_recent_topic_activity.	2020-11-05 09:36:59 -08:00
Steve Howell	f987b014b3	refactor: Rename conversation to topic. Not only is topic shorter, but the name makes it clear that we're not dealing with abstract conversations here--we are truly bucketing by topic.	2020-11-05 09:36:59 -08:00
Steve Howell	6ac3cd3534	refactor: Use list of topics, not tuples.	2020-11-05 09:36:59 -08:00
Steve Howell	878e938a89	minor: Rename conversation_diversity to conversation_senders.	2020-11-05 09:36:59 -08:00
Steve Howell	6dc8250e9a	mypy: Add TopicKey type for digests.	2020-11-05 09:36:59 -08:00
Steve Howell	96f6064b18	refactor: Move Messages query down the digest stack. This prep step is mostly for diff hygiene; the next commit will make the code a bit nicer. The original code here had the nice property that most (but not all) of the DB work happened up front in `handle_digest_email`, and none of the DB work was delegated to the callers. But I prefer the tradeoff of making the helpers a bit more cohesive--let them get the data they need. And we have query-count coverage in our tests, so there's no real danger of having helpers down in the stack insidiously doing a bunch of extra DB hops.	2020-11-05 09:36:59 -08:00
Anders Kaseorg	13c11ec5f3	openapi: Fix escaping in curl command generation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-05 09:36:31 -08:00
Steve Howell	c1f134a3a4	performance: Use ORM to fetch sender in render_markdown. In `709493cd75` (Feb 2017) I added code to render_markdown that re-fetched the sender of the message, to detect whether the message is a bot. It's better to just let the ORM fetch this. The message object should already have sender. The diff makes it look like we are saving round trips to the database, which is true in some cases. For the main message-send codepath, though, we are only saving a trip to memcached, since the middleware will have put our sender's user object into the cache. The test_message_send test calls internally to check_send_stream_message, so it was actually hitting the database in render_markdown (prior to my change).	2020-11-05 09:35:15 -08:00
Steve Howell	637f596751	tests: Fix queries_captured to clear cache up front. Before this change we were clearing the cache on every SQL usage. The code to do this was added in February 2017 in `6db4879f9c`. Now we clear the cache just one time, but before the action/request under test. Tests that want to count queries with a warm cache now specify keep_cache_warm=True. Those tests were particularly flawed before this change. In general, the old code both over-counted and under-counted queries. It under-counted SQL usage for requests that were able to pull some data out of a warm cache before they did any SQL. Typically this would have bypassed the initial query to get UserProfile, so you will see several off-by-one fixes. The old code over-counted SQL usage to the extent that it's a rather extreme assumption that during an action itself, the entries that you put into the cache will get thrown away. And that's essentially what the prior code simulated. Now, it's still bad if an action keeps hitting the cache for no reason, but it's not as bad as hitting the database. There doesn't appear to be any evidence of us doing something silly like fetching the same data from the cache in a loop, but there are opportunities to prevent second or third round trips to the cache for the same object, if we can re-structure the code so that the same caller doesn't have two callees get the same data. Note that for invites, we have some cache hits that are due to the nature of how we serialize data to our queue processor--we generally just serialize ids, and then re-fetch objects when we pop them off the queue.	2020-11-05 09:35:15 -08:00
Tim Abbott	eae14baa05	api: URL-quote password when testing authentication API. The passwords generated for our development environment / test suite include the `+` character, which needs to be quoted when encoded as an HTTP POST parameter. This is hopefully sufficient to fix the CI failures we've seen with the tests for POST /api/v1/fetch_api_key; I haven't reproduced the failure so am not completely sure.	2020-11-03 15:55:30 -08:00
YashRE42	967efc32d2	widgets: Remove tictactoe example widget. Steve asked me to remove this, since the tictactoe game was always intended as a proof of concept. Now that we have poll and todo widgets, the sample code for tictactoe has much less value. We replace the content and type in test_widgets.py to maintain coverage.	2020-11-03 14:46:39 -08:00
Aman Agrawal	87cdd8433d	home: Allow logged out user through home. We allow user to load webapp without log-in. This is only be enabled for developed purposes now. Production setups will see no changes.	2020-11-02 17:07:12 -08:00
shanukun	be39672026	api_docs: Document the /fetch-api-key endpoint. With tweaks by tabbott to document addition details. Fixes: #16408.	2020-11-02 16:45:42 -08:00
shanukun	da9d586254	openapi: Add parameter examples for fetch api key endpoints.	2020-11-02 16:45:42 -08:00
Anders Kaseorg	ac5cbf7693	Revert "markdown: Escape lang when echoing back custom non-pygments languages." This reverts commit `564b199fe6`, which was part of #16308. Escaping is either required or incorrect; it is never “defensive”. This escaping is incorrect. lxml already escapes attributes during serialization (any other behavior would be a serious bug), and additional escaping just results in double escaping. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-02 16:23:48 -08:00
akshatdalton	620e9cbf72	markdown: Fix merging of separate quotations. Initally, when writing two or more quotes, having a blank line in between them, merges those quotes. This created confusion especially in "quote and reply". This commit fixes such issues. Now two or more quotes having a blank line in between them, will not get merged. This change is correct both for usability and for improving our compatibility with CommonMark. Fixes #14379.	2020-10-30 15:21:15 -07:00
Mateusz Mandera	cbeeadab16	delete_realm: Register a post_delete Realm handler. By registering a post_delete handler to clear appropriate caches in a nicer way, we can get rid of the ugly flush-memcached call in the delete_realm command.	2020-10-30 11:43:03 -07:00
Alex Vandiver	bff503feb4	delete_realm: Add command to completely remove realms. This will need some tweaking in upcoming commits.	2020-10-30 11:42:40 -07:00
Anders Kaseorg	3c663e48db	url_encoding: Skip unnecessary encode before quote. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	df10b306a6	python: Remove force_bytes. We are generally good enough at types to know whether a value is str or bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	cc55393671	python: Open text files as text to skip decode operations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	18d0e4664c	python: Replace binascii with bytes.hex to skip some decode operations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	aaa7b766d8	python: Use universal_newlines to get str from subprocess. We can replace ‘universal_newlines’ with ‘text’ when we bump our minimum Python version to 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	9281dccae4	python: Serialize lxml elements directly to str. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	7c4f68d9cf	python: Skip unnecessary decode before BeautifulSoup parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	86e8d81c7f	python: Skip unnecessary decode before JSON parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	1802a50cc9	python: Use requests.Response.text instead of decoding content. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Tim Abbott	067cd3a97a	docs: Remove incorrect references to chat.zulip.org. Most of these are Help Center links that should be pointing to the production Help Center.	2020-10-29 16:46:40 -07:00
Tim Abbott	3b9c726fc6	outgoing_webhook: Avoid logging a bytes string. This fixes the new assertLogs() tests failing in CI; we fixed the weird use of bytes in the test, but not in the runtime code.	2020-10-29 15:55:11 -07:00
sahil839	7106069d4d	migration: Add migration to remove default status of private streams. This commit adds migration which removes default status of exisitng default private streams, i.e. private stream exists but they are no longer default.	2020-10-29 15:47:34 -07:00
sahil839	b29d39195c	streams: Do not allow default streams to be private. We now do not allow to make a stream private which is already a default stream.	2020-10-29 15:47:32 -07:00
sahil839	557ca0802c	streams: Do not allow private streams to be set as default. We now do not allow to set a private stream as default.	2020-10-29 15:43:37 -07:00
m-e-l-u-h-a-n	cbfd6464a5	logging: replace mock.patch() for logging with assertLogs() This commit removes mock.patch with assertLogs(). * Adds return value to do_rest_call() in outgoing_webhook.py, to support asserting log output in test_outgoing_webhook_system.py. * Logs are not asserted in test_realm.py because it would require to users to be queried using users=User.objects.filter(realm=realm) and the order of resulting queryset varies for each run. * In test_decorators.py, replacement of mock.patch is not done because I'm not sure if it's worth the effort to replace it as it's a return value of a function. Tweaked by tabbott to set proper mypy types.	2020-10-29 15:37:45 -07:00
Hemanth V. Alluri	99cf37dc51	drafts: Make the ID of the draft a part of the draft dict. Then because the ID is now part of the draft dict, we can (and do) change the structure of the "drafts" parameter returned from `GET /drafts` from an object (mapping ID to data) to an array. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	8d59fd2f45	tests/drafts: Simplify create_and_check_drafts_for_success. Sometimes we don't need to specify the expected_drafts field. So by removing it, we can reduce the clutter a bit. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	e60925b3e8	drafts: Change "timestamp" from float to integer. Now the timestamp returned in a draft dict will always be an int. The endpoints will still accept either an int or a float. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Abhijeet Prasad Bodas	e98a8856c7	logging: Add logging in deferred_work queue processor. Adds logging statements in deferred_work queue consume.	2020-10-29 10:34:53 -07:00
m-e-l-u-h-a-n	be7a70e742	logging: Remove unnecessary mock.patch() for logging. Our test-backend validation confirms that we don't log anything to stdout in the tests, so the fact that CI passes with this removes shows there was nothing being logged.	2020-10-28 23:15:27 -07:00
Vishnu KS	fdea49742c	apps: Use GitHub API for generating the web app download link.	2020-10-28 23:04:14 -07:00
ryanreh99	dfa7ce5637	uploads: Support non-AWS S3-compatible server. Boto3 does not allow setting the endpoint url from the config file. Thus we create a django setting variable (`S3_ENDPOINT_URL`) which is passed to service clients and resources of `boto3.Session`. We also update the uploads-backend documentation and remove the config environment variable as now AWS supports the SIGv4 signature format by default. And the region name is passed as a parameter instead of creating a config file for just this value. Fixes #16246.	2020-10-28 21:59:07 -07:00
ryanreh99	1c370a975c	refactor: Access a bucket by calling `zerver.lib.uploads.get_bucket`.	2020-10-28 21:52:08 -07:00
Alex Vandiver	f4eae83542	export: Only include real, active humans in the displayed count.	2020-10-28 18:31:06 -07:00
Alex Vandiver	3037e22f61	tornado: Never use proxies when talking from Django to Tornado. The `no_proxy` parameter does not work to remove proxying[1]; in this case, since all requests with this adapter are to the internal Tornado process, explicitly pass in an empty set of proxies to disable proxying. [1] https://github.com/psf/requests/issues/4600	2020-10-28 12:13:04 -07:00
Alex Vandiver	97745688ca	docs: Link to the new doc home of the email gateway.	2020-10-28 12:13:04 -07:00
Alex Vandiver	2b0bbbb882	tools: Rename postgres to postgresql in tool names.	2020-10-28 11:57:02 -07:00
Alex Vandiver	5eb8064a1a	install: Rename postgres options to postgresql.	2020-10-28 11:55:32 -07:00
Alex Vandiver	1f7132f50d	docs: Standardize on PostgreSQL, not Postgres.	2020-10-28 11:55:16 -07:00
Alex Vandiver	142de0f670	queue: Increase default timeout to 30s, from 10s. Not all of the workers are known to be safe to interrupt; they might leave inconsistent state. As such, terminating them with timeouts should currently only be a last-resort against stalled queues, not a regular occurrence.	2020-10-27 16:39:31 -07:00
Alex Vandiver	c73dd194f0	sentry: Group all worker timeouts together, by queue. Since the exception can be triggered at arbitrary places in the stack based on whenever the alarm happens to fire, they do not often group together. Explicitly group them together, grouped only by which queue the work is in.	2020-10-27 16:39:31 -07:00
Anders Kaseorg	1352f2f233	python: Replace manual quote_plus usage with urlencode. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	4e9d587535	python: Pass query parameters as a dict when making GET requests. This provides automatic URL-encoding. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	41f509170b	users: Canonicalize the timezone identifier. While working on shifting toward native browser time zone APIs (#16451), it was found that all but very recent Chrome and Node versions reject certain legacy timezone aliases like US/Pacific (https://crbug.com/364374). For now, we only canonicalize the timezone property returned in user objects and not the timezone setting itself. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	a8b1691e97	timezone: Convert get_common_timezones cache to lru_cache. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	0b288f92c9	timezone: Remove get_timezone wrapper. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	0134112b51	timezone: Remove get_all_timezones wrapper. Both callers want a set, and pytz already provides all_timezones_set. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Chris Bobbe	500f92169e	openapi: Fix the "the the" instances.	2020-10-26 12:50:01 -07:00
Chris Bobbe	dd43d4feb8	openapi: Note that a user's `avatar_url` may be missing. Like it already says in the detail about `user_avatar_url_field_optional`, but on the field itself [1]. [1] https://github.com/zulip/zulip-mobile/pull/4230#discussion_r493109645	2020-10-26 12:50:01 -07:00
Tim Abbott	6d7cd351a3	events: Optimize creating streams for new users. During the new user creation code path, there can be no existing active clients for the user being created, so we can skip the code to send events to that user's clients. The tests here reflect that we need to send fewer events, and do fewer queries that would have been spent computing data for these.. Fixes #16503, combined with the long series of recent changes by Steve Howell to fix super-linear behavior in this code path.	2020-10-26 12:47:15 -07:00
Steve Howell	88a7a1b002	events: Optimize peer_add/peer_remove for public streams. We no bulk up peer_add/peer_remove events by user if the same user has subscribed to multiple streams (and just that single user). This mostly optimizes the new-user codepath, but the algorithm is a bit more general in nature.	2020-10-26 12:33:28 -07:00
Alex Vandiver	7cf737988d	queue: Be more explicit about test/real queue division.	2020-10-26 12:32:47 -07:00
Anders Kaseorg	31d0141a30	python: Close opened files. Fixes various instances of ‘ResourceWarning: unclosed file’ with python -Wd. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-26 12:31:30 -07:00
Steve Howell	3ad1335a97	tests: Clear ContentType cache for user test. This keeps the number of queries predictable.	2020-10-26 07:18:08 -04:00
Steve Howell	5ef01b3ad8	tests: Fix test_create_user_with_multiple_streams. This test was flaky due to some date-related non-determinism. I make all the Message objects current to make add_new_user_history reliably try to bulk-update UserMessage rows to read.	2020-10-26 07:18:08 -04:00
Anders Kaseorg	96dee7e40b	decorator: Unfork redirect_to_login. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-25 14:58:53 -07:00
sahil839	a96811ab58	management: Replace knight command with change_user_role command. We replace knight command with change_user_role command which allows us to change role of a user to owner, admins, member and guest. We can also give/revoke api_super_user permission using this command. Tweaked by tabbott to improve the logging output and update documentation. Fixes #16586.	2020-10-25 14:55:08 -07:00
Harsh Srivastava	9b31df009b	openapi: Fix excessively large test_events failure output. Because of the very large `oneOf` clause of the formats of events possible in Zulip's `GET /events` system, we had issues with `test-backend` failures for missing documentation for a new event format being like 1000 lines of output, which was very much unhelpful. Fix this by limiting the output use only the oneOf variants that are broadly similar to the actual payload received. Fixes #16023.	2020-10-23 17:00:17 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	e513b75e86	markdown: Remove handler for old bug with incompatible twitter library. See commit `8b002040e0` and #86. The development environment bug that necessitated this handler has long been irrelevant. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:30:26 -07:00
Anders Kaseorg	b9fd49a2c6	mypy: Correct mistaken *args type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:29:13 -07:00
Anders Kaseorg	d295da676b	test_message_fetch: Clean up obsolete PGroonga bug workaround. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-22 23:27:23 -07:00
Anders Kaseorg	831d086110	i18n: Fix get_language_translation_data for zh_TW. Fixes #16600. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-22 16:43:02 -07:00
sahil839	571bb62e3d	events: Update subscriber list on peer_add for unsubscribed streams. We update the subscriber list on peer_add event for unsubscribed streams as well.	2020-10-22 15:12:32 -07:00
sahil839	733d26aef2	events: Update subscriber list on peer_remove for never subscribed stream. We now update the subscriber list on peer_remove event for never subscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	af9b153ee3	events: Update subscriber list on peer_remove for unsubscribed stream. We update the subscriber list on peer_remove event for unsubscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	709edd29d4	test_events: Fix comment in do_test_subscribe_events. The comment still pointed to 'vacate' event flow, but we have removed the vacate event in `a9356508ca`. This commit fixes the comment to depict the correct purpose of below lines, i.e. to test the remove event flow.	2020-10-22 15:12:32 -07:00
sahil839	e578742b02	test_events: Remove 'realm_user' from event_types in subscription test. We were including 'realm_user' in event_types along with 'subscription', but we don't send event of type 'realm_user' when subscribing to a new stream. This was added in `1c332f5d6a`. This commit removes 'realm_user' from event_types.	2020-10-22 15:12:32 -07:00
sahil839	d0f5537fb2	actions: Modify check_message for handling wildcard_mention_policy setting. This commit adds enforcement for sending messages containing wildcard mentions according to wildcard_mention_policy.	2020-10-22 14:46:32 -07:00
sahil839	25f32d461e	tests: Add tests for all the values of wildcard_mention_policy.	2020-10-22 12:08:22 -07:00
Mateusz Mandera	48f80fcb0a	auth: Expect name in request params in Apple auth. The name used to be included in the id_token, but this seems to have been changed by Apple and now it's sent in the `user` request param. https://github.com/python-social-auth/social-core/pull/483 is the upstream PR for this - but upstream is currently unmaintained, so we have to monkey patch. We also alter the tests to reflect this situation. Tests no longer put the name in the id_token, but rather in the `user` request param in the browser flow, just like it happens in reality. An adaptation has to be made in the native flow - since the name won't be included by Apple in the id_token anymore, the app, when POSTing to the /complete/apple/ endpoint, can (and should for better user experience) add the `user` param formatted as json of {"email": "hamlet@zulip.com", "name": {"firstName": "Full", "lastName": "Name"}} dict. This is also reflected by the change in the native flow tests.	2020-10-22 12:07:46 -07:00
Steve Howell	7ff3859136	subscriber events: Change schema for peer_add/peer_remove. We now can send an implied matrix of user/stream tuples for peer_add and peer_remove events. The client code basically does this: for stream_id in event['stream_ids']: for user_id in event['user_ids']: update_sub(stream_id, user_id) We used to send individual events, which gets real expensive when you are creating new streams. For the case of copy-to-stream case, we should see events go from U to 1, where U is the number of users added. Note that we don't yet fully optimize the potential of this schema. For adding a new user with lots of default streams, we still send S peer_add events. And if you subscribe a bunch of users to a bunch of private streams, we only go from U * S to S; we can't optimize it down to one event easily.	2020-10-22 11:19:53 -07:00
Anders Kaseorg	efa8dd3a47	compilemessages: Sort language list with Unicode Collation Algorithm. Right now the list of languages in Display settings → Default language is sorted in an unintuitive order due to the varying case conventions: British English Chinese (Taiwan) Deutsch English Hindi Indonesian (Indonesia) Lietuviškai Magyar Malayalam Nederlands Português Română Tiếng Việt Türkçe català español français galego italiano polski suomi svenska česky Русский Українська български српски فارسی தமிழ் 日本語简体中文繁體中文 한국어 Fix the sort to use the locale-independent Unicode Collation Algorithm: British English català česky Chinese (Taiwan) Deutsch English español français galego Hindi Indonesian (Indonesia) italiano Lietuviškai Magyar Malayalam Nederlands polski Português Română suomi svenska Tiếng Việt Türkçe български Русский српски Українська فارسی தமிழ் 한국어 日本語简体中文繁體中文 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-21 15:22:18 -07:00
Steve Howell	85ed6f332a	performance: Avoid Recipient lookup for stream messages. All the fields of a stream's recipient object can be inferred from the Stream, so we just make a local object. Django will create a Message object without checking that the child Recipient object has been saved. If that behavior changes in some upgrade, we should see some pretty obvious symptom, including query counts changing. Tweaked by tabbott to add a longer explanatory comment, and delete a useless old comment.	2020-10-20 11:47:23 -07:00
Steve Howell	7bbcc2ac96	refactor: Compute peers for public streams later. This saves us a query for edge cases like when you try to unsubscribe from a public stream that you have already unsubscribed from. But this is mostly to prep for upcoming optimizations.	2020-10-20 11:31:22 -07:00
Steve Howell	363e5d31a6	refactor: Split out public/private logic for peer events. This doesn't change anything yet, but the goal is to eventually optimize events for the case where one user (typically a new user) gets subscribed to multiple public streams.	2020-10-20 11:31:22 -07:00
Steve Howell	3961e69381	refactor: Extract send_peer_subscriber_events. We now use the same basic code to send peer_add and peer_remove events.	2020-10-20 11:31:22 -07:00
Anders Kaseorg	254b904965	markdown: Migrate off deprecated extension registration interface. Fixes #15205. https://python-markdown.github.io/change_log/release-3.0/#homegrown-ordereddict-has-been-replaced-with-a-purpose-built-registry https://python-markdown.github.io/change_log/release-3.0/#md_globals-keyword-deprecated-from-extension-api The priority numbers are arbitrarily chosen to preserve the existing order. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 18:31:12 -07:00
akshatdalton	287c4ed2bb	markdown: Fix Youtube and Vimeo preview overriding markdown link titles bug. Initially markdown titles were overridden by Youtube and Vimeo preview titles. But now it will check if any markdown title is present to replace Youtube or Vimeo preview titles, if preview of linked websites is enabled. Fixes #16100	2020-10-19 12:06:13 -07:00
Anders Kaseorg	d81a93cdf3	requirements: Upgrade markdown to 3.3.1. Upstream has slightly changed the whitespace around stashes. Take this opportunity to clean up the extra blank lines we were outputting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Anders Kaseorg	2ab15149bc	forms: Convert percent formatting to "".format. Caught by upgrading Semgrep. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Anders Kaseorg	f461a64a6b	i18n: Fix some ineffective calls to ugettext at top level. Translation has no effect when we don’t yet know what language we’re translating for. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-18 14:31:15 -07:00
Anders Kaseorg	bba43f35ca	i18n: Be deliberate about distinguishing ugettext and ugettext_lazy. The early str conversions in zerver.models were defeating the point of ugettext_lazy. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-18 14:31:15 -07:00
Steve Howell	e6f6f8d45f	refactor: Avoid "stream_id" on sub. There was no need to put "stream_id" on the sub dictionary here. It's kinda annoying to introduce the little helper here, but I feel that's better than crufting up the sub data structure.	2020-10-18 14:27:31 -07:00
Steve Howell	628a826aa2	minor: Move code and add comments about three lists.	2020-10-18 14:27:31 -07:00
Steve Howell	ffee129a35	refactor: Clean up is_web_public flag. The is_web_public flag is already in Stream.API_FIELDS, so there is no reason for all this complicated logic. There's no reason to hack it on to the subscription object.	2020-10-18 14:27:31 -07:00
Steve Howell	4dce34ab8b	refactor: Simplify call to bulk_get_subscriber_user_ids. The way we were computing the dictionary was very convoluted--all we need is a set of subscribed user ids.	2020-10-18 14:27:31 -07:00
Steve Howell	b58152abda	refactor: Introduce all_streams_map. We replace all_streams_id with a map. We also use it to populate never_subscribed_streams. And all_streams_map is a superset of stream_hash, which we will soon kill off as well.	2020-10-18 14:27:31 -07:00
Steve Howell	78384ebf1b	minor: Remove confusing parens. Apparently I put these parens in the code as part of `73c30774cb` during 2017. It looks like I extracted is_public during the middle of my change and forgot to remove the unnecessary parens. (The code was correct, but it makes it look like a tuple if you're skimming it too quickly.)	2020-10-18 14:27:31 -07:00
Steve Howell	d60dd94168	refactor: Extract funcs from gather_subscriptions_helper. This is a pure code move, apart from a little bit of quote cleanup and renames: user_profile -> user stream_dict -> result	2020-10-18 14:27:31 -07:00
Steve Howell	79fcf78143	refactor: Exclude "active" from API_FIELDS. We just need to make sure the relevant queries get it for the triage process.	2020-10-18 14:27:31 -07:00
Steve Howell	c5769d31f2	minor: Move code for web_public_stream_ids.	2020-10-18 14:27:31 -07:00
Steve Howell	0ca07ffd3c	peformance: Eliminate StreamRecipientMap. That class is an artifact of when Stream didn't have recipient_id. Now it's simpler to deal with stream subscriptions. We also save a query during page load (and other places where we get subscriber info).	2020-10-18 14:27:31 -07:00
Steve Howell	1951d75796	performance: Avoid select_related("realm"). We also move this query up in the function for some future refactorings.	2020-10-18 14:27:31 -07:00
Steve Howell	2f8ba383ef	tests: Test overhead for creating new users.	2020-10-18 14:27:31 -07:00
Mateusz Mandera	716df658fa	queue_processors: Don't run test queues with run-dev.py.	2020-10-18 14:07:31 -07:00
Steve Howell	57efe9d81a	performance: Streamline list_to_streams. We take advantage of stream.recipient to simplify the query's where clause and avoid the need for select_related("recipient").	2020-10-16 12:58:11 -07:00
Steve Howell	e1bcf6124f	refactor: Remove recipient from access_stream_by_name.	2020-10-16 12:58:11 -07:00
Steve Howell	a51b483f1a	performance: Remove recipient from access_stream_by_id. The Recipient table is now kind of useless for stream-related operations, since we have recipient_id on Stream now.	2020-10-16 12:58:11 -07:00
Steve Howell	31622feb87	refactor: Only return sub from access_stream_common. Let the callers access stream.recipient as needed. It costs the same, and some of the callers can actually stop caring about the actual Recipient object.	2020-10-16 12:58:11 -07:00
Steve Howell	bfd6e2b1fd	refactor: Use recipient_id to get topic history.	2020-10-16 12:58:11 -07:00
Steve Howell	3685fcc701	refactor: Remove recipient arg for do_mute_topic.	2020-10-16 12:58:11 -07:00
Steve Howell	65dbee4837	minor: Ask for recipient_id, not recipient.	2020-10-16 12:58:11 -07:00
Steve Howell	378062cc83	performance: Avoid call to access_stream_by_id. We already trust ids that are put on our queue for deferred work. For example, see the code for "mark_stream_messages_as_read_for_everyone" We now pass stream_recipient_id when we queue up work for do_mark_stream_messages_as_read. This generally saves about 3 queries per user when we unsubscribe them from a stream.	2020-10-16 12:58:11 -07:00
Steve Howell	2256d72015	minor: Add comment to subscriber test.	2020-10-16 12:58:11 -07:00
Steve Howell	31eb97ddde	performance: Fix do_mark_stream_messages_as_read. This function no longer asks for data that it doesn't need.	2020-10-16 12:58:11 -07:00
Steve Howell	6d1f9de7d3	performance: Use SubInfo when removing subscribers. We get two speedups: * The query to get existing subscribers only gets the two fields we need. We no longer need all the overhead of user_profile and recipient data being returned in the query. * We avoid Django making extra hops to the database to get user info.	2020-10-16 12:58:11 -07:00
Steve Howell	73982f6cc9	refactor: Move SubInfo to stream_subscription.py.	2020-10-16 12:58:11 -07:00
Tim Abbott	caa939d2d5	actions: Use transaction.atomic properly when removing subscriptions. Previously, the transaction.atomic() was not properly scoped to ensure that RealmAuditLog entries were created in the same transaction, making it possible for state changes to not be properly recorded in RealmAuditLog.	2020-10-15 15:12:05 -07:00
Steve Howell	0b91526f28	events: Remove "occupied" semantics for "streams". When apps like mobile register for "streams", we will now just use active streams as our baseline, rather than "occupied" streams. This means we will send a stream that is active, even if it happens to have zero occupants. It's actually pretty rare that a stream has zero occupants, and it's not exactly clear that we want to exclude a non-occupied but otherwise active stream from our list of streams. It also happens to be fairly expensive to compute whether a stream is occupied. This change only affects API clients (including possibly our mobile app). The main webapp never used the data from this codepath.	2020-10-15 15:12:01 -07:00
Steve Howell	b4346d0276	performance: Extract subscribers/peers in bulk. We replace get_peer_user_ids_for_stream_change with two bulk functions to get peers and/or subscribers. Note that we have three codepaths that care about peers: subscribing existing users: we need to tell peers about new subscribers we need to tell subscribed user about old subscribers unsubscribing existing users: we only need to tell peers who unsubscribed subscribing new user: we only need to tell peers about the new user (right now we generate send_event calls to tell the new user about existing subscribers, but this is a waste of effort that we will fix soon) The two bulk functions are this: bulk_get_subscriber_peer_info bulk_get_peers They have some overlap in the implementation, but there are some nuanced differences that are described in the comments. Looking up peers/subscribers in bulk leads to some nice optimizations. We will save some memchached traffic if you are subscribing to multiple public streams. We will save a query in the remove-subscriber case if you are only dealing with private streams.	2020-10-15 15:12:01 -07:00
Steve Howell	94e41c71f9	refactor: Use set of ids for altered users.	2020-10-15 15:12:01 -07:00
Steve Howell	b894597fa3	refactor: Use sets of stream_ids for helper args.	2020-10-15 15:12:01 -07:00
Steve Howell	3889554977	refactor: Extract send_peer_remove_events.	2020-10-15 15:12:01 -07:00
Steve Howell	c73f84f275	tests: Improve tests for unsubscribing multiple users. Note that the tests now reflect that we have O(N) behavior for multiple users.	2020-10-15 15:12:01 -07:00
Steve Howell	f86823f82f	tests: Add cache_tries_captured helper.	2020-10-15 15:12:01 -07:00
Steve Howell	ce70d08cbf	test_helpers: Use mock.patch.multiple.	2020-10-15 15:12:01 -07:00
Tim Abbott	bf66e9c4ab	actions: Add transaction.atomic to bulk_add_subs_to_db_with_logging. This will ensure that we always fully execute the database part of modifying subscription objects. In particular, this should prevent invariant failures like #16347 where Subscription objects were created without corresponding RealmAuditLog entries. Fixes #16347.	2020-10-14 11:06:00 -07:00
Steve Howell	5728149e94	performance: Streamline query to add subscribers. We don't need the select_related('user_profile') optimization any more, because we just keep track of user info in our own data structures. In this codepath we are never actually modifying users; we just occasionally need their ids or emails. This can be a pretty substantive improvement if you are adding a bunch of users to a stream who each have a bunch of their own subscriptions. We could also limit the number of full rows in this query by adding an extra hop to the DB just to get colors (using values_list), and then only get full sub info for the streams that we're adding, rather than getting every single subscription, in full, for each user. Apart from finding what colors the user has already used, the only other reason we need all the columns in Subscription here is to handle streams that need to be reactivated. Otherwise we could do only("id", "active", "recipient_id", "user_profile_id") or similar. Fortunately, Subscription isn't an overly wide table; it's mostly bool fields. But by far the biggest thing to avoid is bringing in all the extra user_profile data. We have pretty good coverage on query counts here, so I think this fix is pretty low risk.	2020-10-14 11:03:07 -07:00
Steve Howell	116a441bc5	refactor: Introduce SubInfo class. This class removes a lot of the annoying tuples we were passing around. Also, by including the user everywhere, which is easily available to us when we make instances of SubInfo, it sets the stage to remove select_related('user_profile').	2020-10-14 10:53:10 -07:00
Steve Howell	febef45e38	minor: Add comments to do_get_streams.	2020-10-14 10:53:10 -07:00
Steve Howell	a9356508ca	events: Stop sending occupy/vacate events. We used to send occupy/vacate events when either the first person entered a stream or the last person exited. It appears that our two main apps have never looked at these events. Instead, it's generally the case that clients handle events related to stream creation/deactivation and subscribe/unsubscribe. Note that we removed the apply_events code related to these events. This doesn't affect the webapp, because the webapp doesn't care about the "streams" field in do_events_register. There is a theoretical situation where a third party client could be the victim of a race where the "streams" data includes a stream where the last subscriber has left. I suspect in most of those situations it will be harmless, or possibly even helpful to the extent that they'll learn about streams that are in a "quasi" state where they're activated but not occupied. We could try to patch apply_event to detect when subscriptions get added or removed. Or we could just make the "streams" piece of do_events_register not care about occupy/vacate semantics. I favor the latter, since it might actually be what users what, and it will also simplify the code and improve performance.	2020-10-14 10:53:10 -07:00
Steve Howell	1bcb8d8ee8	performance: Avoid computing page_params.streams in webapp. The query to get "occupied" streams has been expensive in the past. I'm not sure how much any recent attempts to optimize that query have mitigated the issue, but since we clearly aren't sending this data, there is no reason to compute it.	2020-10-14 10:53:10 -07:00
Steve Howell	79803f01f4	minor: Format some code in events.py.	2020-10-14 10:53:10 -07:00
Steve Howell	193ca397f9	tests: Include deactivated users for subscribe test.	2020-10-14 10:53:10 -07:00
Aman Agrawal	fbf7cb82a7	web_public_guest: Rename to web_public_visitor for clarity. Using web_public_guest for anonymous users is confusing since 'guest' is actually a logged-in user compared to web_public_guest which is not logged-in and has only read access to messages. So, we rename it to web_public_visitor.	2020-10-13 16:59:52 -07:00
Steve Howell	e7a8c7ac48	test: Improve tests for bulk-adding subscribers. This is a more thorough test of adding multiple streams for multiple users, including streams that users have already subscribed to. The extra queries here are due to the fact that we call `principal_to_user_profile` in a loop in the view. So that's an example of O(N) overhead. We may be able to bulk-fetch these users eventually.	2020-10-13 18:54:55 -04:00
Steve Howell	c29ba75135	refactor: Extract send_messages_for_new_subscribers. This is a pure extraction, except that I remove a redundant check that `len(principals) > 0`. Whenever that value is false, then `new_subscriptions` will only have one possible entry, which is the current user, and we skip that in the loop.	2020-10-13 18:54:55 -04:00
Steve Howell	3b338ec32e	performance: Optimize filter_stream_authorization. We no longer do O(N) queries to get existing streams. This is a somewhat contrived use case--generally, we are not trying to re-subscribe a user to several streams. Still, we want to avoid this. This commit also makes `test_bulk_subscribe_many` do more work, and the change to the test helped me discover this bug.	2020-10-13 18:54:55 -04:00
Anders Kaseorg	6564540d15	docs: Fix some spelling errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Anders Kaseorg	dd48dbd912	docs: Add spaces to “check out”, “log in”, “set up”, “sign up” as verbs. “Checkout”, “login”, “setup”, and “signup” are nouns, not verbs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Steve Howell	598601e8fc	stream events: Prevent spurious events. If a user asks to be subscribed to a stream that they are already subscribed to, then that stream won't be in new_stream_user_ids, and we won't need to send an event for it. This change makes that happen more automatically.	2020-10-13 11:28:17 -07:00
Steve Howell	18771099e4	performance: Introduce new_stream_user_ids. Let U = number of users to subscribe S = number of streams to subscribe We were technically doing N^3 amount of work when we sent certain events, or to be more precise, U * S * S amount of work. For each stream, we were looping through a list of tuples of size U * S to find the users for the stream. In practice either U or S is usually 1, so the performance gains here are probably negligible, especially since the constant factors here were just slinging around Python data. But the code is actually more readable now, so it's a double win.	2020-10-13 11:28:17 -07:00
Steve Howell	ebb605319b	refactor: Rename stream_map to recipient_id_to_stream. I want to make a new dict called stream_id_to_stream, and stream_map would be confusing.	2020-10-13 11:28:17 -07:00
Steve Howell	b502957184	refactor: Extract new_recipient_ids local. We rename needs_new_sub (which sounds like a boolean!) to new_recipient_ids, and we calculate it explicitly within the loop, so that we don't need to worry as much about subsequent passes through the loop mutating it. This allows us to also remove recipient_ids, which in turn lets us remove recipients_map, albeit with a small tweak for stream_map. I also introduce the my_subs local, which I use to more directly populate used_colors, as well as using it as the loop var.	2020-10-13 11:28:17 -07:00
Steve Howell	766892d8aa	import: Reuse get_last_message_id() helper.	2020-10-13 11:28:17 -07:00
Steve Howell	188cc9bb3b	minor: Fix user/stream in test_subscriptions.	2020-10-13 11:28:17 -07:00
Steve Howell	9df9934ed6	refactor: Pass realm to bulk_add_subscriptions. I think it's important that the callers understand that bulk_add_subscriptions assumes all streams are being created within a single realm, so I make it an explicit parameter. This may be overkill--I would also be happy if we just included the assertions from this commit.	2020-10-13 11:28:17 -07:00
Steve Howell	efc931a671	minor: Extract realm local.	2020-10-13 11:28:17 -07:00
Steve Howell	b2d0a2efb9	refactor: Extract send_subscription_add_events. This function now does all the work that we used to do with notify_subscriptions_added happening inside a loop. There's a small fine-tuning here, where we only get recent traffic on streams that we're actually sending events for.	2020-10-13 11:28:17 -07:00
Steve Howell	223ce83a0a	refactor: Clean up call to notify_subscriptions_added. We now just pass in all_subscribers_by_stream, rather than a callback. We also move sub_tuples_by_user closer to the loop where we call notify_subscriptions_added.	2020-10-13 11:28:17 -07:00
Steve Howell	811426b345	Extract send_stream_creation_events_for_private_streams. We can probably avoid passing in users here.	2020-10-12 16:40:37 -07:00
Steve Howell	1cfaef0d1a	refactor: Simplify pick_color logic. This removes the need to jankily mutate the active flag in the caller, and we don't need to mutate our subs_by_user either.	2020-10-12 16:40:37 -07:00
Steve Howell	13569ff97a	refactor: Eliminate new_subs. We now just process new subs for a user immediately within the loop.	2020-10-12 16:40:37 -07:00
Steve Howell	8c70fbde78	refactor: Use subs_to_add in return value. The subs_to_add is directly related to a var called new_subs, which I hope to eliminate soon.	2020-10-12 16:40:37 -07:00
Steve Howell	1afca3d430	minor: Extract local for stream.	2020-10-12 16:40:37 -07:00
Steve Howell	84aa1389d8	Extract bulk_add_subs_to_db_with_logging. This is a trivial code extraction.	2020-10-12 16:40:37 -07:00
Steve Howell	3ff9ce78ea	refactor: Extract send_peer_add_events.	2020-10-12 16:40:37 -07:00
Alex Vandiver	f3ba227614	create_user: Strip whitespace from initial password file. Fixes #12144.	2020-10-11 16:29:00 -07:00
Cody Piersall	5dab6e9d31	emoji-upload: Fix transparency issues on GIF emoji upload. This preserves the alpha layer on GIF images that need to be resized before being uploaded. Two important changes occur here: 1. The new frame is a copy of the original image, which preserves the GIF info. 2. The disposal method of the original GIF is preserved. This essentially determines what state each frame of the GIF starts from when it is drawn; see PIL's docs: https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html#saving for more info. This resolves some but not all of the test cases in #16370.	2020-10-11 16:23:07 -07:00
Anders Kaseorg	b7a94be152	python: Catch BaseException when we need to clean something up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:16:16 -07:00
Anders Kaseorg	7f69c1d3d5	python: Catch specific exceptions from requests. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	17ac17286c	python: Catch specific exceptions from subprocess. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	aabef3d9be	python: Catch specific exceptions from orjson. Followup to #16120. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	234f7245cf	export_usermessage_batch: Use os.rename. This avoids an extra stat call to check whether the target is a directory. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	83eca256a4	compilemessages: Use polib for get_name_from_po_file. This also corrects the name of zh_TW from “Chinese” to “Chinese (Taiwan)”. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	1346c5397a	zephyr: Use correct shell quoting for ssh. ssh always runs its command through a shell (after naïvely joining multiple arguments with spaces), so it needs an extra level of shell quoting. This should have no effect because we already validated user with a regex, but it’s better for escaping to be locally correct in case the context changes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	82593338ba	report: Show Git commit in a way that works for merges. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	c9fec8f021	deliver_scheduled_messages: Don’t do_send_messages inside a transaction. do_send_messages has side effects outside the database and may not work reliably if its database effects are reordered by being inside a transaction. This also fixes a bug where we were doing the update incorrectly on the Message table. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:09:22 -07:00
Alex Vandiver	c2132a4f9c	queue: Drop register_json_consumer / json_drain_queue interface. Now that all callsites use the same interface, drop the now-unused ones, and their tests.	2020-10-11 14:19:42 -07:00
Alex Vandiver	5477b9d9a1	queue: Switch tests to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	179c387409	tornado: Switch to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f0b23b0752	queue: Switch non-batch consumer to also use start_json_consumer. This has no effect on consumption rate, but unifies the codepaths. Before: ``` $ ./manage.py queue_rate --count 50000 Purging queue... Enqueue rate: 11187 / sec Dequeue rate: 4158 / sec ``` After: ``` $ ./manage.py queue_rate --count 50000 Purging queue... Enqueue rate: 11010 / sec Dequeue rate: 4113 / sec ```	2020-10-11 14:19:42 -07:00
Alex Vandiver	45c9c3cc30	queue: Monitor user_activity queue, now that it has a consumer. Since this was using repead individual get() calls previously, it could not be monitored for having a consumer. Add it in, by marking it of queue type "consumer" (the default), and adding Nagios lines for it. Also adjust missedmessage_emails to be monitored; it stopped using LoopQueueProcessingWorker in `5cec566cb9`, but was never added back into the set of monitored consumers.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f9358d5330	queue: Switch batch interface to use the channel.consume iterator. This low-level interface allows consuming from a queue with timeouts. This can be used to either consume in batches (with an upper timeout), or one-at-a-time. This is notably more performant than calling `.get()` repeatedly (what json_drain_queue does under the hood), which is "highly discouraged as it is very inefficient"[1]. Before this change: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11158 / sec Dequeue rate: 3075 / sec ``` After: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11511 / sec Dequeue rate: 19938 / sec ``` [1] https://www.rabbitmq.com/consumers.html#fetching	2020-10-11 14:19:40 -07:00
Alex Vandiver	571f8b8664	queue: Use low-level queue_purge to empty at the end of tests. This is O(1) at the RabbitMQ API level, and doesn't rely on the code under test to function correctly during test cleanup.	2020-10-09 20:43:49 -07:00
Alex Vandiver	ac0ba21c2c	tests: Stop reusing a variable name. `loopworker_sleep_mock` is a file-level variable used to mock out the sleep() call in LoopQueueProcessingWorker; don't reuse the variable name for something else.	2020-10-09 20:42:20 -07:00
Alex Vandiver	754638f673	tests: Refactor test_queue_worker to separate queues.	2020-10-09 20:42:12 -07:00
Alex Vandiver	2547bdbf4a	queue: Rename consume_wrapper to a better name.	2020-10-09 20:40:51 -07:00
Alex Vandiver	d5a6b0f99a	queue: Rename queue_size, and update for all local queues. Despite its name, the `queue_size` method does not return the number of items in the queue; it returns the number of items that the local consumer has delivered but unprocessed. These are often, but not always, the same. RabbitMQ's queues maintain the queue of unacknowledged messages; when a consumer connects, it sends to the consumer some number of messages to handle, known as the "prefetch." This is a performance optimization, to ensure the consumer code does not need to wait for a network round-trip before having new data to consume. The default prefetch is 0, which means that RabbitMQ immediately dumps all outstanding messages to the consumer, which slowly processes and acknowledges them. If a second consumer were to connect to the same queue, they would receive no messages to process, as the first consumer has already been allocated them. If the first consumer disconnects or crashes, all prior events sent to it are then made available for other consumers on the queue. The consumer does not know the total size of the queue -- merely how many messages it has been handed. No change is made to the prefetch here; however, future changes may wish to limit the prefetch, either for memory-saving, or to allow multiple consumers to work the same queue. Rename the method to make clear that it only contains information about the local queue in the consumer, not the full RabbitMQ queue. Also include the waiting message count, which is used by the `consume()` iterator for similar purpose to the pending events list.	2020-10-09 20:40:39 -07:00
Alex Vandiver	a1ce1aca3b	queue: Update comment to be more accurate about import errors.	2020-10-09 20:40:32 -07:00
Alex Vandiver	2d71ca1fb8	email: Remove unused `log_digest_event` function. Its last callsite was removed in `e46cbaffa2`. Also ref #6786.	2020-10-08 20:35:53 -07:00
sahil839	4e8f5b5b31	streams: Change access_stream_for_delete_or_update to also return sub. We modify access_stream_for_delete_or_update function to return Subscription object also along with stream. This change will be helpful in avoiding an extra query to get subscription object in code for updating subscription role.	2020-10-08 17:07:30 -07:00
Aman Agrawal	8b419c93e4	message_send: Fix old guests being treated as full members. For streams in which only full members are allowed to post, we block guest users from posting there. Guests users were blocked from posting to admin only streams already. So now, guest users can only post to STREAM_POST_POLICY_EVERYONE streams. This is not a new feature but a bugfix which should have happened when implementing full member stream policy / guest users.	2020-10-08 11:30:11 -07:00
Anders Kaseorg	e1a3aa9c1a	deliver_scheduled_messages: Exit cleanly on Ctrl+C. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:15:19 -07:00
Anders Kaseorg	363374346c	management: Use signal.pause to sleep forever. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:15:19 -07:00
Anders Kaseorg	c7cbdba257	eslint: Fix unicorn/{new-for-builtins,throw-new-error,error-message}. https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/new-for-builtins.md https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/throw-new-error.md https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/error-message.md Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:00:33 -07:00
Anders Kaseorg	9bfbb29763	queue_processors: Use try…finally to prevent leaking an alarm. Otherwise, if consume_func raised an exception for any reason other than the alarm being fired, the still-pending alarm would have fired later at some arbitrary point in the calling code. We need two try…finally blocks in case the signal arrives just before signal.alarm(0). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 15:37:46 -07:00
akshatdalton	52c411df8a	emoji: Add padding around the gif on GIF emoji upload. Replaced ImageOps.fit by ImageOps.pad, in zerver/lib/upload.py, which returns a sized and padded version of the image, expanded to fill the requested aspect ratio and size. Fixes part of #16370.	2020-10-06 17:28:02 -07:00
Alex Vandiver	d47637fa40	queue: Set a max consume timeout with SIGALRM. SIGALRM is the simplest way to set a specific maximum duration that queue workers can take to handle a specific message. This only works in non-threaded environments, however, as signal handlers are per-process, not per-thread. The MAX_CONSUME_SECONDS is set quite high, at 10s -- the longest average worker consume time is embed_links, which hovers near 1s. Since just knowing the recent mean does not give much information[1], it is difficult to know how much variance is expected. As such, we set the threshold to be such that only events which are significant outliers will be timed out. This can be tuned downwards as more statistics are gathered on the runtime of the workers. The exception to this is DeferredWorker, which deals with quite-long requests, and thus has no enforceable SLO. [1] https://www.autodesk.com/research/publications/same-stats-different-graphs	2020-10-06 17:26:14 -07:00
Alex Vandiver	baf882a133	queue: Only ACK drain_queue once it has completed work on the list. Currently, drain_queue and json_drain_queue ack every message as it is pulled off of the queue, until the queue is empty. This means that if the consumer crashes between pulling a batch of messages off the queue, and actually processing them, those messages will be permanently lost. Sending an ACK on every message also results in a significant amount lot of traffic to rabbitmq, with notable performance implications. Send a singular ACK after the processing has completed, by making `drain_queue` into a contextmanager. Additionally, use the `multiple` flag to ACK all of the messages at once -- or explicitly NACK the messages if processing failed. Sending a NACK will re-queue them at the front of the queue. Performance of a no-op dequeue before this change: ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10847 / sec Dequeue rate: 2479 / sec ``` Performance of a no-op dequeue after this change (a 25% increase): ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10752 / sec Dequeue rate: 3079 / sec ```	2020-10-06 17:26:14 -07:00
Alex Vandiver	df86a564dc	queue: Let stop() work with LoopQueueProcessingWorker.	2020-10-06 17:26:14 -07:00
Alex Vandiver	8cf37a0d4b	queue: Add a tool to profile no-op enqueue and dequeue actions.	2020-10-06 17:26:14 -07:00
Mateusz Mandera	6e83bcc0d5	custom_profile_fields: Don't allow leading/trailing whitespaces. Allowing such whitespaces can lead to hard to debug issues e.g. with ldap sync.	2020-10-02 14:58:06 -07:00
Abhijeet Prasad Bodas	a20d22de43	i18n: Move locale select logic in home to i18n.py. Part of #16094. Moved the language selection preference logic from home.py to a new function in i18n.py to avoid repetition in analytics views and home views.	2020-10-02 14:56:20 -07:00
Aman Agrawal	08fbde4e7c	test_move_msgs: Rename variable for clarity.	2020-10-01 17:45:11 -07:00
Tim Abbott	8c8f3ee13b	test_classes: Extract home view helpers for reuse.	2020-10-01 15:14:25 -07:00
Tim Abbott	6d041a3b34	home: Include is_web_public_guest in page_params.	2020-10-01 15:07:19 -07:00
Tim Abbott	351d73ac5a	home: Pass realm to build_page_params_for_home_page_load. This is preparation for this needing to be a separate parameter from the user.	2020-10-01 15:00:36 -07:00
Aman Agrawal	18d852de49	unreads: Add support for web public guests. This handles the case of web public guests by returning RawUnreadMessagesResult with empty initalized values.	2020-10-01 14:53:43 -07:00
Aman Agrawal	e02f0fb1c4	get_raw_unread_data: Extract func to get unreads from ums.	2020-10-01 14:46:46 -07:00
Aman Agrawal	742bb7a9d5	zulip_otp_required: Don't 2fa logged out users. For users who are not authenticated, we don't need to 2fa them, we only need it once they are trying to login. Tweaked by tabbott to be much more readable; the new style might require new test coverage.	2020-10-01 14:46:46 -07:00
Tim Abbott	e8e876d54c	events: Deduplicate anonymous user code. This approach lets us deduplicate much of the fetch_initial_state_data logic around logged-out users.	2020-10-01 14:43:51 -07:00
Aman Agrawal	f46f251688	post_process_state: Allow web public guests. Because the logic already works correctly, we just need to change mypy types.	2020-10-01 14:41:49 -07:00
Aman Agrawal	190f481f49	stream_subscription: Mark notifications disabled for web public users. Users without an account can't get notifications, so we might as well ensure any UI displays them appropriately.	2020-10-01 14:40:48 -07:00
Aman Agrawal	b0d92b3ff6	HomeTest: Extract page_params keys to be used in other functions.	2020-10-01 14:39:54 -07:00
sahil839	78b98d8067	realm: Add wildcard_mention_policy setting. We add a new wildcard_mention_policy setting to handle wildcard mentions in large streams, with a wide range of policies available to organizations. We set the default to the safe option for preventing accidental spam: only stream administrators being able to use wildcard mentions in large streams.	2020-10-01 12:18:03 -07:00
Anders Kaseorg	46babbe9e1	import_realm: Close the memcached connection before forking. This prevents the memcached connection from being shared across multiple processes, and hopefully addresses unexpected behavior from cached functions like get_user_profile_by_id invoked inside the worker processes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-01 11:20:39 -07:00
Anders Kaseorg	cf5ededa35	openapi: Use at most one inline subschema in allOf. This fixes some of the warnings from openapi-generator. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-30 19:45:28 -07:00
sahil839	6c473ed75f	message: Call build_message_send_dict from check_message. We call build_message_send_dict from check_message instead of do_send_messages. This is a prep commit for adding a new setting for handling wildcard mentions in large streams.	2020-09-29 17:18:04 -07:00
sahil839	f1a5fbaeb0	message: Extract build_message_send_dict function. We extract the loop for building message dict in do_send_messages in a separate function named build_message_send_dict. This is a prep commit for moving the code for building of message dict in check_message.	2020-09-29 16:50:47 -07:00
sahil839	0514ba7ecb	message: Add 'links_for_embed' to message_dict. There is a bug where we send event for even those messages which do not have embedded links as we are using single set 'links_for_embed' to check whether we have to send event for embedded links or not. This commit fixes the bug by adding 'links_for_embed' in message dict itself and send the event only if that message has embedded links.	2020-09-29 16:50:47 -07:00
Steve Howell	c199571112	mypy: Add StreamDict. This requires us to rework the view code a little bit to explicitly assign fields.	2020-09-29 16:49:10 -07:00
Steve Howell	bee18c70f0	mypy: Use str in statsd_key.	2020-09-29 16:49:10 -07:00
Steve Howell	2c496d9afd	mypy: Fix do_send_user_group_update_event.	2020-09-29 16:49:10 -07:00
Steve Howell	a37ef208dc	mypy: Add RawReactionRow.	2020-09-29 16:49:10 -07:00
Anders Kaseorg	cfd93096b5	openapi: Remove yamole. As explained in the previous commit, yamole preprocessed allOf with an algorithm that is not standards compliant. We replicate that algorithm, but importantly, we only use it for our own code and not for building the openapi_core RequestValidator. This improves the time taken by OpenAPISpec().check_reload() from 1.69s to 0.53s, nearly all of which is inside openapi_core.create_spec. Closes #10484. Significantly improves #16068. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	fb2d7c6741	openapi: Fix allOf usage to conform to the OpenAPI specification. yamole preprocesses our schema by naïvely merging all the objects in an allOf array together, but this fails to capture the meaning of allOf according to the OpenAPI specification. allOf is supposed to be a strict logical intersection of each subschema interpreted independently. It does not combine their properties maps before interpreting additionalProperties. So according to the old definition of JsonSuccess, every response is invalid: allOf: - additionalProperties: false properties: result: type: string - required: - result - msg properties: msg: type: string because the first subschema disallowed msg and the second subschema required msg. To fix this, whenever we use allOf for schema “inheritence”, the base schema must not specify additionalProperties, and the child schema must explicitly list all properties recursively inherited from the base schema in any subschema that uses additionalProperties. Fixes #16109. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	c5765c9da6	openapi: Remove some keys redundant with a $ref. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	f3ff082107	openapi: Remove trivial allOf wrapping. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Vishnu KS	367c792968	actions: Downgrade realm before scrubbing.	2020-09-28 15:37:49 -07:00
Vishnu KS	0d30f59c97	billing: downgrade_now -> downgrade_now_without_creating_additional_invoice.	2020-09-28 15:37:49 -07:00
Tim Abbott	0c2d1f068d	docs: Extend documentation of event system testing.	2020-09-28 12:37:54 -07:00
Steve Howell	def3dac6ae	event_schema: Add comments to top of the file. The comments basically explain the common coding patterns for making the checkers.	2020-09-28 12:19:28 -07:00
Tim Abbott	3242fc7388	soft_deactivation: Fix typo in logging output.	2020-09-28 12:12:04 -07:00
palash	7a7db69935	test_push_notifications: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_push_notifications.py	2020-09-28 12:12:00 -07:00
palash	0c18113910	soft_deactivation: Change root logger to zulip.soft_deactivation. Update logger in the following files using this logger: test_soft_deactivation, test_home, test_push_notifications	2020-09-28 12:12:00 -07:00
Tim Abbott	899cb41857	MessageDict: Remove _finalize_payload for simplicity. finalize_payload already has a few options; there's little benefit to this one being implemented as a separate helper function.	2020-09-28 12:00:18 -07:00
Tim Abbott	99396b25a6	MessageDict: Add a bit of docstring documentation.	2020-09-28 11:50:02 -07:00
Tim Abbott	90ff62aabc	actions: Rename message local variable to message_dict. This is a preparatory refactor to make it easy to see the changes using `git show` in the next commit.	2020-09-28 11:14:59 -07:00
sahil839	ae74f8aafb	actions: Remove unnecessary comment in do_send_messages function. This commit removes the unnecessary comment which was added in `9454683108`, when we were using message.get() for keys which were also passed as args in do_send_messages, but there are no such keys in the current code.	2020-09-28 10:58:35 -07:00
sahil839	76c75fea92	actions: Remove unnecessary line from do_send_messages. This commit removes the unnecessary line of code to get rendered_content from message dict sent by check_message when it actually does not inlcude 'rendered_content' key. This line was added in `9454683108`, but now we do not send rendered_content in the message dict as we render the message in do_send_messages itself.	2020-09-28 10:58:35 -07:00
Dinesh	acca870480	tests: Add a dummy request to self.client.login(). A later commit alters `authenticate` of EmailAuthBackend to add a store `needs_to_change_password` variable to session which is useful to insist users on changing their weak password. The tests start failing with that change because client.login() runs `authenticate` without a `request` object. So, this commit sends a request object with `request.session=self.client.session` to self.client.login() in tests wherever needed.	2020-09-25 16:24:18 -07:00
Abhijeet Prasad Bodas	d9d51e32c1	i18n: Simplify logic for translation data in page_params. This refactors the get_translation_data function to return an empty dict when 'en' language is passed, to avoid repetition of code in stats and home.	2020-09-25 16:21:37 -07:00
Dinesh	232eb8b7cf	auth: Render config error page on configuration error. We previously used to to redirect to config error page with a different URL. This commit renders config error in the same URL where configuration error is encountered. This way when conifguration error is fixed the user can refresh to continue normally or go back to login page from the link provided to choose any other backend auth. Also moved those URLs to dev_urls.py so that they can be easily accessed to work on styling etc. In tests, removed some of the asserts checking status code to be 200 as the function `assert_in_success_response` does that check.	2020-09-25 16:16:17 -07:00
Clara Dantas	8674287192	digest: Support digest of web public streams for guest users. This change requires some basic plumbing for test code creating web-public streams.	2020-09-25 16:11:04 -07:00
Vishnu KS	a888e65ea4	i18n: Don't include email tags in translation strings.	2020-09-25 15:53:26 -07:00
Tim Abbott	94a9fa1891	event_schema: Add documentation and rename a few functions. This should help make this revised subsystem readable for more new contributors. We still need to make updates to the high-level documentation.	2020-09-25 12:53:00 -07:00
Steve Howell	5b7c9c4714	test_events: Add check_realm_user_remove.	2020-09-25 11:43:20 -07:00
Steve Howell	7bb7f2943f	event_schema: Finish extraction with realm_emoji/update. We now no longer define any schemas in test_events--all of them are in event_schema, which helps our tooling cross-check schemas for openapi and node tests.	2020-09-25 11:43:20 -07:00
Steve Howell	ae4d083a5a	event_schema: Extract check_realm_domains_*.	2020-09-25 11:43:20 -07:00
Steve Howell	298bed9fa1	event_schema: Split check_update_message_flags.	2020-09-25 11:43:20 -07:00
Steve Howell	f6e0171d02	event_schema: Split check_reaction into add/remove. It happens that whether you add a reaction or remove a reaction, we send the exact same fields, just using a different op code. This sort of symmetry is actually kind of rare, as usually "add" events have more fields, and "remove" events might just send an id of something to remove. Our openapi schema treats these as two seperate events, so we are more consistent with it, and it helps our schema-checking tooling for node fixtures, too. Note that we now have to exempt the two events from our openapi checks, due to the is_mirror_dummy field in the deprecated user block. We can decide how to handle this later--one possibility is to just add it as an optional field on the event_schema side.	2020-09-25 11:43:20 -07:00
Steve Howell	b7b2546f44	event_schema: Extract check_subscription_update. Note that we use value_type for value instead of bool, since properties can be non-bool things like color, which we just don't test now. We should test them. We more than compensate for this by checking the actual value of the value in check_subscription_update.	2020-09-25 11:43:20 -07:00
Steve Howell	b920ebce81	event_schema: Extract check_has_zoom_token.	2020-09-25 11:43:20 -07:00
Steve Howell	0c4286222f	event_schema: Extract check_realm_update_dict.	2020-09-25 11:43:20 -07:00
Steve Howell	6ec6525624	event_schema: Extract check_delete_message. There is a legacy format where we send singular "message_id" instead of plural "message_ids". Then there are different fields for "private" and "stream" message types.	2020-09-25 11:43:20 -07:00
Steve Howell	88165aee6b	event_schema: Extract check_user_group_update.	2020-09-25 11:43:20 -07:00
Steve Howell	aaaac11661	event_schema: Extract check_user_group_remove.	2020-09-25 11:43:20 -07:00
Steve Howell	1b7af13f37	event_schema: Extract check_user_group_remove_members.	2020-09-25 11:43:20 -07:00
Steve Howell	19b7739065	event_schema: Extract check_user_group_add_members.	2020-09-25 11:43:20 -07:00
Steve Howell	4084f0b949	event_schema: Extract check_realm_user_add. Note that we make the schema for profile_data slightly more realistic, but it doesn't actually get exercised by our current tests (apart from making sure it's a dict), since we don't have profile data for our test realm. We also don't have the optional fields for bots, since our tests don't exercise that, nor delivery_email. So we exempt realm_user_add_event from openapi checks for now. When we try to match the openapi specs better, we will probably want to add a few tests to test_events. Obviously getting good coverage for adding users would be nice for all these scenarios: * delivery_email matters * bots * realm has profile fields	2020-09-25 11:43:19 -07:00
Steve Howell	dc2176a965	event_schema: Extract check_presence.	2020-09-25 11:43:19 -07:00
Steve Howell	6c74a44697	data_types: Generalize StringDictType. This is a prep commit for supporting "presence" events, where the key of the dictionary is some arbitrary string like "website" but the value of the dictionary is another dictionary itself with keys that are more like variable names.	2020-09-25 11:43:19 -07:00
Steve Howell	4f3d5f2d87	event_schema: Extract check_realm_filters. We have some known issues with representing tuples in openapi, so we exempt realm_filters from the relevant check.	2020-09-25 11:43:19 -07:00
Steve Howell	e40a5400e5	event_schema: Extract check_muted_topics. This also forces us to create TupleType. We exempt this from the openapi check, since we haven't figured out how to model tuples in openapi with the same precision as event_schema (and it may be impossible). Long term we just want to stop dealing in tuples, of course.	2020-09-25 11:43:19 -07:00
orientor	91ca1afe98	data_type: Add StringDict data type. StringDict is a data type for representing dictionaries where all keys and values are strings. Add this data type to data_types.py and edit other files so that this data type is put to use and tested. (slightly tweaked by @showell to remove a comment and shorten a var name now that we have a proper data type)	2020-09-25 11:43:19 -07:00
Steve Howell	78a2059b8d	event schema: Extract attachment checkers.	2020-09-25 11:43:19 -07:00
Steve Howell	4a947c971d	event_schema: Extract check_realm_export. These are all trivial transformations. Note that we don't insist timestamps are floats; the NumberType class allows ints too.	2020-09-25 11:43:19 -07:00
Steve Howell	d28c01284c	event_schema: Extract check_hotspots. This forces us to introduce a NumberType.	2020-09-25 11:43:19 -07:00
Steve Howell	cf26151cea	event_schema: Use realm_user_person_types. For realm_user events, we now structure the person type as a union of dicts, which is more consistent with how we model this in our openapi spec.	2020-09-25 11:43:19 -07:00
Steve Howell	10952394b0	test_events: Use int value of message_retention_days. We also make our schema in event_schema reflect this, which in turn makes us match the already accurate openapi spec, so we no longer need to exempt four types of events from our sanity checks.	2020-09-25 11:43:19 -07:00
Steve Howell	73e7f7edec	check-node-fixtures: Compare python/openapi schemas. We might want to rename the tool to something more general now, since we are really reconciling three things: - node fixtures - event_schema checkers for test_events - openapi specs The way we compare python and openapi schemas is as follows: - first convert openapi schemas to be build from DictType, ListType, etc. with from_opeapi - do a diff on the schemas Most of the new code is just having the FooType family of classes serialize themselves with schema().	2020-09-25 11:43:19 -07:00
Steve Howell	2b76eb767f	event_schema: Change propagate_mode to an enum.	2020-09-25 11:43:19 -07:00
Steve Howell	aca641a4d1	refactor: Extract data_types module. Defining types with an object hierarchy of type classes will allow us to build functionality that was impossible (or really janky) with the validators.py approach of composing functions. Most of the changes to event_schema.py were automated search/replaces. This patch doesn't really yet take advantage of the new FooType classes, but we will use it soon to audit our openapi specs.	2020-09-25 11:43:19 -07:00
Wes Galbraith	9645959ac4	populate_db: Add emoji reactions to development environment database. This change adds automated generated emoji reactions to the data in the development environment's database. Fixes part of #14991.	2020-09-23 16:10:37 -07:00
Steve Howell	f29b2884ca	bitbucket2: Format user info consistently. We now use get_user_info() to format all of our users in messages.	2020-09-23 15:31:38 -07:00
Steve Howell	e0b6619dac	bitbucket2: Simplify how we display user for fork events. Even before GDPR changes, it was strange that we displayed users differently for fork events vs. all other events. After GDPR, we don't even get the `username` field any more. So now we simply use `display_name` if available, and then we try `nickname`. See https://developer.atlassian.com/cloud/bitbucket/bitbucket-api-changes-gdpr/ for more context.	2020-09-23 15:31:38 -07:00
Steve Howell	1ef8d79352	bitbucket: Decouple BITBUCKET_FORK_BODY between versions. We were trying to share the same format string between the two different versions of bitbucket, but this only creates confusion, as the two versions are only close enough to be confusing. The format string might be the same, but the semantics are different, as well as the eventual outputs. For example, the {username} piece here is simple in version 2, but in version 3 we append a url to the user's name.	2020-09-23 15:31:38 -07:00
sahil839	fe370debe5	tests: Rename stream messages tests in test_message_send.py. This commit renames 'test_message_to_self' and 'test_api_message_to_self' tests to 'test_message_to_stream_by_name' and 'test_api_message_to_stream_by_name' to depict the actual purpose of these tests.	2020-09-23 15:28:31 -07:00
Aman Agrawal	5ae2325979	events: Don't send presence data for web public guests. We disable presence and hide list of users in right sidebar via setting realm_presence_disabled to false here for web public guests.	2020-09-23 12:11:22 -07:00
Aman Agrawal	5153a036f2	events: Disable topic edit & msg edit/delete for web public guest. To ensure web public guests cannot change any data, it's natural that we disable topic & msg edit / delete ops.	2020-09-23 12:11:22 -07:00
Aman Agrawal	3ec23e1a9d	fetch_initial_state_data: Handle case of web public guests. user_profile will be None for web_public_guests here. Hence, for settings (of which most be inaccessible by web public guest), which require a user_profile, we either set an empty value for them or set them to a default value. This will help render the frontend or extend support to our clients without breaking a lot of code. Tweaked by tabbott to add many comments.	2020-09-23 12:11:15 -07:00
Aman Agrawal	9cabd8f9cb	process_client: Don't update activity of unauthenticated users. This allows wrapper `add_logging_data` to be used to add client information for unauthenticated users (or web public guests).	2020-09-23 12:07:06 -07:00
Aman Agrawal	6012d3cff2	get_raw_user_data: Support acting_user=None. This already had support for web_public_guest, fixed mypy annotation.	2020-09-23 12:07:06 -07:00
Aman Agrawal	c88b4cba60	format_user_row: Support acting_user=None.	2020-09-23 12:07:06 -07:00
Aman Agrawal	48492a0633	fetch_initial_state_data: Pass realm as independent parameter. This removes dependency of the function on user_profile to get the realm, which will be useful when user_profile is None in case of web public guests.	2020-09-23 12:06:54 -07:00
Alex Vandiver	fd20e54c79	webhooks: Never log JsonableError to webook loggers. These represent known errors in what the user submitted. This is slightly complicated by UnsupportedWebhookEventType being an instance of JsonableError.	2020-09-22 15:11:48 -07:00
Alex Vandiver	7001004ec0	webhooks: Do not predicate on the "payload" key. If we are to log to the webhook logger, do so no matter which arguments are passed.	2020-09-22 15:11:48 -07:00
Alex Vandiver	1a763696f7	webhooks: Only enable webhook logging if it is a webhook. allow_webhook_access may be true if the request allows webhook requests, regardless of if it only used for a webhook integration. Only actually log to the verbose webhook logger if it is explicitly a webhook endpoint, as judged by `webhook_client_name`. This prevents requests for `POST /api/v1/messages` from being logged to the webhook logger if they mistakenly contain a `payload` argument.	2020-09-22 15:11:48 -07:00
Alex Vandiver	77d1a4a5c0	webhooks: Simplify logic around is_webhook_access. We clearly allow webhook access if we are setting the webhook_client_name. This removes the need for the `or`s later.	2020-09-22 15:11:48 -07:00
Alex Vandiver	d24869e484	webhooks: Rename is_webhook to allow_webhook_access. This argument does not define if an endpoint "is a webhook"; it is set for "/api/v1/messages", which is not really a webhook, but allows access from webhooks.	2020-09-22 15:11:48 -07:00
Alex Vandiver	b515c2bcbf	webhooks: Add a missing format for the realm.	2020-09-22 15:11:48 -07:00
Aman Agrawal	1b5b82e712	RealmFilterPattern: Mark converted content as AtomicString. If multiple filters match the same string, we run into an infinite loop of converting string into urls. To fix it, we mark the matched string as atomic after first conversion.	2020-09-22 15:10:38 -07:00
Anders Kaseorg	0228acf0f5	rest: Add rest_path shortcut for path with rest_dispatch. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:51:00 -07:00
Anders Kaseorg	e70f2ae58d	rest: Specify rest_dispatch handlers by function, not by string. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:46:28 -07:00
Anders Kaseorg	faf600e9f5	urls: Remove unused URL names and shorten others. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:46:28 -07:00
Alex Vandiver	db8daf4175	linkifiers: Allow tildes in target URLs.	2020-09-21 21:04:02 -07:00
Alex Vandiver	03c6a0f182	markdown: Skip other common file extensions in linking, sort.	2020-09-21 21:03:29 -07:00
Alex Vandiver	4361ce1246	markdown: Use tlds package to keep updated list of TLDs. Also remove a useage of "blacklist."	2020-09-21 21:03:29 -07:00
Aman Agrawal	37e77c77e2	message_fetch: Better handle unauthorized requests. We raise MissingAuthenticationError now, which adds `www_authenticate=session` header to the error response. This stops modern web-browsers from displaying a login form everytime a 401 response it sent to the client.	2020-09-21 16:07:43 -07:00
Tim Abbott	7fa8bafe81	lint: Fix type of initial 0 in queue monitoring.	2020-09-21 15:47:30 -07:00
Alex Vandiver	e5f62d083e	tornado: Merge the TORNADO_SERVER and TORNADO_PORTS configs. Having both of these is confusing; TORNADO_SERVER is used only when there is one TORNADO_PORT. Its primary use is actually to be _unset_, and signal that in-process handling is to be done. Rename to USING_TORNADO, to parallel the existing USING_RABBITMQ, and switch the places that used it for its contents to using TORNADO_PORTS.	2020-09-21 15:36:16 -07:00
Mateusz Mandera	810514dd9d	queue: Update stats file every 30 seconds. This system can't update stats while the queue is idle, without using threads for this, but at least we ensure to update the file after consuming an event if more than MAX_SECONDS_BEFORE_UPDATE_STATS passed since the last update, regardless of the number of iterations done so far.	2020-09-21 15:24:02 -07:00
Mateusz Mandera	40c4511a9c	queue: Fix misspelled consume_iteration_counter variable.	2020-09-21 15:22:58 -07:00
Mateusz Mandera	2365a53496	queue: Fix a race condition in monitoring after queue stops being idle. The race condition is described in the comment block removed by this commit. This leaves room for another, remaining race condition that should be virtually impossible, but nevertheless it seems worthwhile to have it documented in the code, so we put a new comment describing it. As a final note, this is not a new race condition, it was hypothetically possible with the old code as well.	2020-09-21 15:22:56 -07:00
Amitsinghyadav	a72e9476ee	api_docs: Add fragment references for all parameters. This makes it convenient to link to a specific parameter accepted by Zulip API endpoint.	2020-09-21 12:18:10 -07:00
Sumanth V Rao	c563cdba61	markdown: Add data-code-lang attribute for locally echoed messages. This mimics the backend logic for adding the data-attribute - to know what Pygments language was used to highlight the code block - in locally echoed messages. New test added checks our logic for canonicalizing pygments alias (for both frontend and backend). Other fixtures and tests amended.	2020-09-18 17:12:26 -07:00
Sumanth V Rao	564b199fe6	markdown: Escape lang when echoing back custom non-pygments languages. In `ae58ed5a7` we decided to echo back the text, when no Pygments lexer matching that language was found. When we do so, we must take care to HTML escape the lang before wrapping it in a data-code-language attribute. Tweaked by tabbott to make clear the escaping is defensive.	2020-09-18 17:12:11 -07:00
Alex Vandiver	1928696829	tests: Remove three references to Casper in the comments.	2020-09-18 15:13:40 -07:00
Alex Vandiver	f638518722	tornado: Move default production port to 9800. In development and test, we keep the Tornado port at 9993 and 9983, respectively; this allows tests to run while a dev instance is running. In production, moving to port 9800 consistently removes an odd edge case, when just one worker is on an entirely different port than if two workers are used.	2020-09-18 15:13:40 -07:00
Alex Vandiver	5c806fbd52	tornado: Remove a misleading comment and reformat. tornado.web.Application does not share any inheritance with Django at all; it has a similar router interface, but tornado.web.Application is not an instance of Django anything. Refold the long lines that follow it.	2020-09-18 15:13:40 -07:00
Alex Vandiver	4354386e69	tornado: Remove an unused port argument. This was added in `ec065e92ee` for the WebSocket codepath, which was subsequently removed in `ea6934c26d`.	2020-09-18 15:13:40 -07:00
Alex Vandiver	de1db2c838	sentry: Provide more metadata in queue processors. This allows aggregation by queue, makes the event data more readily accessible, and clears out the breadcrumbs upon every batch that is serviced.	2020-09-18 15:13:08 -07:00
Tim Abbott	9c0d6becc5	docs: Add comment links to i18n documentation.	2020-09-18 11:44:04 -07:00
Alex Vandiver	5a9b6feb18	tornado: Retry POST requests from Django to Tornado. While urllib3 retries all connection errors, it only retries a subset of read errors, since not all requests are safe to retry if they are not idempotent, and the far side may have already processed them once. By default, the only methods that are urllib3 retries read errors on are GET, TRACE, DELETE, OPTIONS, HEAD, and PUT. However, all of the requests into Tornado from Django are POST requests, which limits the effectiveness of `bb754e0902`. POST requests to `/api/v1/events/internal` are safe to retry; at worst, they will result in another event queue, which is low cost and will be GC'd in short order. POST requests to `/notify_tornado` are _not_ safe to retry, but this codepath is only used if USING_RABBITMQ is False, which only occurs during testing. Enable retries for read errors during all POSTs to Tornado, to better handle Tornado restarts without 500's.	2020-09-17 16:50:36 -07:00
Tim Abbott	ae58ed5a74	markdown: Tweak data-code-language testing and comments. This should make it clearer the precise decisions we've made about the intended semantics of this feature.	2020-09-15 12:30:57 -07:00
Sumanth V Rao	b0c9e0a295	markdown: Rename fenced code data-attribute to data-code-language.	2020-09-15 20:09:58 +05:30
Harsh Srivastava	ba36624442	api docs: Removing order dependency of deactivate user test. We create a User using `do_create_user`, before running the deactivation test. This lets us removing the ordering logic introduced in `7c17bdb9c5`.	2020-09-14 22:31:40 -07:00
Alex Vandiver	ff94254598	tornado: Log to files by port number. Without an explicit port number, the `stdout_logfile` values for each port are identical. Supervisor apparently decides that it will de-conflict this by appending an arbitrary number to the end: ``` /var/log/zulip/tornado.log /var/log/zulip/tornado.log.1 /var/log/zulip/tornado.log.10 /var/log/zulip/tornado.log.2 /var/log/zulip/tornado.log.3 /var/log/zulip/tornado.log.7 /var/log/zulip/tornado.log.8 /var/log/zulip/tornado.log.9 ``` This is quite confusing, since most other files in `/var/log/zulip/` use `.1` to mean logrotate was used. Also note that these are not all sequential -- 4, 5, and 6 are mysteriously missing, though they were used in previous restarts. This can make it extremely hard to debug logs from a particular Tornado shard. Give the logfiles a consistent name, and set them up to logrotate.	2020-09-14 22:17:51 -07:00
Alex Vandiver	812af977d3	compilemessages: Switch to canonical override_language().	2020-09-14 22:16:09 -07:00
Alex Vandiver	536bd3188e	middleware: Move locale-setting before domain checking. Calling `render()` in a middleware before LocaleMiddleware has run will pick up the most-recently-set locale. This may be from the _previous_ request, since the current language is thread-local. This results in the "Organization does not exist" page occasionally being in not-English, depending on the preferences of the request which that thread just finished serving. Move HostDomainMiddleware below LocaleMiddleware; none of the earlier middlewares call `render()`, so are safe. This will also allow the "Organization does not exist" page to be localized based on the user's browser preferences. Unfortunately, it also means that the default LocaleMiddleware catches the 404 from the HostDomainMiddlware and helpfully tries to check if the failure is because the URL lacks a language component (e.g. `/en/`) by turning it into a 304 to that new URL. We must subclass the default LocaleMiddleware to remove this unwanted functionality. Doing so exposes a two places in tests that relied (directly or indirectly) upon the redirection: '/confirmation_key' was redirected to '/en/confirmation_key', since the non-i18n version did not exist; and requests to `/stats/realm/not_existing_realm/` incorrectly were expecting a 302, not a 404. This regression likely came in during `f00ff1ef62`, since prior to that, the HostDomainMiddleware ran _after_ the rest of the request had completed.	2020-09-14 22:16:09 -07:00
sahil839	9c3341ad95	openapi: Rearrange users/{user_id}/subscriptions/{stream_id} docs. This commit moves docs for users/{user_id}/subscriptions/{stream_id} enndpoint to be after users/me/subscriptions/muted_topics docs. We are rearranging the docs because after adding the new patch endpoint for users/{user_id}/subscriptions/{stream_id}, openapi_core validator tries to match 'users/me/subscriptions/muted_topics' with 'users/{user_id}/subscriptions/{stream_id}' path in zulip.yaml and thus gives error while running tests. This is a bug in 'openapi_core' as it does not follows OpenAPI specs to match concrete paths before their templated counterparts. Thus, this commit rearranges the docs such that openapi_core validator tries to match muted_topics endpoint with the correct path in zulip.yaml docs.	2020-09-14 22:04:45 -07:00
Sumanth V Rao	033351609d	markdown: Add data-codehilite-language attr for fenced code. When converting fenced code markdown, we add the language (if specified) in a data-attribute by tweaking the HTML generated. Doing so, allows the frontend to make use of this attr to display view-in-playground option for codeblocks. We use pygments to get the lexer subclass name and use that instead of directly using the language in the data-attribute. Doing so, helps us map different language aliases (like `js` and `javascript`) into a common variable (like `JavaScript`) - and avoids the client from dealing with multiple tags corresponding to the same language. The html structure for a message like this: ``` js ..content.. ``` would now be: <div class="codehilite" data-codehilite-language="JavaScript"> <pre>..content..</pre> </div> Tests and fixtures amended.	2020-09-14 21:25:19 -07:00
Anders Kaseorg	a659542d84	parallel: Remove run_parallel. This was a broken abstraction that returned to its caller within multiple forked processes on exceptions, and encouraged ignoring the error code (as all of its callers did). Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	0f16df2f13	transfer: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	7f410ff0de	import_realm: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	9a2aad58d0	import_util: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	73eb0aa891	test_runner: Monkey-patch to allow using multiprocessing in tests. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Tim Abbott	f751acbec5	openapi: Add comments on top of various OpenAPI doc files. This is part of our standard approach for trying to make it easy for folks to find relevant documentation on a system they're trying to understand.	2020-09-14 15:25:46 -07:00
Aman Agrawal	2bc3924672	move_topic_to_stream: Allow moving to/between/from private streams. Fixes #16284. Most of the work for this was done when we implemented correct behavior for guest users, since they treat public streams like private streams anyway. The general method involves moving the messages to the new stream with special care of UserMessage. We delete UserMessages for subs who are losing access to the message. For private streams with protected history, we also create UserMessage elements for users who are not present in the old stream, since that's important for those users to access the moved messages.	2020-09-14 15:00:55 -07:00
Anders Kaseorg	ddf8ec33df	upload: Strip leading slash from deleted S3 export paths. Previously, S3UploadBackend.delete_export_tarball failed to strip the leading ‘/’ from the export path. This mistake is now caught by Moto 1.3.15. I expect it caused deletion failures in the real S3, although I haven’t verified this. We store export_path in the audit log with a leading ‘/’, but the actual S3 keys do not have a leading ‘/’. Changing either system would require a migration. So the new convention is that the variables named ‘export_path’ have a leading ‘/’, while variables named ‘path_id’ or ‘key’ do not. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 20:59:09 -07:00
Anders Kaseorg	42d038f09b	migrations: Fix 0301 to replace a Python loop with SQL. The previous code is correctly flagged by semgrep 0.23 as a violation of our sql-format rule. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 20:59:09 -07:00
Hemanth V. Alluri	f57196b1d0	webhooks/gitlab: Add fixture for MR opened with multiple assignees. Fixes #16173. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Hemanth V. Alluri	54aba8d402	webhooks/gitlab: Use information about all assignees. Previously, the GitLab webhook code, namely the `get_objects_assignee` method first tried to get a single assignee and if that failed then it looks for multiple assignees and then it would return the first assignee that it found (there's actually a code smell here - a loop which would always return on the first iteration). Instead, this commit will change that behavior to first check for multiple assignees first then for a single assignee if we can't find multiple assignees. Ultimately it will return a list of all of the assignees (however many that might be [0, n]). This method has then aptly been renamed to `get_assignees`. Finally, we tweked the code using this method to always use it's output as an "assignees" parameter to templates (there's also an assignee parameter which we want to avoid here for consistency). Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Hemanth V. Alluri	bbe7a54171	webhooks/gitlab: Make permissions consistent across all fixtures. For some reasons, some of the fixtures had the +x bit set, while some didn't. What this commit does is make sure that no fixture is marked as "executable" (for anyone). Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Anders Kaseorg	dfab09b17d	markdown: Replace hyperlink requirement with urllib.parse. The previous code only worked by accident and hyperlink 20.0.0 breaks it. >>> hyperlink.parse("example.com").replace(scheme="https") DecodedURL(url=URL.from_text('https:example.com')) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 15:37:28 -07:00
Anders Kaseorg	f39d4cf1f0	sentry: Pass format arguments to logging. semgrep 0.23 correctly flags this as a violation of our logging-format rule. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 15:37:28 -07:00
Mateusz Mandera	70bcc7a8e9	migrations: Add case-insensitive unique index on realm and stream name. This is closely analogical to `b9b146c809`.	2020-09-13 15:36:43 -07:00
Anders Kaseorg	463929f349	urls: Migrate re_path routes to path. Django treats path("<name>") like re_path(r"(?P<name>[^/]+)") and path("<path:name>") like re_path(r"(?P<name>.+)"). This is more readable and consistent than the mix of slightly different regexes we had before, and fixes various bugs: • The r'apps/(.*)$' regex was missing a start anchor ^, so it incorrectly matched all URLs that included apps/ as a substring anywhere. • The r'accounts/login/(google)/$' regex was missing a start anchor ^, so it incorrectly matched all URLs that ended with accounts/login/google/. • The type annotation of zerver.views.realm_export.delete_realm_export takes export_id as an int, but it was previously passed as a string. • The type annotation of zerver.views.users.avatar takes medium as a bool, but it was previously passed as a string. • The [0-9A-Za-z]+ pattern for uidb64 was missing the - and _ characters that can validly be part of a base64url encoded string (although I think the id is actually a decimal integer here, in which case only 012345ADEIMNOQTUYcgjkwxyz are present in its base64url encoding). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 14:29:26 -07:00
Anders Kaseorg	7c17bdb9c5	openapi: Remove ‘example’ $ref siblings. $ref siblings are ignored according to the OpenAPI specification, and the referenced definitions already have examples. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:57:13 -07:00
Anders Kaseorg	caa08d76b5	openapi: Inline parameter references to avoid ‘required’ $ref siblings. $ref siblings are ignored according to the OpenAPI specification. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:57:13 -07:00
Anders Kaseorg	b41165fd01	dev_urls: Give the user_avatars URLPattern a name. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:21:40 -07:00
palash	2632317b06	test_message_send: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_message_send.py	2020-09-12 11:04:51 -07:00
palash	f2f8034b76	test_markdown: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_markdown.py	2020-09-12 11:04:51 -07:00
palash	588fff236d	test_service_bot_system: Refactor mock.patch with assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_service_bot_system.py	2020-09-12 10:59:36 -07:00
palash	4e2769c31c	test_decorators: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_decorators.py	2020-09-12 10:59:36 -07:00
palash	c3fee5059f	test_email_mirror: Refactor mock.path to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_email_mirror.py	2020-09-12 10:59:36 -07:00
palash	34003fc7f6	email_mirror: Change root logger to zerver.lib.email_mirror.	2020-09-12 10:53:56 -07:00
palash	60f9f87c5d	test_external: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_external.py	2020-09-12 10:53:56 -07:00
palash	485d740fe8	apple auth: Change logger to zulip.auth.apple from root logger. Replace default root logger with zulip.auth.apple for apple auth in file zproject/backends.py and update the test cases accordingly in file zerver/tests/test_auth_backends.py	2020-09-12 10:53:56 -07:00
palash	04f08f45e7	test_auth_backends: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file test_auth_backends.py. This change requires adjusting test_log_into_subdomain_when_email_is_none to use an explicit token since that appears in the log output.	2020-09-12 10:53:37 -07:00
Aman Agrawal	ed8796f23f	report: Allow anonymous queries to report (un)narrow times.	2020-09-11 16:57:27 -07:00
Alex Vandiver	3f6e4ff303	webhooks: Move the extra logging information into a formatter. This clears it out of the data sent to Sentry, where it is duplicative with the indexed metadata -- and potentially exposes PHI if Sentry's "make this issue public" feature is used.	2020-09-11 16:43:29 -07:00
Alex Vandiver	b8a2e6b5f8	webhooks: Configure webhook loggers in zproject/computed_settings.py. This limits the webhook errors to only go to their respective log files, and not to the general server logs.	2020-09-11 16:43:29 -07:00
Alex Vandiver	6323218a0e	request: Maintain a thread-local of the current request. This allows logging (to Sentry, or disk) to be annotated with richer data about the request.	2020-09-11 16:43:29 -07:00
Mateusz Mandera	bb4567f57e	queue: Extract get_remaining_queue_size method.	2020-09-11 15:51:07 -07:00
Mateusz Mandera	1d466a4fc5	queue: Make embed_link updates stats on every iteration.	2020-09-11 15:51:07 -07:00
Tim Abbott	f0c2c640b6	api docs: Add changes for community_topic_editing_limit_seconds. The previous commit fixed this in the changelog, but it should have also been documented directly as well.	2020-09-11 15:50:06 -07:00
Alex Vandiver	4917391133	webhooks: Derive payload from request itself.	2020-09-10 17:47:22 -07:00
Alex Vandiver	a1f5f6502c	webhooks: In logger, pull user from request, rather than parameter. request.user is set by validate_api_key, which is called by webhook_view and authenticated_rest_api_view.	2020-09-10 17:47:22 -07:00
Alex Vandiver	d04db7c5fe	webhooks: Remove repetitive argument to UnsupportedWebhookEventType. The name of the webhook can be added by the webhook decorator.	2020-09-10 17:47:21 -07:00
Alex Vandiver	e2ab7b9e17	webhooks: Update API_KEY_ONLY_WEBHOOK_LOG_PATH to WEBHOOK_LOG_PATH. The existence of "API_KEY" in this configuration variable is confusing. It is fundamentally about webhooks.	2020-09-10 17:47:21 -07:00
Alex Vandiver	cf6ebb9c8d	webhooks: Rename api_key_only_webhook_view to webhook_view. There are no other types of webhook views; this is more concise.	2020-09-10 17:47:21 -07:00
Alex Vandiver	8cfacbf8aa	webhooks: Update comment about typing the webhook decorator. The previous link was to "extended callable" types, which are deprecated in favor of callback protocols. Unfortunately, defining a protocol class can't express the typing -- we need some sort of variadic generics[1]. Specifically, we wish to support hitting the endpoint with additional parameters; thus, this protocol is insufficient: ``` class WebhookHandler(Protocol): def __call__(request: HttpRequest, api_key: str) -> HttpResponse: ... ``` ...since it prohibits additional parameters. And allowing extra arguments: ``` class WebhookHandler(Protocol): def __call__(request: HttpRequest, api_key: str, args: object, *kwargs: object) -> HttpResponse: ... ``` ...is similarly problematic, since the view handlers do not support _arbitrary_ keyword arguments. [1] https://github.com/python/typing/issues/193	2020-09-10 17:47:21 -07:00
Alex Vandiver	ea8823742b	webhooks: Adjust the name of the unsupported logger. `zulip.zerver.lib.webhooks.common` was very opaque previously, especially since none of the logging was actually done from that module. Adjust to a more explicit logger name.	2020-09-10 17:47:21 -07:00
Alex Vandiver	9ea9752e0e	webhooks: Rename UnexpectedWebhookEventType to UnsupportedWebhookEventType. Any exception is an "unexpected event", which means talking about having an "unexpected event logger" or "unexpected event exception" is confusing. As the error message in `exceptions.py` already explains, this is about an _unsupported_ event type. This also switches the path that these exceptions are written to, accordingly.	2020-09-10 17:47:21 -07:00
Alex Vandiver	8016769613	webhooks: Move UnexpectedWebhookEventType into zerver.lib.exceptions. `8e10ab282a` moved UnexpectedWebhookEventType into `zerver.lib.exceptions`, but left the import into `zserver.lib.webhooks.common` so that webhooks could continue to import the exception from there. This clutters things and adds complexity; there is no compelling reason that the exception's source of truth should not move alongside all other exceptions.	2020-09-10 17:47:21 -07:00
Mateusz Mandera	f95dd628bd	email: Fix race conditions with concurrent ScheduledEmail handling. The main race conditions, which actually happened in production was with concurrent execution of deliver_email and clear_scheduled_emails. clear_scheduled_emails could delete all email.users in the middle of deliver_email execution, causing it to pass empty to_user_ids list to send_email. We mitigate this by getting the list of user ids in a single query and moving forward with that snapshot, not having to worry about database data being mutated anymore. clear_scheduled_emails had potential race conditions with concurrent execution of itself due to not locking the appropriate rows upon selecting them for the purpose of potentially deleting them. FOR UPDATE locks need to be acquired to prevent simultaneous mutation. Tested manually with some print+sleep debugging to make some races happen. fixes #zulip-2k (sentry)	2020-09-09 15:59:01 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
Dinesh	c64888048f	puppeteer: Rename CASPER_TESTS env variable to PUPPETEER_TESTS. Also modified few comments to match with the changes.	2020-09-09 13:38:39 -04:00
Aman	c114447e11	topic_history: Allow anonymous access in web-public streams. For web-public streams, clients can access full topic history without being authenticated. They only need to additionally send "streams:web-public" narrow with their request like all the other web-public queries.	2020-09-04 10:17:56 -07:00
Anders Kaseorg	3b301f522b	python: Tweak some magic trailing commas to avoid Black bugs. https://github.com/psf/black/issues/1658 https://github.com/psf/black/issues/1671 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	f91d287447	python: Pre-fix a few spots for better Black formatting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	bb4fc3c4c7	python: Prefer --flag=option over --flag option. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	bef46dab3c	python: Prefer kwargs form of dict.update. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	75c59a820d	python: Convert subprocess.Popen.communicate to run or check_output. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:42:35 -07:00
Vishnu KS	6bbcb622e1	onboarding: Move send_welcome_bot_response to onboarding.	2020-09-03 17:41:08 -07:00
Alex Vandiver	70e8cc5783	tests: Add more tests around exception handling. This verifies that we actually do enqueue a record when there is an error on non-staging. With the previous commit, it verifies that that data serializes correctly.	2020-09-03 17:34:31 -07:00

... 8 9 10 11 12 ...

13342 Commits