zulip

Commit Graph

Author	SHA1	Message	Date
Rex Ferrer	d4c0578560	refactor: Integrate POSTRequestMock into HostRequestMock. Minimized code duplication by integrating POSTRequestMock into HostRequestMock and then updating the required files with HostRequestMock. Fixes part of #1211.	2021-03-03 21:52:05 -08:00
sahil839	4ca21a6982	users: Give moderators same permissions as that of full members. This commit updates the stream creation, subscribing others to stream, wildcard mention settings and stream post policy to allow realm moderators even if they are new and the respective setting is set to allow full members only.	2021-03-02 17:19:31 -08:00
sahil839	b4fd15d516	models: Rename is_new_member to is_provisional_member. This commit renames the is_new_member property in models.py to is_provisional_member which will return true for any user who is not a full member. We will add a condition in further commit such that this returns 'False' for a moderator as we will initially give all the rights to moderator that a full member has.	2021-03-02 17:19:31 -08:00
Mateusz Mandera	6f9f608225	test_home: Fix wrong bot references in test_people. These are all referring to email_gateway_bot, when they're supposed to refer to the notification and welcome bots, respectively. The values are the same though, so the tests were passing anyway.	2021-02-28 17:02:37 -08:00
Sumanth V Rao	829f9272d2	hotspots: Extract INTRO_HOTSPOTS from ALL_HOTSPOTS. Its likely that we would implement new hotspots that aren't a part of the tutorial hotspots, in the future. For instance, a hotspot to advertise new features. Hence, grouping them into categories like INTRO_HOTSPOTS would be a good start. We also have an aggregate of all types of hotspots we may add in the future, under ALL_HOTSPOTS.	2021-02-26 15:02:48 -08:00
Mateusz Mandera	4b903c5dcd	invites: Fix bug revoking user invites in other realms than intended. Fixes #17238. In process_new_human user, the queries were wrong, revoking all invites sent to the email address, even in other realms than the one where the new account just got created.	2021-02-26 08:26:43 -08:00
Mateusz Mandera	b9c1fed18c	invites: Delete old compat code in the invites queue worker. 1.7.* is old enough at this point that we can clean up this code.	2021-02-26 08:26:43 -08:00
shanukun	4b67946605	refactor: Make acting_user a mandatory kwarg for do_create_user.	2021-02-25 17:58:00 -08:00
Alex Vandiver	e53be6d043	email: Set an envelope-from which may be different from the From: field. The envelope-from is used by the MTA if the destination address is not deliverable. Route all such mail to the noreply address.	2021-02-24 17:32:28 -08:00
Mateusz Mandera	1d4badf6ad	tests: Test internal_send_private_message can send to cross-realm bots.	2021-02-23 15:26:47 -08:00
Mateusz Mandera	51d7f24d20	actions: Remove realm argument to internal_send_stream_message. The argument is redundant.	2021-02-23 15:26:47 -08:00
Mateusz Mandera	09fc79f911	actions: Remove realm argument to internal_send_private_message. The argument is redundant.	2021-02-23 15:26:47 -08:00
Mateusz Mandera	a652573169	tests: Fix tests causing internal_send_private_message with wrong realm. test_signup: This test was wrong, because the inviter UserProfile was from a different realm. Such a PreregistrationUser shouldn't be considered valid. test_tutorial: The direct call to internal_send_private_message was using sender's realm as the realm argument which is not valid. It doesn't lead to any error because the codepath seems to mostly not care about the realm arg if the sender is a cross-realm bot. From my reading of the code I think that wrong realm arg here would break user mentions, because it makes its way to check_message() and then to build_message_send_dict - but overall the message gets sent without errors. Either way, this was a bug in the test and should be fixed.	2021-02-23 15:26:47 -08:00
sahil839	d71afc5a26	actions: Include ROLE_MODERATOR in realm_user_count_by_role. This commmit includes ROLE_MODERATOR in realm_user_count_by_role. We also update test_change_role in test_audit_log.py to include changes for moderator role as well.	2021-02-23 15:01:14 -08:00
sahil839	6b5cf231a1	users: Add new user 'shiva' as realm moderator. Note that at this point, it's not possible to create moderator users; this just will make it easier to write tests for logic involving them as we develop the feature.	2021-02-23 15:00:49 -08:00
sahil839	15e74a637c	tests: Check cases when full members and their bots can send messages. Currently there are only tests for verifying the error case and there are no tests to check the case where messages are sent successfully in 'STREAM_POST_POLICY_RESTRICT_NEW_MEMBERS' stream. This commit adds tests for checking that full members and bots owned by them can send message successfully in streams with post policy as 'STREAM_POST_POLICY_RESTRICT_NEW_MEMBERS'.	2021-02-18 18:38:52 -08:00
sahil839	3df87d0901	stream: Fix error handling in access_stream_for_send_message. According to tests we should not allow bot without owners to post in streams with STREAM_POST_POLICY_RESTRICT_NEW_MEMBERS. But the code does not handle this and the related test passes and raises error for case of bots without owner because the bot is itself a new member. This commit fixes this by adding a condition to check if there is no bot owner and then raise error if there is no owner.	2021-02-18 18:38:52 -08:00
Tushar912	dfafdda9b3	api: Add REST API endpoint for looking up a user by email address. Add new rest api endpoint GET users/{email} for looking up a user by email, which is useful especially for corporate API applications that might already have a user's email address. Fixes #14302.	2021-02-15 17:38:33 -08:00
Anders Kaseorg	d001676728	streams: Fix compose_views type safety. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-15 17:05:28 -08:00
Anders Kaseorg	dd2a3b45cd	test_service_bot_system: Strengthen for_all_bot_types decorator type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-15 17:05:28 -08:00
Anders Kaseorg	04a5e0c339	test_report: Avoid Any type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-15 17:05:28 -08:00
Shanu	7f196967ad	event_queue: Remove internal fields being leaked to the API. A few internal fields used for tracking which types of notifications have already been sent for a given message, like `hander_id` and the `push_notified` bundle of fields were being incorrectly included in message events delivered to clients clients. One could argue these fields might be useful hints to clients, but because notifications can be triggered later on via `missedmessage_hook`, they have no useful purpose in the API. This commit move these extended event field on a `internal_data` object within the event object, and delete this field in `contents()` for call points that would serve data to clients. Tweaked by tabbott to provide a cleaner interface. We're not bumping API_FEATURE_LEVEL because these fields have always been documented as being present only due to a bug, so no clients should be expecting or relying on them. Fixes: #15947.	2021-02-14 21:42:19 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	5028c081cb	python: Merge concatenated string literals that Black would uglify. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Mateusz Mandera	b8c8ea5262	tests: Fix bugs confusing recipient.type_id for other ids. These tests were accidentally passing due to numbers coinciding.	2021-02-09 17:45:34 -08:00
Vishnu KS	5c026d67e3	digest: Sort topics in descending order in get_hot_topics. We want topics with high diversity and large lengths. So they should be sorted with reverse=True. This bug seems to be introduced in `936171d258`	2021-02-09 10:35:47 -08:00
Alex Vandiver	d0f0c2f2ed	digest: Fix the structure that we enqueue across when digesting. This rename was missed in `bfa0bdf3d6`. Without this fix, digest messages fail to send.	2021-02-08 17:28:59 -08:00
m-e-l-u-h-a-n	0e6343c071	users: Clarify readability issues related to access_user_by_id. zerver/lib/users.py has a function named access_user_by_id, which is used in /users views to fetch a user by it's id. Along with fetching the user this function also does important validations regarding checking of required permissions for fetching the target user. In an attempt to solve the above problem this commit introduces following changes: 1. Make all the parameters except user_profile, target_user_id to be keyword only. 2. Use for_admin parameter instead of read_only. 3. Adds a documentary note to the function describing the reason for changes along with recommended way to call this function in future. 4. Changes in views and tests to call this function in this changed format. Changes were tested using ./tools/test-backend. Fixes #17111.	2021-02-05 17:31:45 -08:00
m-e-l-u-h-a-n	ccf520ff13	logging: Migrate many backend tests to use assertLogs. This commit migrates some of the backend tests to use assertLogs(), instead of mock.patch() as planned in #15331. Tweaked by tabbott to avoid tautological assertions.	2021-02-03 17:55:49 -08:00
m-e-l-u-h-a-n	7417ac9165	logging: Remove unncessary logging patches in backend tests. There were some tests that had mock patches for logging, although no logging was actually happening there. This commit removes such patches in `corporate/tests/test_stripe.py`, `zerver/tests/test_cache.py`, `zerver/tests/test_queue_worker.py`, and `zerver/tests/test_signup.py`.	2021-02-03 17:47:38 -08:00
Vishnu KS	edac24acf1	email_log: Inherit EmailLogBackEnd from smtp.EmailBackend. EmailLogBackend used to create a new EmailMessage and copy only certain values from the original EmailMultiAlternatives object. This resulted in the loss of information and made it harder to test PRs like https://github.com/zulip/zulip/pull/17121. So instead of creating a new EmailMessage, tweak and send the existing EmailMultiAlternatives object.	2021-01-29 14:51:38 -08:00
Aman Agrawal	b26727ed16	invite-new-users: Specify that the limit spans for the whole day.	2021-01-29 09:51:11 -08:00
Ganesh Pawar	a42f7a67e1	populate_db: Add images in test data. This isn't quite the right model, because we're not actually going through the upload code path, but it does at least provide some inline image previews in the data. Fixes part of #14991.	2021-01-27 17:52:28 -08:00
Anders Kaseorg	4ca66e7278	timezone: Correct common_timezones dictionary. The changes are as follows: • Fix one day offset in all western zones. • Correct CST from -64800 to -21600 and CDT from -68400 to -18000. • Disambiguate PST in favor of -28000 over +28000. • Add GMT, UTC, WET, previously excluded for being at offset 0. • Add ACDT, AEDT, AKST, MET, MSK, NST, NZDT, PKT, which the previous code did not find. • Remove numbered abbreviations -12, …, +14, which are unnecessary. • Remove MSD and PKST, which are no longer used. Hardcode the dict and verify it with a test, so that future discrepancies won’t go silently unnoticed. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-01-27 15:23:15 -08:00
Mateusz Mandera	bf9e5e52ce	dependencies: Upgrade to Django 3.0. Adjustments made due to changes in Django 3.0: (https://docs.djangoproject.com/en/3.0/releases/3.0/) - test_signup: INTERNAL_RESET_URL_TOKEN was moved to PasswordResetConfirmView.reset_url_token - test_message_fetch: "add_never_cache_headers() and never_cache() now add the private directive to Cache-Control headers." - "django.utils.html.escape() now uses html.escape() to escape HTML. This converts ' to ' instead of the previous equivalent decimal code '." - this requires adjusting the expected decimal code in some of the string fixtures in tests.	2021-01-26 10:20:00 -08:00
Aman Agrawal	961d1d0a68	community_topic_edit: Increase time limit to 3 days. 24hrs is a small time in an asynchronous conversation. Increased time limit of topic editing for non-admins to 3 days.	2021-01-25 14:55:33 -08:00
Steve Howell	1498b2ef69	apply_event: Fix broken deepcopy attempt for subs. When we were getting an apply_event call for a subscription/add event, we were trying not to mutate the event itself, but this clumsy code was still mutating the actual event: # Avoid letting 'subscribers' entries end up in the list for i, sub in enumerate(event['subscriptions']): event['subscriptions'][i] = \ copy.deepcopy(event['subscriptions'][i]) del event['subscriptions'][i]['subscribers'] This is only a theoretical bug. The only person who receives a subscription/add event is the current user. And it wouldn't have affected the current user, since the apply_event was correctly updating the state, and we wouldn't actually deliver the event to the client (because the whole point of apply_event is to prevent us from having to piggyback the super-recent events on to our payload or put them into the event queue and possibly race). The new code just cleanly makes a copy of each sub, if necessary, as we add them to state["subscriptions"]. And I updated the event schemas to reflect that subscribers is always present in subscription/add event. Long term we should probably avoid sending subscribers on this event when the clients don't set something like include_subscribers. That's a fairly complicated fix that involves passing in flags to ClientDescriptor. Alternatively, we could just say that our policy is that we never send subscribers there, but we instead use peer_add events. See issue #17089 for more details.	2021-01-21 15:04:07 -08:00
Steve Howell	e42baf9e13	minor: Clean up args for apply_events. I eliminate the defaults, since the existing code was already specificying values for most things. I move all the booleans to the bottom for both parameters and arguments. I require explicit keywords for everything but user_profile (which is now first). And, finally, I format the code in a more diff-friendly manner.	2021-01-21 15:04:07 -08:00
Steve Howell	f2586d2f9b	refactor: Introduce SubscriptionInfo dataclass. We use this as the return type for gather_subscriptions_helper and get_web_public_subs, instead of tuples.	2021-01-21 15:04:07 -08:00
Steve Howell	d9740045a5	refactor: Eliminate checks in build_stream_dict_for_sub. We eliminate some redundant checks. We also consistently provide a `subscribers` field in our stream data with `[]`, even if our users can't access subscribers. We therefore bump the API version and tweak the docs. (See further down for a detailed justification of the change.) Even though it is sometimes fine to have redundant code that is defensive in nature, some upcoming changes are gonna move subscriber-related logic out of build_stream_dict_for_sub for certain codepaths as part of our effort to streamline the payload for subscribers within page_params. So we can't rely on the code that I removed here inside of build_stream_dict_for_sub. Anyway, it makes more sense to do these checks explicitly in the validate function. The code in build_stream_dict_for_sub was almost effectively a noop, since the validation function was already preventing us from getting subscriber info. The only difference it made was sometimes converting `[]` to `None`, and then subsequently omitting the subscribers field. Neither ZT nor the webapp make any distinction between `[]` or <missing key> for the `subscribers` data in `page_params`. The webapp has had this code for a long time (and now equivalent code elsewhere in this PR): if (!Object.prototype.hasOwnProperty.call(sub, "subscribers")) { sub.subscribers = new LazySet([]); } The webapp calculates access based on booleans, anyway: sub.can_access_subscribers = page_params.is_admin \|\| sub.subscribed \|\| (!page_params.is_guest && !sub.invite_only); And ZT would choke if `subscribers` were missing, except that it never gets to the relevant code due to other checks: def get_other_subscribers_in_stream(<snip>): assert stream_id is not None or stream_name is not None if stream_id: assert self.is_user_subscribed_to_stream(stream_id) return [sub for sub in self.stream_dict[stream_id]['subscribers'] if sub != self.user_id] else: return [sub for _, stream in self.stream_dict.items() for sub in stream['subscribers'] if stream['name'] == stream_name if sub != self.user_id] You could make a semantic argument that we should prefer <missing key> to `[]` when subscribers aren't even available, but we have precedent from the way that `bulk_get_subscriber_user_ids` has traditionally populated its result: result: Dict[int, List[int]] = {stream["id"]: [] for stream in stream_dicts} If we changed `stream_dicts` to `target_stream_dicts` we would faciliate a move toward `None`, but it would just cause headaches for other server code as well as the frontends (which, to reiterate, already prefer the empty array for convenience).	2021-01-21 15:04:07 -08:00
Mateusz Mandera	fcc8debc3a	users: Use realm.host in dummy user addresses without email visibility. By moving the relevant logic from realm.get_bot_domain to get_fake_email_domain we will make realm.host be used (if possible) for dummy user addresses. That is, instead of user11@zulipchat.com, the address will become user11@subdomain.zulipchat.com.	2021-01-21 13:04:38 -08:00
Mateusz Mandera	2283aa8a62	bots: Use realm.host for bot email domain if possible. With the change in `d70e1bcdb7`, bots get email like bot@zulip.com with EXTERNAL_HOST="zulip.com", rather than bot@subdomain.zulip.com, which was the old format. That's not desirable, so with this commit, realm.host will be used when possible and only falling back to FAKE_EMAIL_DOMAIN if needed.	2021-01-21 13:04:38 -08:00
Steve Howell	c693ae8982	event tests: Cover do_update_user_status better. We often send only one field (away or status_text) to be updated. So we have to make our schema support optional keys. As a result of the more flexible schema, we no longer need to exempt the node fixtures from our schema checks.	2021-01-20 13:17:32 -08:00
Steve Howell	36b1794c1d	user_status: Fix bug with resetting away status. The fix is pretty simple here--if the client doesn't send an away status, then don't change it. I improved the tests to cover this case. Fixes #17071	2021-01-20 13:59:35 -05:00
Steve Howell	1040fb7219	email digests: Remove handle_digest_email shim. The previous commit made it so we only call the shim in tests, so now we completely remove it.	2021-01-17 11:28:30 -08:00
Steve Howell	bfa0bdf3d6	email digests: Process users in chunks of 30. This should make the queue empty more quickly, because we do bulk queries to prevent database hops.	2021-01-17 11:28:30 -08:00
Steve Howell	e0b451730a	email digests: Extract get_new_streams. This makes us more efficient when handling multiple users. We don't have to keep sending the same two queries to the database. Note that as part of this we eliminated a failure mode for the obscure population of users from whom both `user.is_guest` and `user.can_access_public_streams()` returns False. We know this would have only affected Zephyr users (by looking at the code), and we know we don't actually process Zephyr users for email digests (or else we would have raised exceptions in the old code).	2021-01-17 11:28:30 -08:00
Steve Howell	23de94504f	email digests: Query streams for messages up front. This should save us many hops to the database when we process users in bulk.	2021-01-17 11:28:30 -08:00
Steve Howell	f8bbb7fea9	email digests: Use select_related("realm"). We mostly need realm_id, but when we go to build message lists, we need realm.uri. We could probably be more aggresive about using `only` here, but for now I am just trying to reduce hops to the database.	2021-01-17 11:28:29 -08:00
Steve Howell	52e2d5a733	email digests: Avoid long_term_idle check. We want to exclude users with recent subscription activity from emails, regardless of whether the long_term_idle flag is set.	2021-01-17 11:28:29 -08:00
Steve Howell	162b372b93	email digests: Do one query for recent streams. This is another way to limit hops to the database when we process users in bulk.	2021-01-17 11:28:29 -08:00
Alex Vandiver	c2526844e9	worker: Remove SignupWorker and friends. ZULIP_FRIENDS_LIST_ID and MAILCHIMP_API_KEY are not currently used in production. This removes the unused 'signups' queue and worker.	2021-01-17 11:16:35 -08:00
Steve Howell	04b6108e71	minor: Require keywords for verify_action.	2021-01-17 12:31:04 -05:00
Steve Howell	3df507be73	refactor: Clean up args for fetch_initial_state_data. We now require explicit keywords for all arguments to fetch_initial_state_data except user_profile. We provide reasonable defaults to keep the test code concise.	2021-01-17 12:31:04 -05:00
Alex Vandiver	08d716c741	registration: Re-use the redirect_to_email_login_url helper. In the case of reusing a registration link, reuse the redirect_to_email_login_url helper. This does have the side effect of now showing a "you've already registered" note, which did not happen previously, but that seems probably for the best, since the user did just click a "register" link.	2021-01-13 11:28:32 -08:00
Tushar912	c60f48c889	registration: Move "already in realm" check outside of validation. Checking for `validate_email_not_already_in_realm` again (after the form already did so), but only in the case that the form fails to validate, means that we may be spending time pushing totally invalid emails to the DB to check. In the case of emails containing nulls, this can even trigger a 500 error from PostgreSQL. Stop calling `validate_email_not_already_in_realm` in the form validation. The form is currently only used in two places -- in `accounts_home` and in `maybe_send_to_registration`. The latter is only called if the address is known to not currently have an account, so checking in there is unnecessary; and in the former case, we wish different behaviour (the redirect) than just validation failure, which is all the validator can do. Fixes #17015. Co-authored-by: Alex Vandiver <alexmv@zulip.com>	2021-01-13 11:28:32 -08:00
Tushar912	410bb8ad89	imports: Add better checking for subdomains. Add a `--allow-reserved-subdomain` flag which allows creation of reserved keyword domains. This also always enforces that the domain is not in use, which was removed in `0258d7d`. Fixes #16924.	2021-01-12 17:54:01 -08:00
sushant52	6f0e8a9888	auth: Handle the case of invalid subdomain at various points. Fixes #16770.	2021-01-11 22:29:50 -08:00
Siddharth Asthana	6c888977a6	change_subdomain: Create a deactivated realm on updating subdomain. When changing the subdomain of a realm, create a deactivated realm with the old subdomain of the realm, and set its deactivated_redirect to the new subdomain. Doing this will help us to do the following: - When a user visits the old subdomain of a realm, we can tell the user that the realm has been moved. - During the registration process, we can assure that the old subdomain of the realm is not used to create a new realm. If the subdomain is changed multiple times, the deactivated_redirect fields of all the deactivated realms are updated to point to the new uri.	2021-01-07 14:15:22 -08:00
Aman Agrawal	e566e985e4	topic_edit: Store edit history in all the message affected. Instead of just storing the edit history in the message which triggered the topic edit, we store the edit history in all the messages that changed. This helps users track the edit history of a message more reliably.	2021-01-04 18:18:05 -08:00
Aman Agrawal	c685d36821	hipchat_import: Remove tool from codebase. Remove functions and scripts used by HipChat import tool and those which will no longer be required in future.	2020-12-23 08:28:49 -08:00
Aman Agrawal	62d721e859	docs: Remove HipChat migration guide. As of Feb 15th 2019, Hipchat Cloud and Stride have reached End Of Life and are no longer supported by Atlassian. Since it is almost 2 years now we can remove the migration guides.	2020-12-23 15:43:13 +05:30
Vishnu KS	9fe39646fa	analytics: Specify exact end_time in realm summary query. Fetchings rows with end_time within the last 25 hours would result in the realmcount queries returning two rows for each realm if the analytics page was opened within an hour since the count stats were updated.	2020-12-22 16:44:31 -08:00
Mateusz Mandera	160cc5120a	api: Require can_create_users permission to create users via API. Allowing any admins to create arbitrary users is not ideal because it can lead to abuse issues. We should require something stronger that requires the server operator's approval and thus we add a new can_create_users permission.	2020-12-21 13:20:21 -08:00
Mateusz Mandera	d0dc04a093	models: Rename is_api_super_user to can_forge_sender,	2020-12-21 13:15:39 -08:00
sahil839	2fa33be683	actions: Refactor check_message to change return dataclass instead of Dict. We change the return type of check_message to be dataclass instead of Dict[str, Any]. This refactoring helps us to understand the context of the data structure returned by check_message clearly which was not possible when using Dict. SendMessageRequest class is added in zerver/lib/message.py inspite of it not being used in that file itself just to maintain consistency as other TypedDicts and dataclasses are defined in that file and to avoid circular dependency as SendMessageRequest is being used in lib/widget.py as well. We also rename local variable to 'send_request' for accessing SendMessageRequest objects.	2020-12-21 12:55:30 -08:00
Anders Kaseorg	a054f57af6	message: Bundle message stripping, validation, and truncation. We always want to do these at the same time. Previously, message editing did too much stripping (fixes #16837) and failed to check for NUL bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-18 17:44:13 -08:00
Anders Kaseorg	6b8f4782c4	test_mattermost_importer: Fix test for admins-to-owners change. Commit `ed498e2f8e` forgot to update this test. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-17 18:59:08 -08:00
Anders Kaseorg	2ab0b3d4fc	validator: Reject ISO 8601 dates missing leading zeros. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 16:36:50 -08:00
angela s	64becb20b5	logging: Set decorator tests to use assertLogs. Fixes part of #15331.	2020-12-15 11:46:25 -08:00
Alex Vandiver	7c849fa940	slack: Check token access scopes before importing. The Slack API always (even for failed requests) puts the access scopes of the token passed in, into "X-OAuth-Scopes"[1], which can be used to determine if any are missing -- and if so, which. [1] https://api.slack.com/legacy/oauth-scopes#working-with-scopes	2020-12-15 11:33:15 -08:00
Anders Kaseorg	bf45f921a7	url_preview: Allow Beautiful Soup to get the charset from <meta>. An HTML document sent without a charset in the Content-Type header needs to be scanned for a charset in <meta> tags. We need to pass bytes instead of str to Beautiful Soup to allow it to do this. Fixes #16843. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 11:30:57 -08:00
Siddharth Asthana	daac7536f3	accounts/deactivated: Show deactivated_redirect url if present If a user visits a realm which has been deactivated and it's deactivated_redirect field is set, we should have a message telling the user that the realm has moved to the deactivated_redirect url.	2020-12-14 21:04:52 -08:00
Siddharth Asthana	82f5759299	Realm: Add a deactivated_redirect URLField to Realm object. We export a realm's data, and disable the realm, because the user is moving from Zulip Cloud (e.g. https://example.zulipchat.com/) to self-hosting or another platform (e.g. https://zulip.example.com/) which we do not control. This commit adds a field in the realm object called deactivated_redirect to store the url to which the realm has moved.	2020-12-14 21:04:52 -08:00
Sundar Guntnur	cbb7fb8ac0	anchor_value: Fix parsing of large anchor values. This handles the conditions when anchor values are larger than LARGER_THAN_MAX_MESSAGE_ID by clamping them down to it. Also added tests for the function parse_anchor_value. Fixes #16768.	2020-12-02 11:00:22 -08:00
Steve Howell	92ce2d0e31	events: Fix apply_event for streams. In `1bcb8d8ee8` I made it so the webapp doesn't include "streams" in its state from `fetch_initial_state_data`, but I didn't address all the places in apply_event.	2020-12-01 13:01:38 -08:00
Steve Howell	c566ecfb30	minor: Remove dead code in events test.	2020-12-01 13:01:38 -08:00
Anders Kaseorg	13e35bfa94	mypy: Use sqlalchemy-stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Steve Howell	99e725cbde	populate_db: Simplify how we create reactions. For 3000 messages and 400 users, this saved about 30 seconds. We only do two queries per batch of messages now, and the algorithm is easier to analyze, as it's just three nested loops.	2020-11-16 17:19:23 -08:00
Steve Howell	e2e0f06b2a	email digests: Call get_recent_topics once per batch. Once we start processing digests in batch, this will let us amortize the expense of the message query over multiple users.	2020-11-16 08:59:29 -08:00
Steve Howell	1d1e45e9ec	digests: Use UserActivityInterval for user activity. Note that we are much more efficient about finding active users here: - we do one query per realm (instead of per-user) - we pass the cutoff date to the database - we get back just a list of distinct ids	2020-11-16 08:59:29 -08:00
Steve Howell	b52f56080e	performance: Just get user_ids to queue digest emails.	2020-11-16 08:59:29 -08:00
Steve Howell	d0260392f7	digests: Get user objects from the database. The query counts increase here for somewhat contrived reasons. The tests before this commit reflected a successful trip to the UserProfile cache, but that's not actually realistic in practice.	2020-11-16 08:59:29 -08:00
Steve Howell	7737413cec	digest tests: Improve gather_new_streams test. We don't need to mock the dates here. We also explicitly clear out all streams first, and then we explicitly test with both the stream being current and the stream being old.	2020-11-16 08:59:28 -08:00
Steve Howell	9538edde06	digest tests: Simplify bots test. We can use the _enqueue_emails_for_realm helper to avoid all the Tuesday-related logic here. We also don't bother to create UserActivity records, since the bot gets excluded by virtue of its being a bot. (Also, the date ranges here were sketchy due to the time mocking.)	2020-11-16 08:59:28 -08:00
Steve Howell	0624833af6	digest tests: Improve Tuesday tests. If we're mocking time, we should do it consistently.	2020-11-16 08:59:28 -08:00
Steve Howell	2f4d7a6171	tests: Fix test_inactive_users_queued_for_digest. We can avoid all the date mocking now for all but a couple tests that exercise the is-it-Tuesday logic. And this test now correctly tests that we exclude recently active users. And this allows us to remove the other test.	2020-11-16 08:59:28 -08:00
Steve Howell	cf6bcfb84a	digest emails: Exclude users who had recent digests. This code protects us in case we ever need to re-run email digests twice in the same day.	2020-11-16 08:59:28 -08:00
Steve Howell	fb3d4c1618	digest tests: Avoid warnings about naive time.	2020-11-16 08:59:28 -08:00
Steve Howell	4271442fba	email digests: Write RealmAuditLog rows.	2020-11-16 08:59:28 -08:00
Mateusz Mandera	4f47f35cb4	auth: Handle the case of invalid subdomain at /fetch_api_key endpoint.	2020-11-13 16:43:17 -08:00
Anders Kaseorg	8ba95063d5	test_markdown: Construct FencedBlockPreprocessor with a real Markdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
Anders Kaseorg	2a8a59f548	test_queue_worker: Simplify worker_queue_names computation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:46:04 -08:00
Mateusz Mandera	47228f3a95	actions: Implement do_delete_user. To have a reasonable way of creating the dummy user without duplicating code, we need change create_user to have the optional force_id argument.	2020-11-09 11:58:02 -08:00
akshatdalton	806c1a0b8b	markdown: Fix flickering of embedded link inside Italic. This commit fixes a bug in marked.js which caused it to double-escape HTML when rendering messages of the form: [text](url). This fixes a bug introduced in `3bdc8bbaa5`, where an unnecessary escape() call was added for the <em> code path, likely just because it was adjacent to the others that needed it in the file. Fix this, and add tests to verify that things are still being escaped once after removing this extra escape. Fixes #14845.	2020-11-06 10:09:15 -08:00
Steve Howell	c5dc9d386f	refactor: Use sets of stream_ids for email digests. I now use sets for stream_ids in more of the digest code. As part of this I replaced exclude_subscription_modified_streams with streams_recently_modified_for_user. It's easier for the caller to just ask for ids to delete from its callee than it is to pass in a set/list to mutate. The simpler boundary between the functions makes the tests easier to write--you can see the `filtered_streams` logic goes away in this diff. I also make the tests a bit more thorough by using combinations of Cordelia/Othello and Verona/Denmark to try to find multiple possible flaws. And I make the time intervals longer than 1s to avoid false negatives from slow CI boxes.	2020-11-05 17:42:43 -08:00
Steve Howell	88a57ed4ac	bulk digest: Get stream subscriptions in bulk. If we have multiple users, this reduces the amount of queries we need to do, because we get all subscriptions for all users in a single query to Subscription. For the single-user case, we are introducing an extra query hop, but the database is doing roughly the same work, because we are just breaking up this complex query into two hops: messages = select ... from message where recipient__type_id in ( select stream_id from subscription where ... ) Now it's more like: stream_ids = select stream_id from subscription where ... messages = select ... from message where recipient__type_id in stream_ids	2020-11-05 09:36:59 -08:00
Steve Howell	c83db37161	email digests: Introduce bulk methods for digest. Note that we are not changing anything semantically or algorithmically yet. The only overhead here for the single-user case is boxing and unboxing data into single-item dicts and lists. The interfaces for callers in the view and the queue processor remain the same for now.	2020-11-05 09:36:59 -08:00
Steve Howell	0e2d02b0a2	digest tests: Count cache tries.	2020-11-05 09:36:59 -08:00
Steve Howell	127f4e1291	digest tests: Add more users to bulk digest test.	2020-11-05 09:36:59 -08:00
Steve Howell	89cb3fa841	digest tests: Localize mocks. We didn't need the enough-traffic mock. We also continue to prep for testing multiple users. I also finally remove a comment that is about to be addressed (and which inaccurately refers to huddles).	2020-11-05 09:36:59 -08:00
Steve Howell	1ec16dd1da	digest tests: Prep to test bulk digests. All this does, essentially, is put the logic we used to test for othello inside of a loop. We'll add more users in the next commit.	2020-11-05 09:36:59 -08:00
Anders Kaseorg	13c11ec5f3	openapi: Fix escaping in curl command generation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-05 09:36:31 -08:00
Steve Howell	c1f134a3a4	performance: Use ORM to fetch sender in render_markdown. In `709493cd75` (Feb 2017) I added code to render_markdown that re-fetched the sender of the message, to detect whether the message is a bot. It's better to just let the ORM fetch this. The message object should already have sender. The diff makes it look like we are saving round trips to the database, which is true in some cases. For the main message-send codepath, though, we are only saving a trip to memcached, since the middleware will have put our sender's user object into the cache. The test_message_send test calls internally to check_send_stream_message, so it was actually hitting the database in render_markdown (prior to my change).	2020-11-05 09:35:15 -08:00
Steve Howell	637f596751	tests: Fix queries_captured to clear cache up front. Before this change we were clearing the cache on every SQL usage. The code to do this was added in February 2017 in `6db4879f9c`. Now we clear the cache just one time, but before the action/request under test. Tests that want to count queries with a warm cache now specify keep_cache_warm=True. Those tests were particularly flawed before this change. In general, the old code both over-counted and under-counted queries. It under-counted SQL usage for requests that were able to pull some data out of a warm cache before they did any SQL. Typically this would have bypassed the initial query to get UserProfile, so you will see several off-by-one fixes. The old code over-counted SQL usage to the extent that it's a rather extreme assumption that during an action itself, the entries that you put into the cache will get thrown away. And that's essentially what the prior code simulated. Now, it's still bad if an action keeps hitting the cache for no reason, but it's not as bad as hitting the database. There doesn't appear to be any evidence of us doing something silly like fetching the same data from the cache in a loop, but there are opportunities to prevent second or third round trips to the cache for the same object, if we can re-structure the code so that the same caller doesn't have two callees get the same data. Note that for invites, we have some cache hits that are due to the nature of how we serialize data to our queue processor--we generally just serialize ids, and then re-fetch objects when we pop them off the queue.	2020-11-05 09:35:15 -08:00
YashRE42	967efc32d2	widgets: Remove tictactoe example widget. Steve asked me to remove this, since the tictactoe game was always intended as a proof of concept. Now that we have poll and todo widgets, the sample code for tictactoe has much less value. We replace the content and type in test_widgets.py to maintain coverage.	2020-11-03 14:46:39 -08:00
Aman Agrawal	87cdd8433d	home: Allow logged out user through home. We allow user to load webapp without log-in. This is only be enabled for developed purposes now. Production setups will see no changes.	2020-11-02 17:07:12 -08:00
akshatdalton	620e9cbf72	markdown: Fix merging of separate quotations. Initally, when writing two or more quotes, having a blank line in between them, merges those quotes. This created confusion especially in "quote and reply". This commit fixes such issues. Now two or more quotes having a blank line in between them, will not get merged. This change is correct both for usability and for improving our compatibility with CommonMark. Fixes #14379.	2020-10-30 15:21:15 -07:00
Anders Kaseorg	aaa7b766d8	python: Use universal_newlines to get str from subprocess. We can replace ‘universal_newlines’ with ‘text’ when we bump our minimum Python version to 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	7c4f68d9cf	python: Skip unnecessary decode before BeautifulSoup parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	86e8d81c7f	python: Skip unnecessary decode before JSON parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	1802a50cc9	python: Use requests.Response.text instead of decoding content. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
sahil839	b29d39195c	streams: Do not allow default streams to be private. We now do not allow to make a stream private which is already a default stream.	2020-10-29 15:47:32 -07:00
sahil839	557ca0802c	streams: Do not allow private streams to be set as default. We now do not allow to set a private stream as default.	2020-10-29 15:43:37 -07:00
m-e-l-u-h-a-n	cbfd6464a5	logging: replace mock.patch() for logging with assertLogs() This commit removes mock.patch with assertLogs(). * Adds return value to do_rest_call() in outgoing_webhook.py, to support asserting log output in test_outgoing_webhook_system.py. * Logs are not asserted in test_realm.py because it would require to users to be queried using users=User.objects.filter(realm=realm) and the order of resulting queryset varies for each run. * In test_decorators.py, replacement of mock.patch is not done because I'm not sure if it's worth the effort to replace it as it's a return value of a function. Tweaked by tabbott to set proper mypy types.	2020-10-29 15:37:45 -07:00
Hemanth V. Alluri	99cf37dc51	drafts: Make the ID of the draft a part of the draft dict. Then because the ID is now part of the draft dict, we can (and do) change the structure of the "drafts" parameter returned from `GET /drafts` from an object (mapping ID to data) to an array. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	8d59fd2f45	tests/drafts: Simplify create_and_check_drafts_for_success. Sometimes we don't need to specify the expected_drafts field. So by removing it, we can reduce the clutter a bit. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	e60925b3e8	drafts: Change "timestamp" from float to integer. Now the timestamp returned in a draft dict will always be an int. The endpoints will still accept either an int or a float. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
m-e-l-u-h-a-n	be7a70e742	logging: Remove unnecessary mock.patch() for logging. Our test-backend validation confirms that we don't log anything to stdout in the tests, so the fact that CI passes with this removes shows there was nothing being logged.	2020-10-28 23:15:27 -07:00
Vishnu KS	fdea49742c	apps: Use GitHub API for generating the web app download link.	2020-10-28 23:04:14 -07:00
Alex Vandiver	f4eae83542	export: Only include real, active humans in the displayed count.	2020-10-28 18:31:06 -07:00
Anders Kaseorg	1352f2f233	python: Replace manual quote_plus usage with urlencode. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	4e9d587535	python: Pass query parameters as a dict when making GET requests. This provides automatic URL-encoding. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	41f509170b	users: Canonicalize the timezone identifier. While working on shifting toward native browser time zone APIs (#16451), it was found that all but very recent Chrome and Node versions reject certain legacy timezone aliases like US/Pacific (https://crbug.com/364374). For now, we only canonicalize the timezone property returned in user objects and not the timezone setting itself. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	0b288f92c9	timezone: Remove get_timezone wrapper. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Tim Abbott	6d7cd351a3	events: Optimize creating streams for new users. During the new user creation code path, there can be no existing active clients for the user being created, so we can skip the code to send events to that user's clients. The tests here reflect that we need to send fewer events, and do fewer queries that would have been spent computing data for these.. Fixes #16503, combined with the long series of recent changes by Steve Howell to fix super-linear behavior in this code path.	2020-10-26 12:47:15 -07:00
Steve Howell	88a7a1b002	events: Optimize peer_add/peer_remove for public streams. We no bulk up peer_add/peer_remove events by user if the same user has subscribed to multiple streams (and just that single user). This mostly optimizes the new-user codepath, but the algorithm is a bit more general in nature.	2020-10-26 12:33:28 -07:00
Alex Vandiver	7cf737988d	queue: Be more explicit about test/real queue division.	2020-10-26 12:32:47 -07:00
Anders Kaseorg	31d0141a30	python: Close opened files. Fixes various instances of ‘ResourceWarning: unclosed file’ with python -Wd. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-26 12:31:30 -07:00
Steve Howell	3ad1335a97	tests: Clear ContentType cache for user test. This keeps the number of queries predictable.	2020-10-26 07:18:08 -04:00
Steve Howell	5ef01b3ad8	tests: Fix test_create_user_with_multiple_streams. This test was flaky due to some date-related non-determinism. I make all the Message objects current to make add_new_user_history reliably try to bulk-update UserMessage rows to read.	2020-10-26 07:18:08 -04:00
Harsh Srivastava	9b31df009b	openapi: Fix excessively large test_events failure output. Because of the very large `oneOf` clause of the formats of events possible in Zulip's `GET /events` system, we had issues with `test-backend` failures for missing documentation for a new event format being like 1000 lines of output, which was very much unhelpful. Fix this by limiting the output use only the oneOf variants that are broadly similar to the actual payload received. Fixes #16023.	2020-10-23 17:00:17 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	b9fd49a2c6	mypy: Correct mistaken *args type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:29:13 -07:00
Anders Kaseorg	d295da676b	test_message_fetch: Clean up obsolete PGroonga bug workaround. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-22 23:27:23 -07:00
sahil839	571bb62e3d	events: Update subscriber list on peer_add for unsubscribed streams. We update the subscriber list on peer_add event for unsubscribed streams as well.	2020-10-22 15:12:32 -07:00
sahil839	733d26aef2	events: Update subscriber list on peer_remove for never subscribed stream. We now update the subscriber list on peer_remove event for never subscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	af9b153ee3	events: Update subscriber list on peer_remove for unsubscribed stream. We update the subscriber list on peer_remove event for unsubscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	709edd29d4	test_events: Fix comment in do_test_subscribe_events. The comment still pointed to 'vacate' event flow, but we have removed the vacate event in `a9356508ca`. This commit fixes the comment to depict the correct purpose of below lines, i.e. to test the remove event flow.	2020-10-22 15:12:32 -07:00
sahil839	e578742b02	test_events: Remove 'realm_user' from event_types in subscription test. We were including 'realm_user' in event_types along with 'subscription', but we don't send event of type 'realm_user' when subscribing to a new stream. This was added in `1c332f5d6a`. This commit removes 'realm_user' from event_types.	2020-10-22 15:12:32 -07:00
sahil839	d0f5537fb2	actions: Modify check_message for handling wildcard_mention_policy setting. This commit adds enforcement for sending messages containing wildcard mentions according to wildcard_mention_policy.	2020-10-22 14:46:32 -07:00
sahil839	25f32d461e	tests: Add tests for all the values of wildcard_mention_policy.	2020-10-22 12:08:22 -07:00
Mateusz Mandera	48f80fcb0a	auth: Expect name in request params in Apple auth. The name used to be included in the id_token, but this seems to have been changed by Apple and now it's sent in the `user` request param. https://github.com/python-social-auth/social-core/pull/483 is the upstream PR for this - but upstream is currently unmaintained, so we have to monkey patch. We also alter the tests to reflect this situation. Tests no longer put the name in the id_token, but rather in the `user` request param in the browser flow, just like it happens in reality. An adaptation has to be made in the native flow - since the name won't be included by Apple in the id_token anymore, the app, when POSTing to the /complete/apple/ endpoint, can (and should for better user experience) add the `user` param formatted as json of {"email": "hamlet@zulip.com", "name": {"firstName": "Full", "lastName": "Name"}} dict. This is also reflected by the change in the native flow tests.	2020-10-22 12:07:46 -07:00
Steve Howell	7ff3859136	subscriber events: Change schema for peer_add/peer_remove. We now can send an implied matrix of user/stream tuples for peer_add and peer_remove events. The client code basically does this: for stream_id in event['stream_ids']: for user_id in event['user_ids']: update_sub(stream_id, user_id) We used to send individual events, which gets real expensive when you are creating new streams. For the case of copy-to-stream case, we should see events go from U to 1, where U is the number of users added. Note that we don't yet fully optimize the potential of this schema. For adding a new user with lots of default streams, we still send S peer_add events. And if you subscribe a bunch of users to a bunch of private streams, we only go from U * S to S; we can't optimize it down to one event easily.	2020-10-22 11:19:53 -07:00
Steve Howell	85ed6f332a	performance: Avoid Recipient lookup for stream messages. All the fields of a stream's recipient object can be inferred from the Stream, so we just make a local object. Django will create a Message object without checking that the child Recipient object has been saved. If that behavior changes in some upgrade, we should see some pretty obvious symptom, including query counts changing. Tweaked by tabbott to add a longer explanatory comment, and delete a useless old comment.	2020-10-20 11:47:23 -07:00
Steve Howell	7bbcc2ac96	refactor: Compute peers for public streams later. This saves us a query for edge cases like when you try to unsubscribe from a public stream that you have already unsubscribed from. But this is mostly to prep for upcoming optimizations.	2020-10-20 11:31:22 -07:00
akshatdalton	287c4ed2bb	markdown: Fix Youtube and Vimeo preview overriding markdown link titles bug. Initially markdown titles were overridden by Youtube and Vimeo preview titles. But now it will check if any markdown title is present to replace Youtube or Vimeo preview titles, if preview of linked websites is enabled. Fixes #16100	2020-10-19 12:06:13 -07:00
Anders Kaseorg	d81a93cdf3	requirements: Upgrade markdown to 3.3.1. Upstream has slightly changed the whitespace around stashes. Take this opportunity to clean up the extra blank lines we were outputting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Steve Howell	4dce34ab8b	refactor: Simplify call to bulk_get_subscriber_user_ids. The way we were computing the dictionary was very convoluted--all we need is a set of subscribed user ids.	2020-10-18 14:27:31 -07:00
Steve Howell	0ca07ffd3c	peformance: Eliminate StreamRecipientMap. That class is an artifact of when Stream didn't have recipient_id. Now it's simpler to deal with stream subscriptions. We also save a query during page load (and other places where we get subscriber info).	2020-10-18 14:27:31 -07:00
Steve Howell	2f8ba383ef	tests: Test overhead for creating new users.	2020-10-18 14:27:31 -07:00
Mateusz Mandera	716df658fa	queue_processors: Don't run test queues with run-dev.py.	2020-10-18 14:07:31 -07:00
Steve Howell	e1bcf6124f	refactor: Remove recipient from access_stream_by_name.	2020-10-16 12:58:11 -07:00
Steve Howell	a51b483f1a	performance: Remove recipient from access_stream_by_id. The Recipient table is now kind of useless for stream-related operations, since we have recipient_id on Stream now.	2020-10-16 12:58:11 -07:00
Steve Howell	3685fcc701	refactor: Remove recipient arg for do_mute_topic.	2020-10-16 12:58:11 -07:00
Steve Howell	378062cc83	performance: Avoid call to access_stream_by_id. We already trust ids that are put on our queue for deferred work. For example, see the code for "mark_stream_messages_as_read_for_everyone" We now pass stream_recipient_id when we queue up work for do_mark_stream_messages_as_read. This generally saves about 3 queries per user when we unsubscribe them from a stream.	2020-10-16 12:58:11 -07:00
Steve Howell	2256d72015	minor: Add comment to subscriber test.	2020-10-16 12:58:11 -07:00
Steve Howell	31eb97ddde	performance: Fix do_mark_stream_messages_as_read. This function no longer asks for data that it doesn't need.	2020-10-16 12:58:11 -07:00
Steve Howell	6d1f9de7d3	performance: Use SubInfo when removing subscribers. We get two speedups: * The query to get existing subscribers only gets the two fields we need. We no longer need all the overhead of user_profile and recipient data being returned in the query. * We avoid Django making extra hops to the database to get user info.	2020-10-16 12:58:11 -07:00
Steve Howell	b4346d0276	performance: Extract subscribers/peers in bulk. We replace get_peer_user_ids_for_stream_change with two bulk functions to get peers and/or subscribers. Note that we have three codepaths that care about peers: subscribing existing users: we need to tell peers about new subscribers we need to tell subscribed user about old subscribers unsubscribing existing users: we only need to tell peers who unsubscribed subscribing new user: we only need to tell peers about the new user (right now we generate send_event calls to tell the new user about existing subscribers, but this is a waste of effort that we will fix soon) The two bulk functions are this: bulk_get_subscriber_peer_info bulk_get_peers They have some overlap in the implementation, but there are some nuanced differences that are described in the comments. Looking up peers/subscribers in bulk leads to some nice optimizations. We will save some memchached traffic if you are subscribing to multiple public streams. We will save a query in the remove-subscriber case if you are only dealing with private streams.	2020-10-15 15:12:01 -07:00
Steve Howell	c73f84f275	tests: Improve tests for unsubscribing multiple users. Note that the tests now reflect that we have O(N) behavior for multiple users.	2020-10-15 15:12:01 -07:00
Steve Howell	f86823f82f	tests: Add cache_tries_captured helper.	2020-10-15 15:12:01 -07:00
Steve Howell	a9356508ca	events: Stop sending occupy/vacate events. We used to send occupy/vacate events when either the first person entered a stream or the last person exited. It appears that our two main apps have never looked at these events. Instead, it's generally the case that clients handle events related to stream creation/deactivation and subscribe/unsubscribe. Note that we removed the apply_events code related to these events. This doesn't affect the webapp, because the webapp doesn't care about the "streams" field in do_events_register. There is a theoretical situation where a third party client could be the victim of a race where the "streams" data includes a stream where the last subscriber has left. I suspect in most of those situations it will be harmless, or possibly even helpful to the extent that they'll learn about streams that are in a "quasi" state where they're activated but not occupied. We could try to patch apply_event to detect when subscriptions get added or removed. Or we could just make the "streams" piece of do_events_register not care about occupy/vacate semantics. I favor the latter, since it might actually be what users what, and it will also simplify the code and improve performance.	2020-10-14 10:53:10 -07:00
Steve Howell	1bcb8d8ee8	performance: Avoid computing page_params.streams in webapp. The query to get "occupied" streams has been expensive in the past. I'm not sure how much any recent attempts to optimize that query have mitigated the issue, but since we clearly aren't sending this data, there is no reason to compute it.	2020-10-14 10:53:10 -07:00
Steve Howell	193ca397f9	tests: Include deactivated users for subscribe test.	2020-10-14 10:53:10 -07:00
Aman Agrawal	fbf7cb82a7	web_public_guest: Rename to web_public_visitor for clarity. Using web_public_guest for anonymous users is confusing since 'guest' is actually a logged-in user compared to web_public_guest which is not logged-in and has only read access to messages. So, we rename it to web_public_visitor.	2020-10-13 16:59:52 -07:00
Steve Howell	e7a8c7ac48	test: Improve tests for bulk-adding subscribers. This is a more thorough test of adding multiple streams for multiple users, including streams that users have already subscribed to. The extra queries here are due to the fact that we call `principal_to_user_profile` in a loop in the view. So that's an example of O(N) overhead. We may be able to bulk-fetch these users eventually.	2020-10-13 18:54:55 -04:00
Steve Howell	c29ba75135	refactor: Extract send_messages_for_new_subscribers. This is a pure extraction, except that I remove a redundant check that `len(principals) > 0`. Whenever that value is false, then `new_subscriptions` will only have one possible entry, which is the current user, and we skip that in the loop.	2020-10-13 18:54:55 -04:00
Steve Howell	3b338ec32e	performance: Optimize filter_stream_authorization. We no longer do O(N) queries to get existing streams. This is a somewhat contrived use case--generally, we are not trying to re-subscribe a user to several streams. Still, we want to avoid this. This commit also makes `test_bulk_subscribe_many` do more work, and the change to the test helped me discover this bug.	2020-10-13 18:54:55 -04:00
Anders Kaseorg	6564540d15	docs: Fix some spelling errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Anders Kaseorg	dd48dbd912	docs: Add spaces to “check out”, “log in”, “set up”, “sign up” as verbs. “Checkout”, “login”, “setup”, and “signup” are nouns, not verbs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Steve Howell	598601e8fc	stream events: Prevent spurious events. If a user asks to be subscribed to a stream that they are already subscribed to, then that stream won't be in new_stream_user_ids, and we won't need to send an event for it. This change makes that happen more automatically.	2020-10-13 11:28:17 -07:00
Steve Howell	766892d8aa	import: Reuse get_last_message_id() helper.	2020-10-13 11:28:17 -07:00
Steve Howell	188cc9bb3b	minor: Fix user/stream in test_subscriptions.	2020-10-13 11:28:17 -07:00
Steve Howell	9df9934ed6	refactor: Pass realm to bulk_add_subscriptions. I think it's important that the callers understand that bulk_add_subscriptions assumes all streams are being created within a single realm, so I make it an explicit parameter. This may be overkill--I would also be happy if we just included the assertions from this commit.	2020-10-13 11:28:17 -07:00
Anders Kaseorg	17ac17286c	python: Catch specific exceptions from subprocess. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	1346c5397a	zephyr: Use correct shell quoting for ssh. ssh always runs its command through a shell (after naïvely joining multiple arguments with spaces), so it needs an extra level of shell quoting. This should have no effect because we already validated user with a regex, but it’s better for escaping to be locally correct in case the context changes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Alex Vandiver	c2132a4f9c	queue: Drop register_json_consumer / json_drain_queue interface. Now that all callsites use the same interface, drop the now-unused ones, and their tests.	2020-10-11 14:19:42 -07:00
Alex Vandiver	5477b9d9a1	queue: Switch tests to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f9358d5330	queue: Switch batch interface to use the channel.consume iterator. This low-level interface allows consuming from a queue with timeouts. This can be used to either consume in batches (with an upper timeout), or one-at-a-time. This is notably more performant than calling `.get()` repeatedly (what json_drain_queue does under the hood), which is "highly discouraged as it is very inefficient"[1]. Before this change: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11158 / sec Dequeue rate: 3075 / sec ``` After: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11511 / sec Dequeue rate: 19938 / sec ``` [1] https://www.rabbitmq.com/consumers.html#fetching	2020-10-11 14:19:40 -07:00
Alex Vandiver	571f8b8664	queue: Use low-level queue_purge to empty at the end of tests. This is O(1) at the RabbitMQ API level, and doesn't rely on the code under test to function correctly during test cleanup.	2020-10-09 20:43:49 -07:00
Alex Vandiver	ac0ba21c2c	tests: Stop reusing a variable name. `loopworker_sleep_mock` is a file-level variable used to mock out the sleep() call in LoopQueueProcessingWorker; don't reuse the variable name for something else.	2020-10-09 20:42:20 -07:00
Alex Vandiver	754638f673	tests: Refactor test_queue_worker to separate queues.	2020-10-09 20:42:12 -07:00
Alex Vandiver	d5a6b0f99a	queue: Rename queue_size, and update for all local queues. Despite its name, the `queue_size` method does not return the number of items in the queue; it returns the number of items that the local consumer has delivered but unprocessed. These are often, but not always, the same. RabbitMQ's queues maintain the queue of unacknowledged messages; when a consumer connects, it sends to the consumer some number of messages to handle, known as the "prefetch." This is a performance optimization, to ensure the consumer code does not need to wait for a network round-trip before having new data to consume. The default prefetch is 0, which means that RabbitMQ immediately dumps all outstanding messages to the consumer, which slowly processes and acknowledges them. If a second consumer were to connect to the same queue, they would receive no messages to process, as the first consumer has already been allocated them. If the first consumer disconnects or crashes, all prior events sent to it are then made available for other consumers on the queue. The consumer does not know the total size of the queue -- merely how many messages it has been handed. No change is made to the prefetch here; however, future changes may wish to limit the prefetch, either for memory-saving, or to allow multiple consumers to work the same queue. Rename the method to make clear that it only contains information about the local queue in the consumer, not the full RabbitMQ queue. Also include the waiting message count, which is used by the `consume()` iterator for similar purpose to the pending events list.	2020-10-09 20:40:39 -07:00
Aman Agrawal	8b419c93e4	message_send: Fix old guests being treated as full members. For streams in which only full members are allowed to post, we block guest users from posting there. Guests users were blocked from posting to admin only streams already. So now, guest users can only post to STREAM_POST_POLICY_EVERYONE streams. This is not a new feature but a bugfix which should have happened when implementing full member stream policy / guest users.	2020-10-08 11:30:11 -07:00
Alex Vandiver	d47637fa40	queue: Set a max consume timeout with SIGALRM. SIGALRM is the simplest way to set a specific maximum duration that queue workers can take to handle a specific message. This only works in non-threaded environments, however, as signal handlers are per-process, not per-thread. The MAX_CONSUME_SECONDS is set quite high, at 10s -- the longest average worker consume time is embed_links, which hovers near 1s. Since just knowing the recent mean does not give much information[1], it is difficult to know how much variance is expected. As such, we set the threshold to be such that only events which are significant outliers will be timed out. This can be tuned downwards as more statistics are gathered on the runtime of the workers. The exception to this is DeferredWorker, which deals with quite-long requests, and thus has no enforceable SLO. [1] https://www.autodesk.com/research/publications/same-stats-different-graphs	2020-10-06 17:26:14 -07:00
Alex Vandiver	baf882a133	queue: Only ACK drain_queue once it has completed work on the list. Currently, drain_queue and json_drain_queue ack every message as it is pulled off of the queue, until the queue is empty. This means that if the consumer crashes between pulling a batch of messages off the queue, and actually processing them, those messages will be permanently lost. Sending an ACK on every message also results in a significant amount lot of traffic to rabbitmq, with notable performance implications. Send a singular ACK after the processing has completed, by making `drain_queue` into a contextmanager. Additionally, use the `multiple` flag to ACK all of the messages at once -- or explicitly NACK the messages if processing failed. Sending a NACK will re-queue them at the front of the queue. Performance of a no-op dequeue before this change: ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10847 / sec Dequeue rate: 2479 / sec ``` Performance of a no-op dequeue after this change (a 25% increase): ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10752 / sec Dequeue rate: 3079 / sec ```	2020-10-06 17:26:14 -07:00
Alex Vandiver	8cf37a0d4b	queue: Add a tool to profile no-op enqueue and dequeue actions.	2020-10-06 17:26:14 -07:00
Mateusz Mandera	6e83bcc0d5	custom_profile_fields: Don't allow leading/trailing whitespaces. Allowing such whitespaces can lead to hard to debug issues e.g. with ldap sync.	2020-10-02 14:58:06 -07:00
Aman Agrawal	08fbde4e7c	test_move_msgs: Rename variable for clarity.	2020-10-01 17:45:11 -07:00
Tim Abbott	8c8f3ee13b	test_classes: Extract home view helpers for reuse.	2020-10-01 15:14:25 -07:00
Tim Abbott	6d041a3b34	home: Include is_web_public_guest in page_params.	2020-10-01 15:07:19 -07:00
Aman Agrawal	b0d92b3ff6	HomeTest: Extract page_params keys to be used in other functions.	2020-10-01 14:39:54 -07:00
sahil839	78b98d8067	realm: Add wildcard_mention_policy setting. We add a new wildcard_mention_policy setting to handle wildcard mentions in large streams, with a wide range of policies available to organizations. We set the default to the safe option for preventing accidental spam: only stream administrators being able to use wildcard mentions in large streams.	2020-10-01 12:18:03 -07:00
sahil839	6c473ed75f	message: Call build_message_send_dict from check_message. We call build_message_send_dict from check_message instead of do_send_messages. This is a prep commit for adding a new setting for handling wildcard mentions in large streams.	2020-09-29 17:18:04 -07:00
Steve Howell	c199571112	mypy: Add StreamDict. This requires us to rework the view code a little bit to explicitly assign fields.	2020-09-29 16:49:10 -07:00
Tim Abbott	0c2d1f068d	docs: Extend documentation of event system testing.	2020-09-28 12:37:54 -07:00
Tim Abbott	3242fc7388	soft_deactivation: Fix typo in logging output.	2020-09-28 12:12:04 -07:00
palash	7a7db69935	test_push_notifications: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_push_notifications.py	2020-09-28 12:12:00 -07:00

... 2 3 4 5 6 ...

5426 Commits