zulip

Commit Graph

Author	SHA1	Message	Date
Max Zawisa	0e40cc72af	newrelic: Added owner field and cleaned up code. I reformatted the tests and view to include information about who acknowledged and closed the alert. Only includes the information about the owner if there was an owner. Made a few small changes to the refactored bit as requested in review.	2020-12-15 12:04:46 -08:00
Max Zawisa	57e847ab89	newrelic: refactor of time input handling. Moved time formatting check and conversion to zerver/lib/webhooks/common.py. Updated tests slightly to match new output. Removed duration from the calculation because the difference is less than the precision of output and it complicated the error handling.	2020-12-15 12:04:46 -08:00
Max Zawisa	ec00557962	docs: Updated New Relic documentation. The docs are updated to work with the new webhook and new process on https://one.newrelic.com.	2020-12-15 12:04:46 -08:00
Mateusz Mandera	b652cc786c	django3: Remove remaining postgresql_psycopg2 use. Removed in Django 3.0.	2020-12-15 11:52:32 -08:00
angela s	64becb20b5	logging: Set decorator tests to use assertLogs. Fixes part of #15331.	2020-12-15 11:46:25 -08:00
Alex Vandiver	438d2aa632	digests: Ensure that the teaser_data can be JSON-serialized. Leaving this as a set means that it fails in zerver.lib.send_email when serializing into a ScheduledEmail object.	2020-12-15 11:44:50 -08:00
Alex Vandiver	7c849fa940	slack: Check token access scopes before importing. The Slack API always (even for failed requests) puts the access scopes of the token passed in, into "X-OAuth-Scopes"[1], which can be used to determine if any are missing -- and if so, which. [1] https://api.slack.com/legacy/oauth-scopes#working-with-scopes	2020-12-15 11:33:15 -08:00
Anders Kaseorg	415897f491	api docs: Use normal async/await code in JavaScript examples. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 11:32:18 -08:00
Anders Kaseorg	bf45f921a7	url_preview: Allow Beautiful Soup to get the charset from <meta>. An HTML document sent without a charset in the Content-Type header needs to be scanned for a charset in <meta> tags. We need to pass bytes instead of str to Beautiful Soup to allow it to do this. Fixes #16843. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-15 11:30:57 -08:00
Siddharth Asthana	daac7536f3	accounts/deactivated: Show deactivated_redirect url if present If a user visits a realm which has been deactivated and it's deactivated_redirect field is set, we should have a message telling the user that the realm has moved to the deactivated_redirect url.	2020-12-14 21:04:52 -08:00
Siddharth Asthana	82f5759299	Realm: Add a deactivated_redirect URLField to Realm object. We export a realm's data, and disable the realm, because the user is moving from Zulip Cloud (e.g. https://example.zulipchat.com/) to self-hosting or another platform (e.g. https://zulip.example.com/) which we do not control. This commit adds a field in the realm object called deactivated_redirect to store the url to which the realm has moved.	2020-12-14 21:04:52 -08:00
Anders Kaseorg	2c5e9f65f8	eslint: Fix new-cap errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-10 19:52:22 -08:00
Puneeth Chaganti	5dc3489166	webhooks/sentry: Fix URL generated in transform_webhook_payload. The URL incorrectly had `event` in the URL path, instead of `events`. Closes #16783	2020-12-02 12:28:45 -08:00
Puneeth Chaganti	b7a08323aa	webhooks/sentry: Use received key when timestamp key is absent.	2020-12-02 12:28:45 -08:00
Sundar Guntnur	cbb7fb8ac0	anchor_value: Fix parsing of large anchor values. This handles the conditions when anchor values are larger than LARGER_THAN_MAX_MESSAGE_ID by clamping them down to it. Also added tests for the function parse_anchor_value. Fixes #16768.	2020-12-02 11:00:22 -08:00
Mateusz Mandera	43a0c60e96	exceptions: Make RateLimited into a subclass of JsonableError. This simplifies the code, as it allows using the mechanism of converting JsonableErrors into a response instead of having separate, but ultimately similar, logic in RateLimitMiddleware. We don't touch tests here because "rate limited" error responses are already verified in test_external.py.	2020-12-01 13:40:56 -08:00
Steve Howell	92ce2d0e31	events: Fix apply_event for streams. In `1bcb8d8ee8` I made it so the webapp doesn't include "streams" in its state from `fetch_initial_state_data`, but I didn't address all the places in apply_event.	2020-12-01 13:01:38 -08:00
Steve Howell	c566ecfb30	minor: Remove dead code in events test.	2020-12-01 13:01:38 -08:00
Vishnu KS	dabbc3445a	webhooks: Properly format the currency amount for refunds. By default all Stripe API amounts are in the currency's smallest unit. It's upto us to convert it to a bigger unit and show it to the end user. And refund event used to show the currency in the smallest unit which makes the output wrong when it comes to most currencies like USD, Europ, INR etc which uses a bigger unit(eg Dollar instead of Cents) as the standard.	2020-11-29 18:11:24 -08:00
Max Zawisa	f05a04e000	webhooks: Update NewRelic webhook for new format. Update the New Relic webhook and tests to match the format specified in the New Relic documentation. The new format sends a json body instead of using url parameters. The old format is no longer supported by New Relic according to their support staff; as a result, the fixtures for the old test cases were removed. Added fixtures for new test cases. Fixes: #16393.	2020-11-18 16:19:08 -08:00
Anders Kaseorg	13e35bfa94	mypy: Use sqlalchemy-stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Anders Kaseorg	8e0240300a	message_fetch: Skip intermediate mutation in limit_query_to_range. This avoids extra mypy annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Anders Kaseorg	d0d8c358b3	lint: Migrate typing.Text check to semgrep. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-16 18:17:41 -08:00
Steve Howell	99e725cbde	populate_db: Simplify how we create reactions. For 3000 messages and 400 users, this saved about 30 seconds. We only do two queries per batch of messages now, and the algorithm is easier to analyze, as it's just three nested loops.	2020-11-16 17:19:23 -08:00
Vishnu KS	5eb63ddb7a	webhooks: Handle dispute events with object IDs prefixed with du. Sometimes the dispute object IDs are prefixed with `du` instead of `dp`. https://freenode.logbot.info/stripe/20200605#c4059469 The correct long-term fix here would be to stop using object IDs to detect the object type of these events and instead maybe make use of "object" key instead. https://stripe.com/docs/api/disputes/object#dispute_object-object	2020-11-16 17:05:54 -08:00
Steve Howell	e2e0f06b2a	email digests: Call get_recent_topics once per batch. Once we start processing digests in batch, this will let us amortize the expense of the message query over multiple users.	2020-11-16 08:59:29 -08:00
Steve Howell	428f0564a0	minor: Move context code down in the function. This will make a subsequent diff a bit less noisy.	2020-11-16 08:59:29 -08:00
Steve Howell	1d1e45e9ec	digests: Use UserActivityInterval for user activity. Note that we are much more efficient about finding active users here: - we do one query per realm (instead of per-user) - we pass the cutoff date to the database - we get back just a list of distinct ids	2020-11-16 08:59:29 -08:00
Steve Howell	b52f56080e	performance: Just get user_ids to queue digest emails.	2020-11-16 08:59:29 -08:00
Steve Howell	e13e5d104d	refactor: Only require user_id for inactive_since(). This function is going away completely soon. It is querying everybody's entire UserActivity history instead of passing the cutoff date to the database!	2020-11-16 08:59:29 -08:00
Steve Howell	d0260392f7	digests: Get user objects from the database. The query counts increase here for somewhat contrived reasons. The tests before this commit reflected a successful trip to the UserProfile cache, but that's not actually realistic in practice.	2020-11-16 08:59:29 -08:00
Steve Howell	7737413cec	digest tests: Improve gather_new_streams test. We don't need to mock the dates here. We also explicitly clear out all streams first, and then we explicitly test with both the stream being current and the stream being old.	2020-11-16 08:59:28 -08:00
Steve Howell	9538edde06	digest tests: Simplify bots test. We can use the _enqueue_emails_for_realm helper to avoid all the Tuesday-related logic here. We also don't bother to create UserActivity records, since the bot gets excluded by virtue of its being a bot. (Also, the date ranges here were sketchy due to the time mocking.)	2020-11-16 08:59:28 -08:00
Steve Howell	0624833af6	digest tests: Improve Tuesday tests. If we're mocking time, we should do it consistently.	2020-11-16 08:59:28 -08:00
Steve Howell	2f4d7a6171	tests: Fix test_inactive_users_queued_for_digest. We can avoid all the date mocking now for all but a couple tests that exercise the is-it-Tuesday logic. And this test now correctly tests that we exclude recently active users. And this allows us to remove the other test.	2020-11-16 08:59:28 -08:00
Steve Howell	e49a482baf	email digests: Make transactions atomic.	2020-11-16 08:59:28 -08:00
Steve Howell	cf6bcfb84a	digest emails: Exclude users who had recent digests. This code protects us in case we ever need to re-run email digests twice in the same day.	2020-11-16 08:59:28 -08:00
Steve Howell	fb3d4c1618	digest tests: Avoid warnings about naive time.	2020-11-16 08:59:28 -08:00
Steve Howell	4271442fba	email digests: Write RealmAuditLog rows.	2020-11-16 08:59:28 -08:00
Mateusz Mandera	4f47f35cb4	auth: Handle the case of invalid subdomain at /fetch_api_key endpoint.	2020-11-13 16:43:17 -08:00
Anders Kaseorg	1275613812	requirements: Upgrade mypy to 0.790. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-12 15:44:30 -08:00
Anders Kaseorg	8ba95063d5	test_markdown: Construct FencedBlockPreprocessor with a real Markdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
Anders Kaseorg	e7e1fde6ec	fenced_code: Use immutable type for codehilite_conf. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
Anders Kaseorg	fbf8ce0305	markdown: Add types for extra Markdown members. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	b48bdc65b9	markdown: Fix AlertWordNotificationProcessor.run type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	9573f6dc00	markdown: Fix build_block_parser type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	4398eecd2b	markdown: Use immutable type for extension config. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	060036dfd5	markdown: Merge build_engine into Markdown constructor. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	08c64f5cfa	markdown: Fix imports for compatibility with typeshed stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	2a8a59f548	test_queue_worker: Simplify worker_queue_names computation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:46:04 -08:00
Anders Kaseorg	dc84e9696c	mypy: Fix types for redis. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:46:04 -08:00
Anders Kaseorg	3a8cf869db	python: Convert os.open(…, O_EXCL) to open(…, "x"). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-09 14:31:01 -08:00
Puneeth Chaganti	358f1f9ba7	webhooks/sentry: Support integration configured as webhook. Sentry allows adding simple webhooks without going through the process of creating an Internal Integration in Sentry's Integration Platform[1] (which our docs recommend). The payload from sent from such a (simple) webhook integration is slightly different from the payload sent by an Internal Integration webhook. This commit tries to wrangle this payload into a form that is usable by our webhook handler to send a notification message. [1]: https://sentry.io/integration-platform/	2020-11-09 12:02:49 -08:00
Mateusz Mandera	47228f3a95	actions: Implement do_delete_user. To have a reasonable way of creating the dummy user without duplicating code, we need change create_user to have the optional force_id argument.	2020-11-09 11:58:02 -08:00
akshatdalton	806c1a0b8b	markdown: Fix flickering of embedded link inside Italic. This commit fixes a bug in marked.js which caused it to double-escape HTML when rendering messages of the form: [text](url). This fixes a bug introduced in `3bdc8bbaa5`, where an unnecessary escape() call was added for the <em> code path, likely just because it was adjacent to the others that needed it in the file. Fix this, and add tests to verify that things are still being escaped once after removing this extra escape. Fixes #14845.	2020-11-06 10:09:15 -08:00
Steve Howell	5da4332620	minor: Add order-by-id to digest message query. The order-by-id is now explicit, and I add comments to explain the select_related tables.	2020-11-06 10:05:46 -08:00
Steve Howell	936171d258	refactor: Extract DigestTopic class. This gets us away from a lot of dictionary soup.	2020-11-06 10:05:46 -08:00
Steve Howell	e8b6c56322	refactor: Simplify get_hot_topics(). The code we deleted here was no longer doing anything. Maybe the code was always dead, or maybe it was written during a time when topics_by_diversity and topics_by_length actually had different keys. But now it's clearly cruft. If we have 4 or more topics, then the code above it would already have populated the list with 4 elements, and the `if num_convos < 4` condition would evaluate to False. And if we had 3 or fewer topics, then we would have already put all possible topics into our result, and the `topics_by_diversity[num_convos:4]` slice would be empty. It's possible that we should just have a simple heuristic for topic hotness like `10*num_senders + messages`, so we don't have to maintain this fiddly function, and we can just do something like `topics_by_score[:4]`.	2020-11-06 10:05:46 -08:00
Steve Howell	c5dc9d386f	refactor: Use sets of stream_ids for email digests. I now use sets for stream_ids in more of the digest code. As part of this I replaced exclude_subscription_modified_streams with streams_recently_modified_for_user. It's easier for the caller to just ask for ids to delete from its callee than it is to pass in a set/list to mutate. The simpler boundary between the functions makes the tests easier to write--you can see the `filtered_streams` logic goes away in this diff. I also make the tests a bit more thorough by using combinations of Cordelia/Othello and Verona/Denmark to try to find multiple possible flaws. And I make the time intervals longer than 1s to avoid false negatives from slow CI boxes.	2020-11-05 17:42:43 -08:00
Steve Howell	88a57ed4ac	bulk digest: Get stream subscriptions in bulk. If we have multiple users, this reduces the amount of queries we need to do, because we get all subscriptions for all users in a single query to Subscription. For the single-user case, we are introducing an extra query hop, but the database is doing roughly the same work, because we are just breaking up this complex query into two hops: messages = select ... from message where recipient__type_id in ( select stream_id from subscription where ... ) Now it's more like: stream_ids = select stream_id from subscription where ... messages = select ... from message where recipient__type_id in stream_ids	2020-11-05 09:36:59 -08:00
Steve Howell	c83db37161	email digests: Introduce bulk methods for digest. Note that we are not changing anything semantically or algorithmically yet. The only overhead here for the single-user case is boxing and unboxing data into single-item dicts and lists. The interfaces for callers in the view and the queue processor remain the same for now.	2020-11-05 09:36:59 -08:00
Steve Howell	7c89e46731	minor: Clean up some code formatting.	2020-11-05 09:36:59 -08:00
Steve Howell	4bd02eea19	minor: Use user, not user_profile, in some digest code.	2020-11-05 09:36:59 -08:00
Steve Howell	0e2d02b0a2	digest tests: Count cache tries.	2020-11-05 09:36:59 -08:00
Steve Howell	127f4e1291	digest tests: Add more users to bulk digest test.	2020-11-05 09:36:59 -08:00
Steve Howell	89cb3fa841	digest tests: Localize mocks. We didn't need the enough-traffic mock. We also continue to prep for testing multiple users. I also finally remove a comment that is about to be addressed (and which inaccurately refers to huddles).	2020-11-05 09:36:59 -08:00
Steve Howell	1ec16dd1da	digest tests: Prep to test bulk digests. All this does, essentially, is put the logic we used to test for othello inside of a loop. We'll add more users in the next commit.	2020-11-05 09:36:59 -08:00
Steve Howell	e31326c823	refactor: Extract get_digest_context. This eliminates the union type and boolean parameter, and it makes it a bit easier to migrate to a bulk-get approach.	2020-11-05 09:36:59 -08:00
Steve Howell	217967f743	refactor: Extract get_hot_topics. This extraction will make a bit more sense when we start doing bulk operations on a realm to get digests, but even now, it encapsulates the slightly complex way we cherry-pick the top 4 topics for a user.	2020-11-05 09:36:59 -08:00
Steve Howell	5a6d6f81ff	refactor: Extract get_recent_topic_activity.	2020-11-05 09:36:59 -08:00
Steve Howell	f987b014b3	refactor: Rename conversation to topic. Not only is topic shorter, but the name makes it clear that we're not dealing with abstract conversations here--we are truly bucketing by topic.	2020-11-05 09:36:59 -08:00
Steve Howell	6ac3cd3534	refactor: Use list of topics, not tuples.	2020-11-05 09:36:59 -08:00
Steve Howell	878e938a89	minor: Rename conversation_diversity to conversation_senders.	2020-11-05 09:36:59 -08:00
Steve Howell	6dc8250e9a	mypy: Add TopicKey type for digests.	2020-11-05 09:36:59 -08:00
Steve Howell	96f6064b18	refactor: Move Messages query down the digest stack. This prep step is mostly for diff hygiene; the next commit will make the code a bit nicer. The original code here had the nice property that most (but not all) of the DB work happened up front in `handle_digest_email`, and none of the DB work was delegated to the callers. But I prefer the tradeoff of making the helpers a bit more cohesive--let them get the data they need. And we have query-count coverage in our tests, so there's no real danger of having helpers down in the stack insidiously doing a bunch of extra DB hops.	2020-11-05 09:36:59 -08:00
Anders Kaseorg	13c11ec5f3	openapi: Fix escaping in curl command generation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-05 09:36:31 -08:00
Steve Howell	c1f134a3a4	performance: Use ORM to fetch sender in render_markdown. In `709493cd75` (Feb 2017) I added code to render_markdown that re-fetched the sender of the message, to detect whether the message is a bot. It's better to just let the ORM fetch this. The message object should already have sender. The diff makes it look like we are saving round trips to the database, which is true in some cases. For the main message-send codepath, though, we are only saving a trip to memcached, since the middleware will have put our sender's user object into the cache. The test_message_send test calls internally to check_send_stream_message, so it was actually hitting the database in render_markdown (prior to my change).	2020-11-05 09:35:15 -08:00
Steve Howell	637f596751	tests: Fix queries_captured to clear cache up front. Before this change we were clearing the cache on every SQL usage. The code to do this was added in February 2017 in `6db4879f9c`. Now we clear the cache just one time, but before the action/request under test. Tests that want to count queries with a warm cache now specify keep_cache_warm=True. Those tests were particularly flawed before this change. In general, the old code both over-counted and under-counted queries. It under-counted SQL usage for requests that were able to pull some data out of a warm cache before they did any SQL. Typically this would have bypassed the initial query to get UserProfile, so you will see several off-by-one fixes. The old code over-counted SQL usage to the extent that it's a rather extreme assumption that during an action itself, the entries that you put into the cache will get thrown away. And that's essentially what the prior code simulated. Now, it's still bad if an action keeps hitting the cache for no reason, but it's not as bad as hitting the database. There doesn't appear to be any evidence of us doing something silly like fetching the same data from the cache in a loop, but there are opportunities to prevent second or third round trips to the cache for the same object, if we can re-structure the code so that the same caller doesn't have two callees get the same data. Note that for invites, we have some cache hits that are due to the nature of how we serialize data to our queue processor--we generally just serialize ids, and then re-fetch objects when we pop them off the queue.	2020-11-05 09:35:15 -08:00
Tim Abbott	eae14baa05	api: URL-quote password when testing authentication API. The passwords generated for our development environment / test suite include the `+` character, which needs to be quoted when encoded as an HTTP POST parameter. This is hopefully sufficient to fix the CI failures we've seen with the tests for POST /api/v1/fetch_api_key; I haven't reproduced the failure so am not completely sure.	2020-11-03 15:55:30 -08:00
YashRE42	967efc32d2	widgets: Remove tictactoe example widget. Steve asked me to remove this, since the tictactoe game was always intended as a proof of concept. Now that we have poll and todo widgets, the sample code for tictactoe has much less value. We replace the content and type in test_widgets.py to maintain coverage.	2020-11-03 14:46:39 -08:00
Aman Agrawal	87cdd8433d	home: Allow logged out user through home. We allow user to load webapp without log-in. This is only be enabled for developed purposes now. Production setups will see no changes.	2020-11-02 17:07:12 -08:00
shanukun	be39672026	api_docs: Document the /fetch-api-key endpoint. With tweaks by tabbott to document addition details. Fixes: #16408.	2020-11-02 16:45:42 -08:00
shanukun	da9d586254	openapi: Add parameter examples for fetch api key endpoints.	2020-11-02 16:45:42 -08:00
Anders Kaseorg	ac5cbf7693	Revert "markdown: Escape lang when echoing back custom non-pygments languages." This reverts commit `564b199fe6`, which was part of #16308. Escaping is either required or incorrect; it is never “defensive”. This escaping is incorrect. lxml already escapes attributes during serialization (any other behavior would be a serious bug), and additional escaping just results in double escaping. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-02 16:23:48 -08:00
akshatdalton	620e9cbf72	markdown: Fix merging of separate quotations. Initally, when writing two or more quotes, having a blank line in between them, merges those quotes. This created confusion especially in "quote and reply". This commit fixes such issues. Now two or more quotes having a blank line in between them, will not get merged. This change is correct both for usability and for improving our compatibility with CommonMark. Fixes #14379.	2020-10-30 15:21:15 -07:00
Mateusz Mandera	cbeeadab16	delete_realm: Register a post_delete Realm handler. By registering a post_delete handler to clear appropriate caches in a nicer way, we can get rid of the ugly flush-memcached call in the delete_realm command.	2020-10-30 11:43:03 -07:00
Alex Vandiver	bff503feb4	delete_realm: Add command to completely remove realms. This will need some tweaking in upcoming commits.	2020-10-30 11:42:40 -07:00
Anders Kaseorg	3c663e48db	url_encoding: Skip unnecessary encode before quote. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	df10b306a6	python: Remove force_bytes. We are generally good enough at types to know whether a value is str or bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	cc55393671	python: Open text files as text to skip decode operations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	18d0e4664c	python: Replace binascii with bytes.hex to skip some decode operations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	aaa7b766d8	python: Use universal_newlines to get str from subprocess. We can replace ‘universal_newlines’ with ‘text’ when we bump our minimum Python version to 3.7. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	9281dccae4	python: Serialize lxml elements directly to str. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	7c4f68d9cf	python: Skip unnecessary decode before BeautifulSoup parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	86e8d81c7f	python: Skip unnecessary decode before JSON parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Anders Kaseorg	1802a50cc9	python: Use requests.Response.text instead of decoding content. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-30 11:36:38 -07:00
Tim Abbott	067cd3a97a	docs: Remove incorrect references to chat.zulip.org. Most of these are Help Center links that should be pointing to the production Help Center.	2020-10-29 16:46:40 -07:00
Tim Abbott	3b9c726fc6	outgoing_webhook: Avoid logging a bytes string. This fixes the new assertLogs() tests failing in CI; we fixed the weird use of bytes in the test, but not in the runtime code.	2020-10-29 15:55:11 -07:00
sahil839	7106069d4d	migration: Add migration to remove default status of private streams. This commit adds migration which removes default status of exisitng default private streams, i.e. private stream exists but they are no longer default.	2020-10-29 15:47:34 -07:00
sahil839	b29d39195c	streams: Do not allow default streams to be private. We now do not allow to make a stream private which is already a default stream.	2020-10-29 15:47:32 -07:00
sahil839	557ca0802c	streams: Do not allow private streams to be set as default. We now do not allow to set a private stream as default.	2020-10-29 15:43:37 -07:00
m-e-l-u-h-a-n	cbfd6464a5	logging: replace mock.patch() for logging with assertLogs() This commit removes mock.patch with assertLogs(). * Adds return value to do_rest_call() in outgoing_webhook.py, to support asserting log output in test_outgoing_webhook_system.py. * Logs are not asserted in test_realm.py because it would require to users to be queried using users=User.objects.filter(realm=realm) and the order of resulting queryset varies for each run. * In test_decorators.py, replacement of mock.patch is not done because I'm not sure if it's worth the effort to replace it as it's a return value of a function. Tweaked by tabbott to set proper mypy types.	2020-10-29 15:37:45 -07:00
Hemanth V. Alluri	99cf37dc51	drafts: Make the ID of the draft a part of the draft dict. Then because the ID is now part of the draft dict, we can (and do) change the structure of the "drafts" parameter returned from `GET /drafts` from an object (mapping ID to data) to an array. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	8d59fd2f45	tests/drafts: Simplify create_and_check_drafts_for_success. Sometimes we don't need to specify the expected_drafts field. So by removing it, we can reduce the clutter a bit. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Hemanth V. Alluri	e60925b3e8	drafts: Change "timestamp" from float to integer. Now the timestamp returned in a draft dict will always be an int. The endpoints will still accept either an int or a float. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-10-29 11:06:04 -07:00
Abhijeet Prasad Bodas	e98a8856c7	logging: Add logging in deferred_work queue processor. Adds logging statements in deferred_work queue consume.	2020-10-29 10:34:53 -07:00
m-e-l-u-h-a-n	be7a70e742	logging: Remove unnecessary mock.patch() for logging. Our test-backend validation confirms that we don't log anything to stdout in the tests, so the fact that CI passes with this removes shows there was nothing being logged.	2020-10-28 23:15:27 -07:00
Vishnu KS	fdea49742c	apps: Use GitHub API for generating the web app download link.	2020-10-28 23:04:14 -07:00
ryanreh99	dfa7ce5637	uploads: Support non-AWS S3-compatible server. Boto3 does not allow setting the endpoint url from the config file. Thus we create a django setting variable (`S3_ENDPOINT_URL`) which is passed to service clients and resources of `boto3.Session`. We also update the uploads-backend documentation and remove the config environment variable as now AWS supports the SIGv4 signature format by default. And the region name is passed as a parameter instead of creating a config file for just this value. Fixes #16246.	2020-10-28 21:59:07 -07:00
ryanreh99	1c370a975c	refactor: Access a bucket by calling `zerver.lib.uploads.get_bucket`.	2020-10-28 21:52:08 -07:00
Alex Vandiver	f4eae83542	export: Only include real, active humans in the displayed count.	2020-10-28 18:31:06 -07:00
Alex Vandiver	3037e22f61	tornado: Never use proxies when talking from Django to Tornado. The `no_proxy` parameter does not work to remove proxying[1]; in this case, since all requests with this adapter are to the internal Tornado process, explicitly pass in an empty set of proxies to disable proxying. [1] https://github.com/psf/requests/issues/4600	2020-10-28 12:13:04 -07:00
Alex Vandiver	97745688ca	docs: Link to the new doc home of the email gateway.	2020-10-28 12:13:04 -07:00
Alex Vandiver	2b0bbbb882	tools: Rename postgres to postgresql in tool names.	2020-10-28 11:57:02 -07:00
Alex Vandiver	5eb8064a1a	install: Rename postgres options to postgresql.	2020-10-28 11:55:32 -07:00
Alex Vandiver	1f7132f50d	docs: Standardize on PostgreSQL, not Postgres.	2020-10-28 11:55:16 -07:00
Alex Vandiver	142de0f670	queue: Increase default timeout to 30s, from 10s. Not all of the workers are known to be safe to interrupt; they might leave inconsistent state. As such, terminating them with timeouts should currently only be a last-resort against stalled queues, not a regular occurrence.	2020-10-27 16:39:31 -07:00
Alex Vandiver	c73dd194f0	sentry: Group all worker timeouts together, by queue. Since the exception can be triggered at arbitrary places in the stack based on whenever the alarm happens to fire, they do not often group together. Explicitly group them together, grouped only by which queue the work is in.	2020-10-27 16:39:31 -07:00
Anders Kaseorg	1352f2f233	python: Replace manual quote_plus usage with urlencode. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	4e9d587535	python: Pass query parameters as a dict when making GET requests. This provides automatic URL-encoding. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	41f509170b	users: Canonicalize the timezone identifier. While working on shifting toward native browser time zone APIs (#16451), it was found that all but very recent Chrome and Node versions reject certain legacy timezone aliases like US/Pacific (https://crbug.com/364374). For now, we only canonicalize the timezone property returned in user objects and not the timezone setting itself. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	a8b1691e97	timezone: Convert get_common_timezones cache to lru_cache. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	0b288f92c9	timezone: Remove get_timezone wrapper. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Anders Kaseorg	0134112b51	timezone: Remove get_all_timezones wrapper. Both callers want a set, and pytz already provides all_timezones_set. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:42:54 -07:00
Chris Bobbe	500f92169e	openapi: Fix the "the the" instances.	2020-10-26 12:50:01 -07:00
Chris Bobbe	dd43d4feb8	openapi: Note that a user's `avatar_url` may be missing. Like it already says in the detail about `user_avatar_url_field_optional`, but on the field itself [1]. [1] https://github.com/zulip/zulip-mobile/pull/4230#discussion_r493109645	2020-10-26 12:50:01 -07:00
Tim Abbott	6d7cd351a3	events: Optimize creating streams for new users. During the new user creation code path, there can be no existing active clients for the user being created, so we can skip the code to send events to that user's clients. The tests here reflect that we need to send fewer events, and do fewer queries that would have been spent computing data for these.. Fixes #16503, combined with the long series of recent changes by Steve Howell to fix super-linear behavior in this code path.	2020-10-26 12:47:15 -07:00
Steve Howell	88a7a1b002	events: Optimize peer_add/peer_remove for public streams. We no bulk up peer_add/peer_remove events by user if the same user has subscribed to multiple streams (and just that single user). This mostly optimizes the new-user codepath, but the algorithm is a bit more general in nature.	2020-10-26 12:33:28 -07:00
Alex Vandiver	7cf737988d	queue: Be more explicit about test/real queue division.	2020-10-26 12:32:47 -07:00
Anders Kaseorg	31d0141a30	python: Close opened files. Fixes various instances of ‘ResourceWarning: unclosed file’ with python -Wd. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-26 12:31:30 -07:00
Steve Howell	3ad1335a97	tests: Clear ContentType cache for user test. This keeps the number of queries predictable.	2020-10-26 07:18:08 -04:00
Steve Howell	5ef01b3ad8	tests: Fix test_create_user_with_multiple_streams. This test was flaky due to some date-related non-determinism. I make all the Message objects current to make add_new_user_history reliably try to bulk-update UserMessage rows to read.	2020-10-26 07:18:08 -04:00
Anders Kaseorg	96dee7e40b	decorator: Unfork redirect_to_login. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-25 14:58:53 -07:00
sahil839	a96811ab58	management: Replace knight command with change_user_role command. We replace knight command with change_user_role command which allows us to change role of a user to owner, admins, member and guest. We can also give/revoke api_super_user permission using this command. Tweaked by tabbott to improve the logging output and update documentation. Fixes #16586.	2020-10-25 14:55:08 -07:00
Harsh Srivastava	9b31df009b	openapi: Fix excessively large test_events failure output. Because of the very large `oneOf` clause of the formats of events possible in Zulip's `GET /events` system, we had issues with `test-backend` failures for missing documentation for a new event format being like 1000 lines of output, which was very much unhelpful. Fix this by limiting the output use only the oneOf variants that are broadly similar to the actual payload received. Fixes #16023.	2020-10-23 17:00:17 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	e513b75e86	markdown: Remove handler for old bug with incompatible twitter library. See commit `8b002040e0` and #86. The development environment bug that necessitated this handler has long been irrelevant. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:30:26 -07:00
Anders Kaseorg	b9fd49a2c6	mypy: Correct mistaken *args type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:29:13 -07:00
Anders Kaseorg	d295da676b	test_message_fetch: Clean up obsolete PGroonga bug workaround. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-22 23:27:23 -07:00
Anders Kaseorg	831d086110	i18n: Fix get_language_translation_data for zh_TW. Fixes #16600. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-22 16:43:02 -07:00
sahil839	571bb62e3d	events: Update subscriber list on peer_add for unsubscribed streams. We update the subscriber list on peer_add event for unsubscribed streams as well.	2020-10-22 15:12:32 -07:00
sahil839	733d26aef2	events: Update subscriber list on peer_remove for never subscribed stream. We now update the subscriber list on peer_remove event for never subscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	af9b153ee3	events: Update subscriber list on peer_remove for unsubscribed stream. We update the subscriber list on peer_remove event for unsubscribed streams also.	2020-10-22 15:12:32 -07:00
sahil839	709edd29d4	test_events: Fix comment in do_test_subscribe_events. The comment still pointed to 'vacate' event flow, but we have removed the vacate event in `a9356508ca`. This commit fixes the comment to depict the correct purpose of below lines, i.e. to test the remove event flow.	2020-10-22 15:12:32 -07:00
sahil839	e578742b02	test_events: Remove 'realm_user' from event_types in subscription test. We were including 'realm_user' in event_types along with 'subscription', but we don't send event of type 'realm_user' when subscribing to a new stream. This was added in `1c332f5d6a`. This commit removes 'realm_user' from event_types.	2020-10-22 15:12:32 -07:00
sahil839	d0f5537fb2	actions: Modify check_message for handling wildcard_mention_policy setting. This commit adds enforcement for sending messages containing wildcard mentions according to wildcard_mention_policy.	2020-10-22 14:46:32 -07:00
sahil839	25f32d461e	tests: Add tests for all the values of wildcard_mention_policy.	2020-10-22 12:08:22 -07:00
Mateusz Mandera	48f80fcb0a	auth: Expect name in request params in Apple auth. The name used to be included in the id_token, but this seems to have been changed by Apple and now it's sent in the `user` request param. https://github.com/python-social-auth/social-core/pull/483 is the upstream PR for this - but upstream is currently unmaintained, so we have to monkey patch. We also alter the tests to reflect this situation. Tests no longer put the name in the id_token, but rather in the `user` request param in the browser flow, just like it happens in reality. An adaptation has to be made in the native flow - since the name won't be included by Apple in the id_token anymore, the app, when POSTing to the /complete/apple/ endpoint, can (and should for better user experience) add the `user` param formatted as json of {"email": "hamlet@zulip.com", "name": {"firstName": "Full", "lastName": "Name"}} dict. This is also reflected by the change in the native flow tests.	2020-10-22 12:07:46 -07:00
Steve Howell	7ff3859136	subscriber events: Change schema for peer_add/peer_remove. We now can send an implied matrix of user/stream tuples for peer_add and peer_remove events. The client code basically does this: for stream_id in event['stream_ids']: for user_id in event['user_ids']: update_sub(stream_id, user_id) We used to send individual events, which gets real expensive when you are creating new streams. For the case of copy-to-stream case, we should see events go from U to 1, where U is the number of users added. Note that we don't yet fully optimize the potential of this schema. For adding a new user with lots of default streams, we still send S peer_add events. And if you subscribe a bunch of users to a bunch of private streams, we only go from U * S to S; we can't optimize it down to one event easily.	2020-10-22 11:19:53 -07:00
Anders Kaseorg	efa8dd3a47	compilemessages: Sort language list with Unicode Collation Algorithm. Right now the list of languages in Display settings → Default language is sorted in an unintuitive order due to the varying case conventions: British English Chinese (Taiwan) Deutsch English Hindi Indonesian (Indonesia) Lietuviškai Magyar Malayalam Nederlands Português Română Tiếng Việt Türkçe català español français galego italiano polski suomi svenska česky Русский Українська български српски فارسی தமிழ் 日本語简体中文繁體中文 한국어 Fix the sort to use the locale-independent Unicode Collation Algorithm: British English català česky Chinese (Taiwan) Deutsch English español français galego Hindi Indonesian (Indonesia) italiano Lietuviškai Magyar Malayalam Nederlands polski Português Română suomi svenska Tiếng Việt Türkçe български Русский српски Українська فارسی தமிழ் 한국어 日本語简体中文繁體中文 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-21 15:22:18 -07:00
Steve Howell	85ed6f332a	performance: Avoid Recipient lookup for stream messages. All the fields of a stream's recipient object can be inferred from the Stream, so we just make a local object. Django will create a Message object without checking that the child Recipient object has been saved. If that behavior changes in some upgrade, we should see some pretty obvious symptom, including query counts changing. Tweaked by tabbott to add a longer explanatory comment, and delete a useless old comment.	2020-10-20 11:47:23 -07:00
Steve Howell	7bbcc2ac96	refactor: Compute peers for public streams later. This saves us a query for edge cases like when you try to unsubscribe from a public stream that you have already unsubscribed from. But this is mostly to prep for upcoming optimizations.	2020-10-20 11:31:22 -07:00
Steve Howell	363e5d31a6	refactor: Split out public/private logic for peer events. This doesn't change anything yet, but the goal is to eventually optimize events for the case where one user (typically a new user) gets subscribed to multiple public streams.	2020-10-20 11:31:22 -07:00
Steve Howell	3961e69381	refactor: Extract send_peer_subscriber_events. We now use the same basic code to send peer_add and peer_remove events.	2020-10-20 11:31:22 -07:00
Anders Kaseorg	254b904965	markdown: Migrate off deprecated extension registration interface. Fixes #15205. https://python-markdown.github.io/change_log/release-3.0/#homegrown-ordereddict-has-been-replaced-with-a-purpose-built-registry https://python-markdown.github.io/change_log/release-3.0/#md_globals-keyword-deprecated-from-extension-api The priority numbers are arbitrarily chosen to preserve the existing order. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 18:31:12 -07:00
akshatdalton	287c4ed2bb	markdown: Fix Youtube and Vimeo preview overriding markdown link titles bug. Initially markdown titles were overridden by Youtube and Vimeo preview titles. But now it will check if any markdown title is present to replace Youtube or Vimeo preview titles, if preview of linked websites is enabled. Fixes #16100	2020-10-19 12:06:13 -07:00
Anders Kaseorg	d81a93cdf3	requirements: Upgrade markdown to 3.3.1. Upstream has slightly changed the whitespace around stashes. Take this opportunity to clean up the extra blank lines we were outputting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Anders Kaseorg	2ab15149bc	forms: Convert percent formatting to "".format. Caught by upgrading Semgrep. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Anders Kaseorg	f461a64a6b	i18n: Fix some ineffective calls to ugettext at top level. Translation has no effect when we don’t yet know what language we’re translating for. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-18 14:31:15 -07:00
Anders Kaseorg	bba43f35ca	i18n: Be deliberate about distinguishing ugettext and ugettext_lazy. The early str conversions in zerver.models were defeating the point of ugettext_lazy. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-18 14:31:15 -07:00
Steve Howell	e6f6f8d45f	refactor: Avoid "stream_id" on sub. There was no need to put "stream_id" on the sub dictionary here. It's kinda annoying to introduce the little helper here, but I feel that's better than crufting up the sub data structure.	2020-10-18 14:27:31 -07:00
Steve Howell	628a826aa2	minor: Move code and add comments about three lists.	2020-10-18 14:27:31 -07:00
Steve Howell	ffee129a35	refactor: Clean up is_web_public flag. The is_web_public flag is already in Stream.API_FIELDS, so there is no reason for all this complicated logic. There's no reason to hack it on to the subscription object.	2020-10-18 14:27:31 -07:00
Steve Howell	4dce34ab8b	refactor: Simplify call to bulk_get_subscriber_user_ids. The way we were computing the dictionary was very convoluted--all we need is a set of subscribed user ids.	2020-10-18 14:27:31 -07:00
Steve Howell	b58152abda	refactor: Introduce all_streams_map. We replace all_streams_id with a map. We also use it to populate never_subscribed_streams. And all_streams_map is a superset of stream_hash, which we will soon kill off as well.	2020-10-18 14:27:31 -07:00
Steve Howell	78384ebf1b	minor: Remove confusing parens. Apparently I put these parens in the code as part of `73c30774cb` during 2017. It looks like I extracted is_public during the middle of my change and forgot to remove the unnecessary parens. (The code was correct, but it makes it look like a tuple if you're skimming it too quickly.)	2020-10-18 14:27:31 -07:00
Steve Howell	d60dd94168	refactor: Extract funcs from gather_subscriptions_helper. This is a pure code move, apart from a little bit of quote cleanup and renames: user_profile -> user stream_dict -> result	2020-10-18 14:27:31 -07:00
Steve Howell	79fcf78143	refactor: Exclude "active" from API_FIELDS. We just need to make sure the relevant queries get it for the triage process.	2020-10-18 14:27:31 -07:00
Steve Howell	c5769d31f2	minor: Move code for web_public_stream_ids.	2020-10-18 14:27:31 -07:00
Steve Howell	0ca07ffd3c	peformance: Eliminate StreamRecipientMap. That class is an artifact of when Stream didn't have recipient_id. Now it's simpler to deal with stream subscriptions. We also save a query during page load (and other places where we get subscriber info).	2020-10-18 14:27:31 -07:00
Steve Howell	1951d75796	performance: Avoid select_related("realm"). We also move this query up in the function for some future refactorings.	2020-10-18 14:27:31 -07:00
Steve Howell	2f8ba383ef	tests: Test overhead for creating new users.	2020-10-18 14:27:31 -07:00
Mateusz Mandera	716df658fa	queue_processors: Don't run test queues with run-dev.py.	2020-10-18 14:07:31 -07:00
Steve Howell	57efe9d81a	performance: Streamline list_to_streams. We take advantage of stream.recipient to simplify the query's where clause and avoid the need for select_related("recipient").	2020-10-16 12:58:11 -07:00
Steve Howell	e1bcf6124f	refactor: Remove recipient from access_stream_by_name.	2020-10-16 12:58:11 -07:00
Steve Howell	a51b483f1a	performance: Remove recipient from access_stream_by_id. The Recipient table is now kind of useless for stream-related operations, since we have recipient_id on Stream now.	2020-10-16 12:58:11 -07:00
Steve Howell	31622feb87	refactor: Only return sub from access_stream_common. Let the callers access stream.recipient as needed. It costs the same, and some of the callers can actually stop caring about the actual Recipient object.	2020-10-16 12:58:11 -07:00
Steve Howell	bfd6e2b1fd	refactor: Use recipient_id to get topic history.	2020-10-16 12:58:11 -07:00
Steve Howell	3685fcc701	refactor: Remove recipient arg for do_mute_topic.	2020-10-16 12:58:11 -07:00
Steve Howell	65dbee4837	minor: Ask for recipient_id, not recipient.	2020-10-16 12:58:11 -07:00
Steve Howell	378062cc83	performance: Avoid call to access_stream_by_id. We already trust ids that are put on our queue for deferred work. For example, see the code for "mark_stream_messages_as_read_for_everyone" We now pass stream_recipient_id when we queue up work for do_mark_stream_messages_as_read. This generally saves about 3 queries per user when we unsubscribe them from a stream.	2020-10-16 12:58:11 -07:00
Steve Howell	2256d72015	minor: Add comment to subscriber test.	2020-10-16 12:58:11 -07:00
Steve Howell	31eb97ddde	performance: Fix do_mark_stream_messages_as_read. This function no longer asks for data that it doesn't need.	2020-10-16 12:58:11 -07:00
Steve Howell	6d1f9de7d3	performance: Use SubInfo when removing subscribers. We get two speedups: * The query to get existing subscribers only gets the two fields we need. We no longer need all the overhead of user_profile and recipient data being returned in the query. * We avoid Django making extra hops to the database to get user info.	2020-10-16 12:58:11 -07:00
Steve Howell	73982f6cc9	refactor: Move SubInfo to stream_subscription.py.	2020-10-16 12:58:11 -07:00
Tim Abbott	caa939d2d5	actions: Use transaction.atomic properly when removing subscriptions. Previously, the transaction.atomic() was not properly scoped to ensure that RealmAuditLog entries were created in the same transaction, making it possible for state changes to not be properly recorded in RealmAuditLog.	2020-10-15 15:12:05 -07:00
Steve Howell	0b91526f28	events: Remove "occupied" semantics for "streams". When apps like mobile register for "streams", we will now just use active streams as our baseline, rather than "occupied" streams. This means we will send a stream that is active, even if it happens to have zero occupants. It's actually pretty rare that a stream has zero occupants, and it's not exactly clear that we want to exclude a non-occupied but otherwise active stream from our list of streams. It also happens to be fairly expensive to compute whether a stream is occupied. This change only affects API clients (including possibly our mobile app). The main webapp never used the data from this codepath.	2020-10-15 15:12:01 -07:00
Steve Howell	b4346d0276	performance: Extract subscribers/peers in bulk. We replace get_peer_user_ids_for_stream_change with two bulk functions to get peers and/or subscribers. Note that we have three codepaths that care about peers: subscribing existing users: we need to tell peers about new subscribers we need to tell subscribed user about old subscribers unsubscribing existing users: we only need to tell peers who unsubscribed subscribing new user: we only need to tell peers about the new user (right now we generate send_event calls to tell the new user about existing subscribers, but this is a waste of effort that we will fix soon) The two bulk functions are this: bulk_get_subscriber_peer_info bulk_get_peers They have some overlap in the implementation, but there are some nuanced differences that are described in the comments. Looking up peers/subscribers in bulk leads to some nice optimizations. We will save some memchached traffic if you are subscribing to multiple public streams. We will save a query in the remove-subscriber case if you are only dealing with private streams.	2020-10-15 15:12:01 -07:00
Steve Howell	94e41c71f9	refactor: Use set of ids for altered users.	2020-10-15 15:12:01 -07:00
Steve Howell	b894597fa3	refactor: Use sets of stream_ids for helper args.	2020-10-15 15:12:01 -07:00
Steve Howell	3889554977	refactor: Extract send_peer_remove_events.	2020-10-15 15:12:01 -07:00
Steve Howell	c73f84f275	tests: Improve tests for unsubscribing multiple users. Note that the tests now reflect that we have O(N) behavior for multiple users.	2020-10-15 15:12:01 -07:00
Steve Howell	f86823f82f	tests: Add cache_tries_captured helper.	2020-10-15 15:12:01 -07:00
Steve Howell	ce70d08cbf	test_helpers: Use mock.patch.multiple.	2020-10-15 15:12:01 -07:00
Tim Abbott	bf66e9c4ab	actions: Add transaction.atomic to bulk_add_subs_to_db_with_logging. This will ensure that we always fully execute the database part of modifying subscription objects. In particular, this should prevent invariant failures like #16347 where Subscription objects were created without corresponding RealmAuditLog entries. Fixes #16347.	2020-10-14 11:06:00 -07:00
Steve Howell	5728149e94	performance: Streamline query to add subscribers. We don't need the select_related('user_profile') optimization any more, because we just keep track of user info in our own data structures. In this codepath we are never actually modifying users; we just occasionally need their ids or emails. This can be a pretty substantive improvement if you are adding a bunch of users to a stream who each have a bunch of their own subscriptions. We could also limit the number of full rows in this query by adding an extra hop to the DB just to get colors (using values_list), and then only get full sub info for the streams that we're adding, rather than getting every single subscription, in full, for each user. Apart from finding what colors the user has already used, the only other reason we need all the columns in Subscription here is to handle streams that need to be reactivated. Otherwise we could do only("id", "active", "recipient_id", "user_profile_id") or similar. Fortunately, Subscription isn't an overly wide table; it's mostly bool fields. But by far the biggest thing to avoid is bringing in all the extra user_profile data. We have pretty good coverage on query counts here, so I think this fix is pretty low risk.	2020-10-14 11:03:07 -07:00
Steve Howell	116a441bc5	refactor: Introduce SubInfo class. This class removes a lot of the annoying tuples we were passing around. Also, by including the user everywhere, which is easily available to us when we make instances of SubInfo, it sets the stage to remove select_related('user_profile').	2020-10-14 10:53:10 -07:00
Steve Howell	febef45e38	minor: Add comments to do_get_streams.	2020-10-14 10:53:10 -07:00
Steve Howell	a9356508ca	events: Stop sending occupy/vacate events. We used to send occupy/vacate events when either the first person entered a stream or the last person exited. It appears that our two main apps have never looked at these events. Instead, it's generally the case that clients handle events related to stream creation/deactivation and subscribe/unsubscribe. Note that we removed the apply_events code related to these events. This doesn't affect the webapp, because the webapp doesn't care about the "streams" field in do_events_register. There is a theoretical situation where a third party client could be the victim of a race where the "streams" data includes a stream where the last subscriber has left. I suspect in most of those situations it will be harmless, or possibly even helpful to the extent that they'll learn about streams that are in a "quasi" state where they're activated but not occupied. We could try to patch apply_event to detect when subscriptions get added or removed. Or we could just make the "streams" piece of do_events_register not care about occupy/vacate semantics. I favor the latter, since it might actually be what users what, and it will also simplify the code and improve performance.	2020-10-14 10:53:10 -07:00
Steve Howell	1bcb8d8ee8	performance: Avoid computing page_params.streams in webapp. The query to get "occupied" streams has been expensive in the past. I'm not sure how much any recent attempts to optimize that query have mitigated the issue, but since we clearly aren't sending this data, there is no reason to compute it.	2020-10-14 10:53:10 -07:00

... 2 3 4 5 6 ...

12978 Commits