zulip

Commit Graph

Author	SHA1	Message	Date
Steve Howell	811426b345	Extract send_stream_creation_events_for_private_streams. We can probably avoid passing in users here.	2020-10-12 16:40:37 -07:00
Steve Howell	1cfaef0d1a	refactor: Simplify pick_color logic. This removes the need to jankily mutate the active flag in the caller, and we don't need to mutate our subs_by_user either.	2020-10-12 16:40:37 -07:00
Steve Howell	13569ff97a	refactor: Eliminate new_subs. We now just process new subs for a user immediately within the loop.	2020-10-12 16:40:37 -07:00
Steve Howell	8c70fbde78	refactor: Use subs_to_add in return value. The subs_to_add is directly related to a var called new_subs, which I hope to eliminate soon.	2020-10-12 16:40:37 -07:00
Steve Howell	1afca3d430	minor: Extract local for stream.	2020-10-12 16:40:37 -07:00
Steve Howell	84aa1389d8	Extract bulk_add_subs_to_db_with_logging. This is a trivial code extraction.	2020-10-12 16:40:37 -07:00
Steve Howell	3ff9ce78ea	refactor: Extract send_peer_add_events.	2020-10-12 16:40:37 -07:00
Alex Vandiver	f3ba227614	create_user: Strip whitespace from initial password file. Fixes #12144.	2020-10-11 16:29:00 -07:00
Cody Piersall	5dab6e9d31	emoji-upload: Fix transparency issues on GIF emoji upload. This preserves the alpha layer on GIF images that need to be resized before being uploaded. Two important changes occur here: 1. The new frame is a copy of the original image, which preserves the GIF info. 2. The disposal method of the original GIF is preserved. This essentially determines what state each frame of the GIF starts from when it is drawn; see PIL's docs: https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html#saving for more info. This resolves some but not all of the test cases in #16370.	2020-10-11 16:23:07 -07:00
Anders Kaseorg	b7a94be152	python: Catch BaseException when we need to clean something up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:16:16 -07:00
Anders Kaseorg	7f69c1d3d5	python: Catch specific exceptions from requests. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	17ac17286c	python: Catch specific exceptions from subprocess. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	aabef3d9be	python: Catch specific exceptions from orjson. Followup to #16120. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	234f7245cf	export_usermessage_batch: Use os.rename. This avoids an extra stat call to check whether the target is a directory. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	83eca256a4	compilemessages: Use polib for get_name_from_po_file. This also corrects the name of zh_TW from “Chinese” to “Chinese (Taiwan)”. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	1346c5397a	zephyr: Use correct shell quoting for ssh. ssh always runs its command through a shell (after naïvely joining multiple arguments with spaces), so it needs an extra level of shell quoting. This should have no effect because we already validated user with a regex, but it’s better for escaping to be locally correct in case the context changes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	82593338ba	report: Show Git commit in a way that works for merges. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:35 -07:00
Anders Kaseorg	c9fec8f021	deliver_scheduled_messages: Don’t do_send_messages inside a transaction. do_send_messages has side effects outside the database and may not work reliably if its database effects are reordered by being inside a transaction. This also fixes a bug where we were doing the update incorrectly on the Message table. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:09:22 -07:00
Alex Vandiver	c2132a4f9c	queue: Drop register_json_consumer / json_drain_queue interface. Now that all callsites use the same interface, drop the now-unused ones, and their tests.	2020-10-11 14:19:42 -07:00
Alex Vandiver	5477b9d9a1	queue: Switch tests to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	179c387409	tornado: Switch to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f0b23b0752	queue: Switch non-batch consumer to also use start_json_consumer. This has no effect on consumption rate, but unifies the codepaths. Before: ``` $ ./manage.py queue_rate --count 50000 Purging queue... Enqueue rate: 11187 / sec Dequeue rate: 4158 / sec ``` After: ``` $ ./manage.py queue_rate --count 50000 Purging queue... Enqueue rate: 11010 / sec Dequeue rate: 4113 / sec ```	2020-10-11 14:19:42 -07:00
Alex Vandiver	45c9c3cc30	queue: Monitor user_activity queue, now that it has a consumer. Since this was using repead individual get() calls previously, it could not be monitored for having a consumer. Add it in, by marking it of queue type "consumer" (the default), and adding Nagios lines for it. Also adjust missedmessage_emails to be monitored; it stopped using LoopQueueProcessingWorker in `5cec566cb9`, but was never added back into the set of monitored consumers.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f9358d5330	queue: Switch batch interface to use the channel.consume iterator. This low-level interface allows consuming from a queue with timeouts. This can be used to either consume in batches (with an upper timeout), or one-at-a-time. This is notably more performant than calling `.get()` repeatedly (what json_drain_queue does under the hood), which is "highly discouraged as it is very inefficient"[1]. Before this change: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11158 / sec Dequeue rate: 3075 / sec ``` After: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11511 / sec Dequeue rate: 19938 / sec ``` [1] https://www.rabbitmq.com/consumers.html#fetching	2020-10-11 14:19:40 -07:00
Alex Vandiver	571f8b8664	queue: Use low-level queue_purge to empty at the end of tests. This is O(1) at the RabbitMQ API level, and doesn't rely on the code under test to function correctly during test cleanup.	2020-10-09 20:43:49 -07:00
Alex Vandiver	ac0ba21c2c	tests: Stop reusing a variable name. `loopworker_sleep_mock` is a file-level variable used to mock out the sleep() call in LoopQueueProcessingWorker; don't reuse the variable name for something else.	2020-10-09 20:42:20 -07:00
Alex Vandiver	754638f673	tests: Refactor test_queue_worker to separate queues.	2020-10-09 20:42:12 -07:00
Alex Vandiver	2547bdbf4a	queue: Rename consume_wrapper to a better name.	2020-10-09 20:40:51 -07:00
Alex Vandiver	d5a6b0f99a	queue: Rename queue_size, and update for all local queues. Despite its name, the `queue_size` method does not return the number of items in the queue; it returns the number of items that the local consumer has delivered but unprocessed. These are often, but not always, the same. RabbitMQ's queues maintain the queue of unacknowledged messages; when a consumer connects, it sends to the consumer some number of messages to handle, known as the "prefetch." This is a performance optimization, to ensure the consumer code does not need to wait for a network round-trip before having new data to consume. The default prefetch is 0, which means that RabbitMQ immediately dumps all outstanding messages to the consumer, which slowly processes and acknowledges them. If a second consumer were to connect to the same queue, they would receive no messages to process, as the first consumer has already been allocated them. If the first consumer disconnects or crashes, all prior events sent to it are then made available for other consumers on the queue. The consumer does not know the total size of the queue -- merely how many messages it has been handed. No change is made to the prefetch here; however, future changes may wish to limit the prefetch, either for memory-saving, or to allow multiple consumers to work the same queue. Rename the method to make clear that it only contains information about the local queue in the consumer, not the full RabbitMQ queue. Also include the waiting message count, which is used by the `consume()` iterator for similar purpose to the pending events list.	2020-10-09 20:40:39 -07:00
Alex Vandiver	a1ce1aca3b	queue: Update comment to be more accurate about import errors.	2020-10-09 20:40:32 -07:00
Alex Vandiver	2d71ca1fb8	email: Remove unused `log_digest_event` function. Its last callsite was removed in `e46cbaffa2`. Also ref #6786.	2020-10-08 20:35:53 -07:00
sahil839	4e8f5b5b31	streams: Change access_stream_for_delete_or_update to also return sub. We modify access_stream_for_delete_or_update function to return Subscription object also along with stream. This change will be helpful in avoiding an extra query to get subscription object in code for updating subscription role.	2020-10-08 17:07:30 -07:00
Aman Agrawal	8b419c93e4	message_send: Fix old guests being treated as full members. For streams in which only full members are allowed to post, we block guest users from posting there. Guests users were blocked from posting to admin only streams already. So now, guest users can only post to STREAM_POST_POLICY_EVERYONE streams. This is not a new feature but a bugfix which should have happened when implementing full member stream policy / guest users.	2020-10-08 11:30:11 -07:00
Anders Kaseorg	e1a3aa9c1a	deliver_scheduled_messages: Exit cleanly on Ctrl+C. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:15:19 -07:00
Anders Kaseorg	363374346c	management: Use signal.pause to sleep forever. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:15:19 -07:00
Anders Kaseorg	c7cbdba257	eslint: Fix unicorn/{new-for-builtins,throw-new-error,error-message}. https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/new-for-builtins.md https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/throw-new-error.md https://github.com/sindresorhus/eslint-plugin-unicorn/blob/master/docs/rules/error-message.md Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:00:33 -07:00
Anders Kaseorg	9bfbb29763	queue_processors: Use try…finally to prevent leaking an alarm. Otherwise, if consume_func raised an exception for any reason other than the alarm being fired, the still-pending alarm would have fired later at some arbitrary point in the calling code. We need two try…finally blocks in case the signal arrives just before signal.alarm(0). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 15:37:46 -07:00
akshatdalton	52c411df8a	emoji: Add padding around the gif on GIF emoji upload. Replaced ImageOps.fit by ImageOps.pad, in zerver/lib/upload.py, which returns a sized and padded version of the image, expanded to fill the requested aspect ratio and size. Fixes part of #16370.	2020-10-06 17:28:02 -07:00
Alex Vandiver	d47637fa40	queue: Set a max consume timeout with SIGALRM. SIGALRM is the simplest way to set a specific maximum duration that queue workers can take to handle a specific message. This only works in non-threaded environments, however, as signal handlers are per-process, not per-thread. The MAX_CONSUME_SECONDS is set quite high, at 10s -- the longest average worker consume time is embed_links, which hovers near 1s. Since just knowing the recent mean does not give much information[1], it is difficult to know how much variance is expected. As such, we set the threshold to be such that only events which are significant outliers will be timed out. This can be tuned downwards as more statistics are gathered on the runtime of the workers. The exception to this is DeferredWorker, which deals with quite-long requests, and thus has no enforceable SLO. [1] https://www.autodesk.com/research/publications/same-stats-different-graphs	2020-10-06 17:26:14 -07:00
Alex Vandiver	baf882a133	queue: Only ACK drain_queue once it has completed work on the list. Currently, drain_queue and json_drain_queue ack every message as it is pulled off of the queue, until the queue is empty. This means that if the consumer crashes between pulling a batch of messages off the queue, and actually processing them, those messages will be permanently lost. Sending an ACK on every message also results in a significant amount lot of traffic to rabbitmq, with notable performance implications. Send a singular ACK after the processing has completed, by making `drain_queue` into a contextmanager. Additionally, use the `multiple` flag to ACK all of the messages at once -- or explicitly NACK the messages if processing failed. Sending a NACK will re-queue them at the front of the queue. Performance of a no-op dequeue before this change: ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10847 / sec Dequeue rate: 2479 / sec ``` Performance of a no-op dequeue after this change (a 25% increase): ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10752 / sec Dequeue rate: 3079 / sec ```	2020-10-06 17:26:14 -07:00
Alex Vandiver	df86a564dc	queue: Let stop() work with LoopQueueProcessingWorker.	2020-10-06 17:26:14 -07:00
Alex Vandiver	8cf37a0d4b	queue: Add a tool to profile no-op enqueue and dequeue actions.	2020-10-06 17:26:14 -07:00
Mateusz Mandera	6e83bcc0d5	custom_profile_fields: Don't allow leading/trailing whitespaces. Allowing such whitespaces can lead to hard to debug issues e.g. with ldap sync.	2020-10-02 14:58:06 -07:00
Abhijeet Prasad Bodas	a20d22de43	i18n: Move locale select logic in home to i18n.py. Part of #16094. Moved the language selection preference logic from home.py to a new function in i18n.py to avoid repetition in analytics views and home views.	2020-10-02 14:56:20 -07:00
Aman Agrawal	08fbde4e7c	test_move_msgs: Rename variable for clarity.	2020-10-01 17:45:11 -07:00
Tim Abbott	8c8f3ee13b	test_classes: Extract home view helpers for reuse.	2020-10-01 15:14:25 -07:00
Tim Abbott	6d041a3b34	home: Include is_web_public_guest in page_params.	2020-10-01 15:07:19 -07:00
Tim Abbott	351d73ac5a	home: Pass realm to build_page_params_for_home_page_load. This is preparation for this needing to be a separate parameter from the user.	2020-10-01 15:00:36 -07:00
Aman Agrawal	18d852de49	unreads: Add support for web public guests. This handles the case of web public guests by returning RawUnreadMessagesResult with empty initalized values.	2020-10-01 14:53:43 -07:00
Aman Agrawal	e02f0fb1c4	get_raw_unread_data: Extract func to get unreads from ums.	2020-10-01 14:46:46 -07:00
Aman Agrawal	742bb7a9d5	zulip_otp_required: Don't 2fa logged out users. For users who are not authenticated, we don't need to 2fa them, we only need it once they are trying to login. Tweaked by tabbott to be much more readable; the new style might require new test coverage.	2020-10-01 14:46:46 -07:00
Tim Abbott	e8e876d54c	events: Deduplicate anonymous user code. This approach lets us deduplicate much of the fetch_initial_state_data logic around logged-out users.	2020-10-01 14:43:51 -07:00
Aman Agrawal	f46f251688	post_process_state: Allow web public guests. Because the logic already works correctly, we just need to change mypy types.	2020-10-01 14:41:49 -07:00
Aman Agrawal	190f481f49	stream_subscription: Mark notifications disabled for web public users. Users without an account can't get notifications, so we might as well ensure any UI displays them appropriately.	2020-10-01 14:40:48 -07:00
Aman Agrawal	b0d92b3ff6	HomeTest: Extract page_params keys to be used in other functions.	2020-10-01 14:39:54 -07:00
sahil839	78b98d8067	realm: Add wildcard_mention_policy setting. We add a new wildcard_mention_policy setting to handle wildcard mentions in large streams, with a wide range of policies available to organizations. We set the default to the safe option for preventing accidental spam: only stream administrators being able to use wildcard mentions in large streams.	2020-10-01 12:18:03 -07:00
Anders Kaseorg	46babbe9e1	import_realm: Close the memcached connection before forking. This prevents the memcached connection from being shared across multiple processes, and hopefully addresses unexpected behavior from cached functions like get_user_profile_by_id invoked inside the worker processes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-01 11:20:39 -07:00
Anders Kaseorg	cf5ededa35	openapi: Use at most one inline subschema in allOf. This fixes some of the warnings from openapi-generator. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-30 19:45:28 -07:00
sahil839	6c473ed75f	message: Call build_message_send_dict from check_message. We call build_message_send_dict from check_message instead of do_send_messages. This is a prep commit for adding a new setting for handling wildcard mentions in large streams.	2020-09-29 17:18:04 -07:00
sahil839	f1a5fbaeb0	message: Extract build_message_send_dict function. We extract the loop for building message dict in do_send_messages in a separate function named build_message_send_dict. This is a prep commit for moving the code for building of message dict in check_message.	2020-09-29 16:50:47 -07:00
sahil839	0514ba7ecb	message: Add 'links_for_embed' to message_dict. There is a bug where we send event for even those messages which do not have embedded links as we are using single set 'links_for_embed' to check whether we have to send event for embedded links or not. This commit fixes the bug by adding 'links_for_embed' in message dict itself and send the event only if that message has embedded links.	2020-09-29 16:50:47 -07:00
Steve Howell	c199571112	mypy: Add StreamDict. This requires us to rework the view code a little bit to explicitly assign fields.	2020-09-29 16:49:10 -07:00
Steve Howell	bee18c70f0	mypy: Use str in statsd_key.	2020-09-29 16:49:10 -07:00
Steve Howell	2c496d9afd	mypy: Fix do_send_user_group_update_event.	2020-09-29 16:49:10 -07:00
Steve Howell	a37ef208dc	mypy: Add RawReactionRow.	2020-09-29 16:49:10 -07:00
Anders Kaseorg	cfd93096b5	openapi: Remove yamole. As explained in the previous commit, yamole preprocessed allOf with an algorithm that is not standards compliant. We replicate that algorithm, but importantly, we only use it for our own code and not for building the openapi_core RequestValidator. This improves the time taken by OpenAPISpec().check_reload() from 1.69s to 0.53s, nearly all of which is inside openapi_core.create_spec. Closes #10484. Significantly improves #16068. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	fb2d7c6741	openapi: Fix allOf usage to conform to the OpenAPI specification. yamole preprocesses our schema by naïvely merging all the objects in an allOf array together, but this fails to capture the meaning of allOf according to the OpenAPI specification. allOf is supposed to be a strict logical intersection of each subschema interpreted independently. It does not combine their properties maps before interpreting additionalProperties. So according to the old definition of JsonSuccess, every response is invalid: allOf: - additionalProperties: false properties: result: type: string - required: - result - msg properties: msg: type: string because the first subschema disallowed msg and the second subschema required msg. To fix this, whenever we use allOf for schema “inheritence”, the base schema must not specify additionalProperties, and the child schema must explicitly list all properties recursively inherited from the base schema in any subschema that uses additionalProperties. Fixes #16109. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	c5765c9da6	openapi: Remove some keys redundant with a $ref. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Anders Kaseorg	f3ff082107	openapi: Remove trivial allOf wrapping. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-29 16:47:10 -07:00
Vishnu KS	367c792968	actions: Downgrade realm before scrubbing.	2020-09-28 15:37:49 -07:00
Vishnu KS	0d30f59c97	billing: downgrade_now -> downgrade_now_without_creating_additional_invoice.	2020-09-28 15:37:49 -07:00
Tim Abbott	0c2d1f068d	docs: Extend documentation of event system testing.	2020-09-28 12:37:54 -07:00
Steve Howell	def3dac6ae	event_schema: Add comments to top of the file. The comments basically explain the common coding patterns for making the checkers.	2020-09-28 12:19:28 -07:00
Tim Abbott	3242fc7388	soft_deactivation: Fix typo in logging output.	2020-09-28 12:12:04 -07:00
palash	7a7db69935	test_push_notifications: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_push_notifications.py	2020-09-28 12:12:00 -07:00
palash	0c18113910	soft_deactivation: Change root logger to zulip.soft_deactivation. Update logger in the following files using this logger: test_soft_deactivation, test_home, test_push_notifications	2020-09-28 12:12:00 -07:00
Tim Abbott	899cb41857	MessageDict: Remove _finalize_payload for simplicity. finalize_payload already has a few options; there's little benefit to this one being implemented as a separate helper function.	2020-09-28 12:00:18 -07:00
Tim Abbott	99396b25a6	MessageDict: Add a bit of docstring documentation.	2020-09-28 11:50:02 -07:00
Tim Abbott	90ff62aabc	actions: Rename message local variable to message_dict. This is a preparatory refactor to make it easy to see the changes using `git show` in the next commit.	2020-09-28 11:14:59 -07:00
sahil839	ae74f8aafb	actions: Remove unnecessary comment in do_send_messages function. This commit removes the unnecessary comment which was added in `9454683108`, when we were using message.get() for keys which were also passed as args in do_send_messages, but there are no such keys in the current code.	2020-09-28 10:58:35 -07:00
sahil839	76c75fea92	actions: Remove unnecessary line from do_send_messages. This commit removes the unnecessary line of code to get rendered_content from message dict sent by check_message when it actually does not inlcude 'rendered_content' key. This line was added in `9454683108`, but now we do not send rendered_content in the message dict as we render the message in do_send_messages itself.	2020-09-28 10:58:35 -07:00
Dinesh	acca870480	tests: Add a dummy request to self.client.login(). A later commit alters `authenticate` of EmailAuthBackend to add a store `needs_to_change_password` variable to session which is useful to insist users on changing their weak password. The tests start failing with that change because client.login() runs `authenticate` without a `request` object. So, this commit sends a request object with `request.session=self.client.session` to self.client.login() in tests wherever needed.	2020-09-25 16:24:18 -07:00
Abhijeet Prasad Bodas	d9d51e32c1	i18n: Simplify logic for translation data in page_params. This refactors the get_translation_data function to return an empty dict when 'en' language is passed, to avoid repetition of code in stats and home.	2020-09-25 16:21:37 -07:00
Dinesh	232eb8b7cf	auth: Render config error page on configuration error. We previously used to to redirect to config error page with a different URL. This commit renders config error in the same URL where configuration error is encountered. This way when conifguration error is fixed the user can refresh to continue normally or go back to login page from the link provided to choose any other backend auth. Also moved those URLs to dev_urls.py so that they can be easily accessed to work on styling etc. In tests, removed some of the asserts checking status code to be 200 as the function `assert_in_success_response` does that check.	2020-09-25 16:16:17 -07:00
Clara Dantas	8674287192	digest: Support digest of web public streams for guest users. This change requires some basic plumbing for test code creating web-public streams.	2020-09-25 16:11:04 -07:00
Vishnu KS	a888e65ea4	i18n: Don't include email tags in translation strings.	2020-09-25 15:53:26 -07:00
Tim Abbott	94a9fa1891	event_schema: Add documentation and rename a few functions. This should help make this revised subsystem readable for more new contributors. We still need to make updates to the high-level documentation.	2020-09-25 12:53:00 -07:00
Steve Howell	5b7c9c4714	test_events: Add check_realm_user_remove.	2020-09-25 11:43:20 -07:00
Steve Howell	7bb7f2943f	event_schema: Finish extraction with realm_emoji/update. We now no longer define any schemas in test_events--all of them are in event_schema, which helps our tooling cross-check schemas for openapi and node tests.	2020-09-25 11:43:20 -07:00
Steve Howell	ae4d083a5a	event_schema: Extract check_realm_domains_*.	2020-09-25 11:43:20 -07:00
Steve Howell	298bed9fa1	event_schema: Split check_update_message_flags.	2020-09-25 11:43:20 -07:00
Steve Howell	f6e0171d02	event_schema: Split check_reaction into add/remove. It happens that whether you add a reaction or remove a reaction, we send the exact same fields, just using a different op code. This sort of symmetry is actually kind of rare, as usually "add" events have more fields, and "remove" events might just send an id of something to remove. Our openapi schema treats these as two seperate events, so we are more consistent with it, and it helps our schema-checking tooling for node fixtures, too. Note that we now have to exempt the two events from our openapi checks, due to the is_mirror_dummy field in the deprecated user block. We can decide how to handle this later--one possibility is to just add it as an optional field on the event_schema side.	2020-09-25 11:43:20 -07:00
Steve Howell	b7b2546f44	event_schema: Extract check_subscription_update. Note that we use value_type for value instead of bool, since properties can be non-bool things like color, which we just don't test now. We should test them. We more than compensate for this by checking the actual value of the value in check_subscription_update.	2020-09-25 11:43:20 -07:00
Steve Howell	b920ebce81	event_schema: Extract check_has_zoom_token.	2020-09-25 11:43:20 -07:00
Steve Howell	0c4286222f	event_schema: Extract check_realm_update_dict.	2020-09-25 11:43:20 -07:00
Steve Howell	6ec6525624	event_schema: Extract check_delete_message. There is a legacy format where we send singular "message_id" instead of plural "message_ids". Then there are different fields for "private" and "stream" message types.	2020-09-25 11:43:20 -07:00
Steve Howell	88165aee6b	event_schema: Extract check_user_group_update.	2020-09-25 11:43:20 -07:00
Steve Howell	aaaac11661	event_schema: Extract check_user_group_remove.	2020-09-25 11:43:20 -07:00
Steve Howell	1b7af13f37	event_schema: Extract check_user_group_remove_members.	2020-09-25 11:43:20 -07:00
Steve Howell	19b7739065	event_schema: Extract check_user_group_add_members.	2020-09-25 11:43:20 -07:00
Steve Howell	4084f0b949	event_schema: Extract check_realm_user_add. Note that we make the schema for profile_data slightly more realistic, but it doesn't actually get exercised by our current tests (apart from making sure it's a dict), since we don't have profile data for our test realm. We also don't have the optional fields for bots, since our tests don't exercise that, nor delivery_email. So we exempt realm_user_add_event from openapi checks for now. When we try to match the openapi specs better, we will probably want to add a few tests to test_events. Obviously getting good coverage for adding users would be nice for all these scenarios: * delivery_email matters * bots * realm has profile fields	2020-09-25 11:43:19 -07:00
Steve Howell	dc2176a965	event_schema: Extract check_presence.	2020-09-25 11:43:19 -07:00
Steve Howell	6c74a44697	data_types: Generalize StringDictType. This is a prep commit for supporting "presence" events, where the key of the dictionary is some arbitrary string like "website" but the value of the dictionary is another dictionary itself with keys that are more like variable names.	2020-09-25 11:43:19 -07:00
Steve Howell	4f3d5f2d87	event_schema: Extract check_realm_filters. We have some known issues with representing tuples in openapi, so we exempt realm_filters from the relevant check.	2020-09-25 11:43:19 -07:00
Steve Howell	e40a5400e5	event_schema: Extract check_muted_topics. This also forces us to create TupleType. We exempt this from the openapi check, since we haven't figured out how to model tuples in openapi with the same precision as event_schema (and it may be impossible). Long term we just want to stop dealing in tuples, of course.	2020-09-25 11:43:19 -07:00
orientor	91ca1afe98	data_type: Add StringDict data type. StringDict is a data type for representing dictionaries where all keys and values are strings. Add this data type to data_types.py and edit other files so that this data type is put to use and tested. (slightly tweaked by @showell to remove a comment and shorten a var name now that we have a proper data type)	2020-09-25 11:43:19 -07:00
Steve Howell	78a2059b8d	event schema: Extract attachment checkers.	2020-09-25 11:43:19 -07:00
Steve Howell	4a947c971d	event_schema: Extract check_realm_export. These are all trivial transformations. Note that we don't insist timestamps are floats; the NumberType class allows ints too.	2020-09-25 11:43:19 -07:00
Steve Howell	d28c01284c	event_schema: Extract check_hotspots. This forces us to introduce a NumberType.	2020-09-25 11:43:19 -07:00
Steve Howell	cf26151cea	event_schema: Use realm_user_person_types. For realm_user events, we now structure the person type as a union of dicts, which is more consistent with how we model this in our openapi spec.	2020-09-25 11:43:19 -07:00
Steve Howell	10952394b0	test_events: Use int value of message_retention_days. We also make our schema in event_schema reflect this, which in turn makes us match the already accurate openapi spec, so we no longer need to exempt four types of events from our sanity checks.	2020-09-25 11:43:19 -07:00
Steve Howell	73e7f7edec	check-node-fixtures: Compare python/openapi schemas. We might want to rename the tool to something more general now, since we are really reconciling three things: - node fixtures - event_schema checkers for test_events - openapi specs The way we compare python and openapi schemas is as follows: - first convert openapi schemas to be build from DictType, ListType, etc. with from_opeapi - do a diff on the schemas Most of the new code is just having the FooType family of classes serialize themselves with schema().	2020-09-25 11:43:19 -07:00
Steve Howell	2b76eb767f	event_schema: Change propagate_mode to an enum.	2020-09-25 11:43:19 -07:00
Steve Howell	aca641a4d1	refactor: Extract data_types module. Defining types with an object hierarchy of type classes will allow us to build functionality that was impossible (or really janky) with the validators.py approach of composing functions. Most of the changes to event_schema.py were automated search/replaces. This patch doesn't really yet take advantage of the new FooType classes, but we will use it soon to audit our openapi specs.	2020-09-25 11:43:19 -07:00
Wes Galbraith	9645959ac4	populate_db: Add emoji reactions to development environment database. This change adds automated generated emoji reactions to the data in the development environment's database. Fixes part of #14991.	2020-09-23 16:10:37 -07:00
Steve Howell	f29b2884ca	bitbucket2: Format user info consistently. We now use get_user_info() to format all of our users in messages.	2020-09-23 15:31:38 -07:00
Steve Howell	e0b6619dac	bitbucket2: Simplify how we display user for fork events. Even before GDPR changes, it was strange that we displayed users differently for fork events vs. all other events. After GDPR, we don't even get the `username` field any more. So now we simply use `display_name` if available, and then we try `nickname`. See https://developer.atlassian.com/cloud/bitbucket/bitbucket-api-changes-gdpr/ for more context.	2020-09-23 15:31:38 -07:00
Steve Howell	1ef8d79352	bitbucket: Decouple BITBUCKET_FORK_BODY between versions. We were trying to share the same format string between the two different versions of bitbucket, but this only creates confusion, as the two versions are only close enough to be confusing. The format string might be the same, but the semantics are different, as well as the eventual outputs. For example, the {username} piece here is simple in version 2, but in version 3 we append a url to the user's name.	2020-09-23 15:31:38 -07:00
sahil839	fe370debe5	tests: Rename stream messages tests in test_message_send.py. This commit renames 'test_message_to_self' and 'test_api_message_to_self' tests to 'test_message_to_stream_by_name' and 'test_api_message_to_stream_by_name' to depict the actual purpose of these tests.	2020-09-23 15:28:31 -07:00
Aman Agrawal	5ae2325979	events: Don't send presence data for web public guests. We disable presence and hide list of users in right sidebar via setting realm_presence_disabled to false here for web public guests.	2020-09-23 12:11:22 -07:00
Aman Agrawal	5153a036f2	events: Disable topic edit & msg edit/delete for web public guest. To ensure web public guests cannot change any data, it's natural that we disable topic & msg edit / delete ops.	2020-09-23 12:11:22 -07:00
Aman Agrawal	3ec23e1a9d	fetch_initial_state_data: Handle case of web public guests. user_profile will be None for web_public_guests here. Hence, for settings (of which most be inaccessible by web public guest), which require a user_profile, we either set an empty value for them or set them to a default value. This will help render the frontend or extend support to our clients without breaking a lot of code. Tweaked by tabbott to add many comments.	2020-09-23 12:11:15 -07:00
Aman Agrawal	9cabd8f9cb	process_client: Don't update activity of unauthenticated users. This allows wrapper `add_logging_data` to be used to add client information for unauthenticated users (or web public guests).	2020-09-23 12:07:06 -07:00
Aman Agrawal	6012d3cff2	get_raw_user_data: Support acting_user=None. This already had support for web_public_guest, fixed mypy annotation.	2020-09-23 12:07:06 -07:00
Aman Agrawal	c88b4cba60	format_user_row: Support acting_user=None.	2020-09-23 12:07:06 -07:00
Aman Agrawal	48492a0633	fetch_initial_state_data: Pass realm as independent parameter. This removes dependency of the function on user_profile to get the realm, which will be useful when user_profile is None in case of web public guests.	2020-09-23 12:06:54 -07:00
Alex Vandiver	fd20e54c79	webhooks: Never log JsonableError to webook loggers. These represent known errors in what the user submitted. This is slightly complicated by UnsupportedWebhookEventType being an instance of JsonableError.	2020-09-22 15:11:48 -07:00
Alex Vandiver	7001004ec0	webhooks: Do not predicate on the "payload" key. If we are to log to the webhook logger, do so no matter which arguments are passed.	2020-09-22 15:11:48 -07:00
Alex Vandiver	1a763696f7	webhooks: Only enable webhook logging if it is a webhook. allow_webhook_access may be true if the request allows webhook requests, regardless of if it only used for a webhook integration. Only actually log to the verbose webhook logger if it is explicitly a webhook endpoint, as judged by `webhook_client_name`. This prevents requests for `POST /api/v1/messages` from being logged to the webhook logger if they mistakenly contain a `payload` argument.	2020-09-22 15:11:48 -07:00
Alex Vandiver	77d1a4a5c0	webhooks: Simplify logic around is_webhook_access. We clearly allow webhook access if we are setting the webhook_client_name. This removes the need for the `or`s later.	2020-09-22 15:11:48 -07:00
Alex Vandiver	d24869e484	webhooks: Rename is_webhook to allow_webhook_access. This argument does not define if an endpoint "is a webhook"; it is set for "/api/v1/messages", which is not really a webhook, but allows access from webhooks.	2020-09-22 15:11:48 -07:00
Alex Vandiver	b515c2bcbf	webhooks: Add a missing format for the realm.	2020-09-22 15:11:48 -07:00
Aman Agrawal	1b5b82e712	RealmFilterPattern: Mark converted content as AtomicString. If multiple filters match the same string, we run into an infinite loop of converting string into urls. To fix it, we mark the matched string as atomic after first conversion.	2020-09-22 15:10:38 -07:00
Anders Kaseorg	0228acf0f5	rest: Add rest_path shortcut for path with rest_dispatch. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:51:00 -07:00
Anders Kaseorg	e70f2ae58d	rest: Specify rest_dispatch handlers by function, not by string. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:46:28 -07:00
Anders Kaseorg	faf600e9f5	urls: Remove unused URL names and shorten others. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-22 10:46:28 -07:00
Alex Vandiver	db8daf4175	linkifiers: Allow tildes in target URLs.	2020-09-21 21:04:02 -07:00
Alex Vandiver	03c6a0f182	markdown: Skip other common file extensions in linking, sort.	2020-09-21 21:03:29 -07:00
Alex Vandiver	4361ce1246	markdown: Use tlds package to keep updated list of TLDs. Also remove a useage of "blacklist."	2020-09-21 21:03:29 -07:00
Aman Agrawal	37e77c77e2	message_fetch: Better handle unauthorized requests. We raise MissingAuthenticationError now, which adds `www_authenticate=session` header to the error response. This stops modern web-browsers from displaying a login form everytime a 401 response it sent to the client.	2020-09-21 16:07:43 -07:00
Tim Abbott	7fa8bafe81	lint: Fix type of initial 0 in queue monitoring.	2020-09-21 15:47:30 -07:00
Alex Vandiver	e5f62d083e	tornado: Merge the TORNADO_SERVER and TORNADO_PORTS configs. Having both of these is confusing; TORNADO_SERVER is used only when there is one TORNADO_PORT. Its primary use is actually to be _unset_, and signal that in-process handling is to be done. Rename to USING_TORNADO, to parallel the existing USING_RABBITMQ, and switch the places that used it for its contents to using TORNADO_PORTS.	2020-09-21 15:36:16 -07:00
Mateusz Mandera	810514dd9d	queue: Update stats file every 30 seconds. This system can't update stats while the queue is idle, without using threads for this, but at least we ensure to update the file after consuming an event if more than MAX_SECONDS_BEFORE_UPDATE_STATS passed since the last update, regardless of the number of iterations done so far.	2020-09-21 15:24:02 -07:00
Mateusz Mandera	40c4511a9c	queue: Fix misspelled consume_iteration_counter variable.	2020-09-21 15:22:58 -07:00
Mateusz Mandera	2365a53496	queue: Fix a race condition in monitoring after queue stops being idle. The race condition is described in the comment block removed by this commit. This leaves room for another, remaining race condition that should be virtually impossible, but nevertheless it seems worthwhile to have it documented in the code, so we put a new comment describing it. As a final note, this is not a new race condition, it was hypothetically possible with the old code as well.	2020-09-21 15:22:56 -07:00
Amitsinghyadav	a72e9476ee	api_docs: Add fragment references for all parameters. This makes it convenient to link to a specific parameter accepted by Zulip API endpoint.	2020-09-21 12:18:10 -07:00
Sumanth V Rao	c563cdba61	markdown: Add data-code-lang attribute for locally echoed messages. This mimics the backend logic for adding the data-attribute - to know what Pygments language was used to highlight the code block - in locally echoed messages. New test added checks our logic for canonicalizing pygments alias (for both frontend and backend). Other fixtures and tests amended.	2020-09-18 17:12:26 -07:00
Sumanth V Rao	564b199fe6	markdown: Escape lang when echoing back custom non-pygments languages. In `ae58ed5a7` we decided to echo back the text, when no Pygments lexer matching that language was found. When we do so, we must take care to HTML escape the lang before wrapping it in a data-code-language attribute. Tweaked by tabbott to make clear the escaping is defensive.	2020-09-18 17:12:11 -07:00
Alex Vandiver	1928696829	tests: Remove three references to Casper in the comments.	2020-09-18 15:13:40 -07:00
Alex Vandiver	f638518722	tornado: Move default production port to 9800. In development and test, we keep the Tornado port at 9993 and 9983, respectively; this allows tests to run while a dev instance is running. In production, moving to port 9800 consistently removes an odd edge case, when just one worker is on an entirely different port than if two workers are used.	2020-09-18 15:13:40 -07:00
Alex Vandiver	5c806fbd52	tornado: Remove a misleading comment and reformat. tornado.web.Application does not share any inheritance with Django at all; it has a similar router interface, but tornado.web.Application is not an instance of Django anything. Refold the long lines that follow it.	2020-09-18 15:13:40 -07:00
Alex Vandiver	4354386e69	tornado: Remove an unused port argument. This was added in `ec065e92ee` for the WebSocket codepath, which was subsequently removed in `ea6934c26d`.	2020-09-18 15:13:40 -07:00
Alex Vandiver	de1db2c838	sentry: Provide more metadata in queue processors. This allows aggregation by queue, makes the event data more readily accessible, and clears out the breadcrumbs upon every batch that is serviced.	2020-09-18 15:13:08 -07:00
Tim Abbott	9c0d6becc5	docs: Add comment links to i18n documentation.	2020-09-18 11:44:04 -07:00
Alex Vandiver	5a9b6feb18	tornado: Retry POST requests from Django to Tornado. While urllib3 retries all connection errors, it only retries a subset of read errors, since not all requests are safe to retry if they are not idempotent, and the far side may have already processed them once. By default, the only methods that are urllib3 retries read errors on are GET, TRACE, DELETE, OPTIONS, HEAD, and PUT. However, all of the requests into Tornado from Django are POST requests, which limits the effectiveness of `bb754e0902`. POST requests to `/api/v1/events/internal` are safe to retry; at worst, they will result in another event queue, which is low cost and will be GC'd in short order. POST requests to `/notify_tornado` are _not_ safe to retry, but this codepath is only used if USING_RABBITMQ is False, which only occurs during testing. Enable retries for read errors during all POSTs to Tornado, to better handle Tornado restarts without 500's.	2020-09-17 16:50:36 -07:00
Tim Abbott	ae58ed5a74	markdown: Tweak data-code-language testing and comments. This should make it clearer the precise decisions we've made about the intended semantics of this feature.	2020-09-15 12:30:57 -07:00
Sumanth V Rao	b0c9e0a295	markdown: Rename fenced code data-attribute to data-code-language.	2020-09-15 20:09:58 +05:30
Harsh Srivastava	ba36624442	api docs: Removing order dependency of deactivate user test. We create a User using `do_create_user`, before running the deactivation test. This lets us removing the ordering logic introduced in `7c17bdb9c5`.	2020-09-14 22:31:40 -07:00
Alex Vandiver	ff94254598	tornado: Log to files by port number. Without an explicit port number, the `stdout_logfile` values for each port are identical. Supervisor apparently decides that it will de-conflict this by appending an arbitrary number to the end: ``` /var/log/zulip/tornado.log /var/log/zulip/tornado.log.1 /var/log/zulip/tornado.log.10 /var/log/zulip/tornado.log.2 /var/log/zulip/tornado.log.3 /var/log/zulip/tornado.log.7 /var/log/zulip/tornado.log.8 /var/log/zulip/tornado.log.9 ``` This is quite confusing, since most other files in `/var/log/zulip/` use `.1` to mean logrotate was used. Also note that these are not all sequential -- 4, 5, and 6 are mysteriously missing, though they were used in previous restarts. This can make it extremely hard to debug logs from a particular Tornado shard. Give the logfiles a consistent name, and set them up to logrotate.	2020-09-14 22:17:51 -07:00
Alex Vandiver	812af977d3	compilemessages: Switch to canonical override_language().	2020-09-14 22:16:09 -07:00
Alex Vandiver	536bd3188e	middleware: Move locale-setting before domain checking. Calling `render()` in a middleware before LocaleMiddleware has run will pick up the most-recently-set locale. This may be from the _previous_ request, since the current language is thread-local. This results in the "Organization does not exist" page occasionally being in not-English, depending on the preferences of the request which that thread just finished serving. Move HostDomainMiddleware below LocaleMiddleware; none of the earlier middlewares call `render()`, so are safe. This will also allow the "Organization does not exist" page to be localized based on the user's browser preferences. Unfortunately, it also means that the default LocaleMiddleware catches the 404 from the HostDomainMiddlware and helpfully tries to check if the failure is because the URL lacks a language component (e.g. `/en/`) by turning it into a 304 to that new URL. We must subclass the default LocaleMiddleware to remove this unwanted functionality. Doing so exposes a two places in tests that relied (directly or indirectly) upon the redirection: '/confirmation_key' was redirected to '/en/confirmation_key', since the non-i18n version did not exist; and requests to `/stats/realm/not_existing_realm/` incorrectly were expecting a 302, not a 404. This regression likely came in during `f00ff1ef62`, since prior to that, the HostDomainMiddleware ran _after_ the rest of the request had completed.	2020-09-14 22:16:09 -07:00
sahil839	9c3341ad95	openapi: Rearrange users/{user_id}/subscriptions/{stream_id} docs. This commit moves docs for users/{user_id}/subscriptions/{stream_id} enndpoint to be after users/me/subscriptions/muted_topics docs. We are rearranging the docs because after adding the new patch endpoint for users/{user_id}/subscriptions/{stream_id}, openapi_core validator tries to match 'users/me/subscriptions/muted_topics' with 'users/{user_id}/subscriptions/{stream_id}' path in zulip.yaml and thus gives error while running tests. This is a bug in 'openapi_core' as it does not follows OpenAPI specs to match concrete paths before their templated counterparts. Thus, this commit rearranges the docs such that openapi_core validator tries to match muted_topics endpoint with the correct path in zulip.yaml docs.	2020-09-14 22:04:45 -07:00
Sumanth V Rao	033351609d	markdown: Add data-codehilite-language attr for fenced code. When converting fenced code markdown, we add the language (if specified) in a data-attribute by tweaking the HTML generated. Doing so, allows the frontend to make use of this attr to display view-in-playground option for codeblocks. We use pygments to get the lexer subclass name and use that instead of directly using the language in the data-attribute. Doing so, helps us map different language aliases (like `js` and `javascript`) into a common variable (like `JavaScript`) - and avoids the client from dealing with multiple tags corresponding to the same language. The html structure for a message like this: ``` js ..content.. ``` would now be: <div class="codehilite" data-codehilite-language="JavaScript"> <pre>..content..</pre> </div> Tests and fixtures amended.	2020-09-14 21:25:19 -07:00
Anders Kaseorg	a659542d84	parallel: Remove run_parallel. This was a broken abstraction that returned to its caller within multiple forked processes on exceptions, and encouraged ignoring the error code (as all of its callers did). Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	0f16df2f13	transfer: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	7f410ff0de	import_realm: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	9a2aad58d0	import_util: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	73eb0aa891	test_runner: Monkey-patch to allow using multiprocessing in tests. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Tim Abbott	f751acbec5	openapi: Add comments on top of various OpenAPI doc files. This is part of our standard approach for trying to make it easy for folks to find relevant documentation on a system they're trying to understand.	2020-09-14 15:25:46 -07:00
Aman Agrawal	2bc3924672	move_topic_to_stream: Allow moving to/between/from private streams. Fixes #16284. Most of the work for this was done when we implemented correct behavior for guest users, since they treat public streams like private streams anyway. The general method involves moving the messages to the new stream with special care of UserMessage. We delete UserMessages for subs who are losing access to the message. For private streams with protected history, we also create UserMessage elements for users who are not present in the old stream, since that's important for those users to access the moved messages.	2020-09-14 15:00:55 -07:00
Anders Kaseorg	ddf8ec33df	upload: Strip leading slash from deleted S3 export paths. Previously, S3UploadBackend.delete_export_tarball failed to strip the leading ‘/’ from the export path. This mistake is now caught by Moto 1.3.15. I expect it caused deletion failures in the real S3, although I haven’t verified this. We store export_path in the audit log with a leading ‘/’, but the actual S3 keys do not have a leading ‘/’. Changing either system would require a migration. So the new convention is that the variables named ‘export_path’ have a leading ‘/’, while variables named ‘path_id’ or ‘key’ do not. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 20:59:09 -07:00
Anders Kaseorg	42d038f09b	migrations: Fix 0301 to replace a Python loop with SQL. The previous code is correctly flagged by semgrep 0.23 as a violation of our sql-format rule. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 20:59:09 -07:00
Hemanth V. Alluri	f57196b1d0	webhooks/gitlab: Add fixture for MR opened with multiple assignees. Fixes #16173. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Hemanth V. Alluri	54aba8d402	webhooks/gitlab: Use information about all assignees. Previously, the GitLab webhook code, namely the `get_objects_assignee` method first tried to get a single assignee and if that failed then it looks for multiple assignees and then it would return the first assignee that it found (there's actually a code smell here - a loop which would always return on the first iteration). Instead, this commit will change that behavior to first check for multiple assignees first then for a single assignee if we can't find multiple assignees. Ultimately it will return a list of all of the assignees (however many that might be [0, n]). This method has then aptly been renamed to `get_assignees`. Finally, we tweked the code using this method to always use it's output as an "assignees" parameter to templates (there's also an assignee parameter which we want to avoid here for consistency). Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Hemanth V. Alluri	bbe7a54171	webhooks/gitlab: Make permissions consistent across all fixtures. For some reasons, some of the fixtures had the +x bit set, while some didn't. What this commit does is make sure that no fixture is marked as "executable" (for anyone). Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-09-13 20:54:14 -07:00
Anders Kaseorg	dfab09b17d	markdown: Replace hyperlink requirement with urllib.parse. The previous code only worked by accident and hyperlink 20.0.0 breaks it. >>> hyperlink.parse("example.com").replace(scheme="https") DecodedURL(url=URL.from_text('https:example.com')) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 15:37:28 -07:00
Anders Kaseorg	f39d4cf1f0	sentry: Pass format arguments to logging. semgrep 0.23 correctly flags this as a violation of our logging-format rule. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 15:37:28 -07:00
Mateusz Mandera	70bcc7a8e9	migrations: Add case-insensitive unique index on realm and stream name. This is closely analogical to `b9b146c809`.	2020-09-13 15:36:43 -07:00
Anders Kaseorg	463929f349	urls: Migrate re_path routes to path. Django treats path("<name>") like re_path(r"(?P<name>[^/]+)") and path("<path:name>") like re_path(r"(?P<name>.+)"). This is more readable and consistent than the mix of slightly different regexes we had before, and fixes various bugs: • The r'apps/(.*)$' regex was missing a start anchor ^, so it incorrectly matched all URLs that included apps/ as a substring anywhere. • The r'accounts/login/(google)/$' regex was missing a start anchor ^, so it incorrectly matched all URLs that ended with accounts/login/google/. • The type annotation of zerver.views.realm_export.delete_realm_export takes export_id as an int, but it was previously passed as a string. • The type annotation of zerver.views.users.avatar takes medium as a bool, but it was previously passed as a string. • The [0-9A-Za-z]+ pattern for uidb64 was missing the - and _ characters that can validly be part of a base64url encoded string (although I think the id is actually a decimal integer here, in which case only 012345ADEIMNOQTUYcgjkwxyz are present in its base64url encoding). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 14:29:26 -07:00
Anders Kaseorg	7c17bdb9c5	openapi: Remove ‘example’ $ref siblings. $ref siblings are ignored according to the OpenAPI specification, and the referenced definitions already have examples. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:57:13 -07:00
Anders Kaseorg	caa08d76b5	openapi: Inline parameter references to avoid ‘required’ $ref siblings. $ref siblings are ignored according to the OpenAPI specification. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:57:13 -07:00
Anders Kaseorg	b41165fd01	dev_urls: Give the user_avatars URLPattern a name. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-12 11:21:40 -07:00
palash	2632317b06	test_message_send: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_message_send.py	2020-09-12 11:04:51 -07:00
palash	f2f8034b76	test_markdown: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_markdown.py	2020-09-12 11:04:51 -07:00
palash	588fff236d	test_service_bot_system: Refactor mock.patch with assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_service_bot_system.py	2020-09-12 10:59:36 -07:00
palash	4e2769c31c	test_decorators: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_decorators.py	2020-09-12 10:59:36 -07:00
palash	c3fee5059f	test_email_mirror: Refactor mock.path to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_email_mirror.py	2020-09-12 10:59:36 -07:00
palash	34003fc7f6	email_mirror: Change root logger to zerver.lib.email_mirror.	2020-09-12 10:53:56 -07:00
palash	60f9f87c5d	test_external: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_external.py	2020-09-12 10:53:56 -07:00
palash	485d740fe8	apple auth: Change logger to zulip.auth.apple from root logger. Replace default root logger with zulip.auth.apple for apple auth in file zproject/backends.py and update the test cases accordingly in file zerver/tests/test_auth_backends.py	2020-09-12 10:53:56 -07:00
palash	04f08f45e7	test_auth_backends: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file test_auth_backends.py. This change requires adjusting test_log_into_subdomain_when_email_is_none to use an explicit token since that appears in the log output.	2020-09-12 10:53:37 -07:00
Aman Agrawal	ed8796f23f	report: Allow anonymous queries to report (un)narrow times.	2020-09-11 16:57:27 -07:00
Alex Vandiver	3f6e4ff303	webhooks: Move the extra logging information into a formatter. This clears it out of the data sent to Sentry, where it is duplicative with the indexed metadata -- and potentially exposes PHI if Sentry's "make this issue public" feature is used.	2020-09-11 16:43:29 -07:00
Alex Vandiver	b8a2e6b5f8	webhooks: Configure webhook loggers in zproject/computed_settings.py. This limits the webhook errors to only go to their respective log files, and not to the general server logs.	2020-09-11 16:43:29 -07:00
Alex Vandiver	6323218a0e	request: Maintain a thread-local of the current request. This allows logging (to Sentry, or disk) to be annotated with richer data about the request.	2020-09-11 16:43:29 -07:00
Mateusz Mandera	bb4567f57e	queue: Extract get_remaining_queue_size method.	2020-09-11 15:51:07 -07:00
Mateusz Mandera	1d466a4fc5	queue: Make embed_link updates stats on every iteration.	2020-09-11 15:51:07 -07:00
Tim Abbott	f0c2c640b6	api docs: Add changes for community_topic_editing_limit_seconds. The previous commit fixed this in the changelog, but it should have also been documented directly as well.	2020-09-11 15:50:06 -07:00
Alex Vandiver	4917391133	webhooks: Derive payload from request itself.	2020-09-10 17:47:22 -07:00
Alex Vandiver	a1f5f6502c	webhooks: In logger, pull user from request, rather than parameter. request.user is set by validate_api_key, which is called by webhook_view and authenticated_rest_api_view.	2020-09-10 17:47:22 -07:00
Alex Vandiver	d04db7c5fe	webhooks: Remove repetitive argument to UnsupportedWebhookEventType. The name of the webhook can be added by the webhook decorator.	2020-09-10 17:47:21 -07:00
Alex Vandiver	e2ab7b9e17	webhooks: Update API_KEY_ONLY_WEBHOOK_LOG_PATH to WEBHOOK_LOG_PATH. The existence of "API_KEY" in this configuration variable is confusing. It is fundamentally about webhooks.	2020-09-10 17:47:21 -07:00
Alex Vandiver	cf6ebb9c8d	webhooks: Rename api_key_only_webhook_view to webhook_view. There are no other types of webhook views; this is more concise.	2020-09-10 17:47:21 -07:00
Alex Vandiver	8cfacbf8aa	webhooks: Update comment about typing the webhook decorator. The previous link was to "extended callable" types, which are deprecated in favor of callback protocols. Unfortunately, defining a protocol class can't express the typing -- we need some sort of variadic generics[1]. Specifically, we wish to support hitting the endpoint with additional parameters; thus, this protocol is insufficient: ``` class WebhookHandler(Protocol): def __call__(request: HttpRequest, api_key: str) -> HttpResponse: ... ``` ...since it prohibits additional parameters. And allowing extra arguments: ``` class WebhookHandler(Protocol): def __call__(request: HttpRequest, api_key: str, args: object, *kwargs: object) -> HttpResponse: ... ``` ...is similarly problematic, since the view handlers do not support _arbitrary_ keyword arguments. [1] https://github.com/python/typing/issues/193	2020-09-10 17:47:21 -07:00
Alex Vandiver	ea8823742b	webhooks: Adjust the name of the unsupported logger. `zulip.zerver.lib.webhooks.common` was very opaque previously, especially since none of the logging was actually done from that module. Adjust to a more explicit logger name.	2020-09-10 17:47:21 -07:00
Alex Vandiver	9ea9752e0e	webhooks: Rename UnexpectedWebhookEventType to UnsupportedWebhookEventType. Any exception is an "unexpected event", which means talking about having an "unexpected event logger" or "unexpected event exception" is confusing. As the error message in `exceptions.py` already explains, this is about an _unsupported_ event type. This also switches the path that these exceptions are written to, accordingly.	2020-09-10 17:47:21 -07:00
Alex Vandiver	8016769613	webhooks: Move UnexpectedWebhookEventType into zerver.lib.exceptions. `8e10ab282a` moved UnexpectedWebhookEventType into `zerver.lib.exceptions`, but left the import into `zserver.lib.webhooks.common` so that webhooks could continue to import the exception from there. This clutters things and adds complexity; there is no compelling reason that the exception's source of truth should not move alongside all other exceptions.	2020-09-10 17:47:21 -07:00
Mateusz Mandera	f95dd628bd	email: Fix race conditions with concurrent ScheduledEmail handling. The main race conditions, which actually happened in production was with concurrent execution of deliver_email and clear_scheduled_emails. clear_scheduled_emails could delete all email.users in the middle of deliver_email execution, causing it to pass empty to_user_ids list to send_email. We mitigate this by getting the list of user ids in a single query and moving forward with that snapshot, not having to worry about database data being mutated anymore. clear_scheduled_emails had potential race conditions with concurrent execution of itself due to not locking the appropriate rows upon selecting them for the purpose of potentially deleting them. FOR UPDATE locks need to be acquired to prevent simultaneous mutation. Tested manually with some print+sleep debugging to make some races happen. fixes #zulip-2k (sentry)	2020-09-09 15:59:01 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
Dinesh	c64888048f	puppeteer: Rename CASPER_TESTS env variable to PUPPETEER_TESTS. Also modified few comments to match with the changes.	2020-09-09 13:38:39 -04:00
Aman	c114447e11	topic_history: Allow anonymous access in web-public streams. For web-public streams, clients can access full topic history without being authenticated. They only need to additionally send "streams:web-public" narrow with their request like all the other web-public queries.	2020-09-04 10:17:56 -07:00
Anders Kaseorg	3b301f522b	python: Tweak some magic trailing commas to avoid Black bugs. https://github.com/psf/black/issues/1658 https://github.com/psf/black/issues/1671 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	f91d287447	python: Pre-fix a few spots for better Black formatting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	bb4fc3c4c7	python: Prefer --flag=option over --flag option. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	bef46dab3c	python: Prefer kwargs form of dict.update. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	75c59a820d	python: Convert subprocess.Popen.communicate to run or check_output. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:42:35 -07:00
Vishnu KS	6bbcb622e1	onboarding: Move send_welcome_bot_response to onboarding.	2020-09-03 17:41:08 -07:00
Alex Vandiver	70e8cc5783	tests: Add more tests around exception handling. This verifies that we actually do enqueue a record when there is an error on non-staging. With the previous commit, it verifies that that data serializes correctly.	2020-09-03 17:34:31 -07:00
Alex Vandiver	69a7944c48	tests: Extract a helper to run tests for each bot type.	2020-09-03 17:34:31 -07:00
Alex Vandiver	b1cac67c31	tests: Check JSON serializability of test data with mock_queue_publish.	2020-09-03 17:34:31 -07:00
Alex Vandiver	660c2e5383	tests: Add a mock to verify that queue_json_publish is valid JSON. See also #16121. However, that does not address the tests that stub out the call to queue_json_publish itself.	2020-09-03 17:34:31 -07:00
Alex Vandiver	012113c542	models: Force the translated role into a translated string. The return type of `ugettext_lazy('...')` (aliased as `_`) is a promise, which is only forced into a string when it is dealt with in string context. This `django.utils.functional.lazy.__proxy__` object is not entirely transparent, however -- it cannot be serialized by `orjson`, and `isinstance(x, str) == False`, which can lead to surprising action-at-a-distance. In the two places which will serialize the role value (either into Zulip's own error reporting queue, or Sentry's), force the return value. Failure to do this results in errors being dropped mostly-silently, as they cannot be serialized and enqueued by the error reporter logger, which has no recourse but to just log a warning; see previous commit. When we do this forcing, explicitly override the language to be the realm default. Failure to provide this override would translate the role into the role in the language of the _request_, yielding varying results.	2020-09-03 17:26:54 -07:00
Alex Vandiver	2f350c614b	sentry: Capture exceptions that happen during AdminNotifyHandler. AdminNotifyHandler is used to notify admins of errors; it is a critical piece of logic. Failures in reporting errors will compound, since its `except Exception` clauses cannot generate logging at the `error` or `exception` level, as that would be recursive. It must settle for logging at the `warning` level, and hope that admins are vigilant to the logging there. Increase the chances of being notified of failures in this logger, by bubbling up those exceptions to Sentry, which is an orthogonal reporting stack.	2020-09-03 17:26:54 -07:00
Anders Kaseorg	bb5a963086	soft_deactivate_users: Elide default=[] for users argument. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	ddcf8dd8bc	create_user: Use None as default for --password, --password-file. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	a50fae89e2	python: Elide type=str from argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	fbfd4b399d	python: Elide action="store" for argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	1f2ac1962f	python: Elide default=None for argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	3c5b39da9c	python: Elide nargs for argparse flag arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	b4597a8ca8	python: Elide default for store_{true,false} argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Aman Agrawal	d9431a5e66	exceptions: Raise InvalidSubdomainError when realm is invalid. When user requests for a realm that doesn't exists, we raise a InvalidSubdomainError. This reduces our effort at repeatedly ensuring realm is valid in request in web-public queries.	2020-09-03 10:45:14 -07:00
Steve Howell	dad0073331	trello webhook: Ignore copyCard actions. See https://github.com/zulip/zulip/issues/16185, which encourages folks to fill in for these missing actions.	2020-09-03 10:44:39 -07:00
Steve Howell	5bff66b450	github webhook: Always send messages for team edits. If there are unsupported keys, we still log an error, but we now also send a message to the stream. (This is a good tradeoff for the github webhook, since users can just turn off notifications if they find it spammy. Also, we intend to support "repository" soon.) This is a bit of an experiment to see how this plays in the field: * will customers notice the change? * will Sentry reports look any different?	2020-09-03 10:44:39 -07:00
Steve Howell	c6b9a23c17	github webhooks: Fix message for unsupported team payloads. The main thing fixed here is that we weren't turning on our keys into a list. And then I refined the message a bit more, including sorting the keys. I also avoid the unnecessary "else".	2020-09-03 10:44:39 -07:00
Steve Howell	8785790a27	github tests: Add test_team_edited_error_handling. This doesn't test much interesting yet, but it will soon.	2020-09-03 10:44:39 -07:00
Steve Howell	cc2dbefc60	mypy: Use better types for EVENT_FUNCTION_MAPPER. The EVENT_FUNCTION_MAPPER maps a string event name to a function handler. Before this we circumvented mypy checks with a call to get_body_function_based_on_type, which specified Any as the type of our event function. Now the types are rigorous. This change was impossible without the recent commit to introduce the Helper class.	2020-09-03 10:44:39 -07:00
Steve Howell	4de2b78c25	github refactor: Add Helper class. The Helper class will soon grow, but the immediate problem it solves is the need to jankily inspect the parameters of our get_*_body function. Most of the changes were handled by an ad hoc munge.py script. The substantive changes were adding the Helper class and passing it in. And then the linter discovered a place where the optional include_title parameter wasn't used (which is one of the reasons to avoid the janky inspect-signature technique). As a side note, none of the include_title parameters needed a default value of False, as we always passed in an explicit value. We test cover both sides of include_title, which you can verify by hard coding it to either True or False (and seeing the relevant failures), although I suspect most individual codepaths only test one value, based on whether "topic" is in the fixture or not. Finally, I know Helper is not a great name, but I intend to evolve the class a bit before deciding whether a more descriptive name is helpful here. (For example, an upcoming commit will add a log_unexpected helper method.)	2020-09-03 10:44:39 -07:00
Steve Howell	ead7cbea40	github refactor: Handle header_event explicitly. We get the header_event one level up the call stack now, too. It's somewhat annoying that we have our own concept of "event" here, instead of just returning our event handlers directly, or just calling them directly, but it's a bit non-trivial to fix that right away. In passing, I remove the strange OR for "ping", which is already a key in EVENT_FUNCTION_MAPPER.	2020-09-03 10:44:39 -07:00
Steve Howell	0d9b1817f9	github refactor: Use early-exit idiom. We also comment a bit more explicitly about the None case.	2020-09-03 10:44:39 -07:00
Steve Howell	5c916135c9	github webhooks: Avoid string interpolation. We know the event explicitly here.	2020-09-03 10:44:39 -07:00
Steve Howell	425db931a8	github webhook: Explicitly ignore team actions.	2020-09-03 10:44:39 -07:00
Steve Howell	294fd59983	github webhook: Ignore more pull_request actions. See https://github.com/zulip/zulip/issues/16258 for possible follow up here. We now ignore the following two new pull_request actions (as well as the three existing ones from before): approved converted_to_draft As the issue above indicates, we may want to actually support "approved" if we can find somebody to work on the webhook. (And then the issue goes a little broader than what changed here.)	2020-09-03 10:44:39 -07:00
Steve Howell	5dea85a186	github tests: Extract test_ignored_pull_request_actions. We consolidate the tests and remove the fixtures, which just have a lot of noisy fields that we ignore. Also, pull_request__request_review_removed was named improperly.	2020-09-03 10:44:39 -07:00
Steve Howell	4c2787c35f	github tests: Add test_ignored_events.	2020-09-03 10:44:39 -07:00
Steve Howell	040bf82122	github webhook: Remove unused exception class.	2020-09-03 10:44:39 -07:00
Steve Howell	3634fe903b	decorator test: Dedent some assertions. These assertions didn't need to be nested in the with blocks.	2020-09-03 10:44:39 -07:00
Steve Howell	e91e21c9e7	webhook logger: Add summary field. Before this the only way we took advantage of the summary from UnexpectedWebhookEventType was by looking at exc_info(). Now we just explicitly add it to the log message, which also sets us up to call log_exception_to_webhook_logger directly with some sort of "summary" info when we don't actually want a real exception (for example, we might want to report anomalous webhook data but still continue the transaction). A minor change in passing is that I move the payload parameter lexically.	2020-09-03 10:44:39 -07:00
Aman Agrawal	66a1037f06	test: Use variable instead of hard coded value.	2020-09-02 17:58:19 -07:00
Anders Kaseorg	c5a904fb05	django_api, test_classes: Use Python 3 form of super(). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:31:46 -07:00
Anders Kaseorg	02725d32dd	python: Rewrite list() as []. Suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00

... 3 4 5 6 7 ...

12810 Commits