zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	2d45308546	CVE-2020-10935: Fix XSS vulnerability in local link rewriting. Make sure rewrite_local_links_to_relative does not accidentally change the meaning of links. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-01 14:01:45 -07:00
Anders Kaseorg	4f748fb627	markdown: Stop setting target="_blank". This setting is being overridden by the frontend since the last commit, and the security model is clearer and more robust if we don't make it appear as though the markdown processor is handling this issue. Co-authored-by: Tim Abbott <tabbott@zulipchat.com> Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-01 14:01:45 -07:00
Tim Abbott	e3a4aeeffa	CVE-2020-9445: Remove unused and insecure modal_link feature. Zulip's modal_link markdown feature has not been used since 2017; it was a hack used for a 2013-era tutorial feature and was never used outside that use case. Unfortunately, it's sloppy implementation was exposed in the markdown processor for all users, not just the tutorial use case. More importantly, it was buggy, in that it did not validate the link using the standard validation approach used by our other code interacting with links. The right solution is simply to remove it.	2020-04-01 14:01:45 -07:00
Udit107710	ef741bf317	messages: Return shallow copy of message object. When more than one outgoing webhook is configured, the message which is send to the webhook bot passes through finalize_payload function multiple times, which mutated the message dict in a way that many keys were lost from the dict obj. This commit fixes that problem by having `finalize_payload` return a shallow copy of the incoming dict, instead of mutating it. We still mutate dicts inside of `post_process_dicts`, though, for performance reasons. This was slightly modified by @showell to fix the `test_both_codepaths` test that was added concurrently to this work. (I used a slightly verbose style in the tests to emphasize the transformation from `wide_dict` to `narrow_dict`.) I also removed a deepcopy call inside `get_client_payload`, since we now no longer mutate in `finalize_payload`. Finally, I added some comments here and there. For testing, I mostly protect against the root cause of the bug happening again, by adding a line to make sure that `sender_realm_id` does not get wiped out from the "wide" dictionary. A better test would exercise the actual code that exposed the bug here by sending a message to a bot with two or more services attached to it. I will do that in a future commit. Fixes #14384	2020-03-29 15:12:27 -07:00
Steve Howell	e29ddd0ce0	outgoing_webhook: Remove `event` from process_success. The `event` parameter is never used by `process_success`, and eliminating it allows us to greatly simplify tests that are just confusingly passing in events that are totally ignored.	2020-03-29 15:12:27 -07:00
Steve Howell	bacfadbc61	minor: Use explicit params in build_bot_request. I also tweaked the block comment to mention gravatars.	2020-03-29 15:12:27 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
arpit551	8f7733cb20	emails: Added placeholders strings in FormAddress. We've had a bug for a while that if any ScheduledEmail objects get created with the wrong email sender address, even after the sysadmin corrects the problem, they'll still get errors because of the objects stored with the wrong format. We solve this by using FromAddress placeholders strings in send_future_email function, so that ScheduledEmail objects end up setting the final `from_address` value when mail is actually sent using the setting in effect at that time. Fixes #11008.	2020-03-27 16:41:02 -07:00
Steve Howell	c2b3269420	message perf: Streamline stream name lookups. When we are fetching messages, we need to hydrate stream names into the messages for legacy reasons. (Ideally, we could skip this step for the webapp and modern mobile clients, since they really only need stream_ids, but we're not there yet.) We keep a recipient cache that maps recipient ids to stream names. When we populate that cache, we now use `values(...)` to avoid fat objects and extra DB work. Note that we are already using a similar technique for hydrating PM/huddle recipients.	2020-03-27 17:20:34 +00:00
Tim Abbott	06c97b5be2	api docs: Render example responses as with JSON codehilite. This makes the example responses a lot prettier visually.	2020-03-27 00:03:36 -07:00
Tim Abbott	820f0e275e	api docs: Redesign visuals for documenting arguments. The previous system for documenting arguments was very ugly if any of the examples or descriptions were wrong. After thinking about this for a while, I concluded the core problem was that a table was the wrong design element to use for API parameters, and we'd be much better off with individual card-type widgets instead. This rewrites the API arguments documentation implementation to use a basic sort of card-like system with some basic styling; I think the result is a lot more readable, and it's a lot more clear how we would add additional OpenAPI details (like parameter types) to the documentation.	2020-03-27 00:03:36 -07:00
Anders Kaseorg	7ff9b22500	docs: Convert many http URLs to https. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-03-26 21:35:32 -07:00
Graham Bleaney	fd5ee9a831	bots: Decouple user input from imported module. This commit modifies 'zerver/lib/bot_lib.py' to decouple the user-controllable 'service_name' parameter from the value that is passed in to 'import_module'. This is done as a precautionary hardening.	2020-03-25 16:39:17 -07:00
Graham Bleaney	2fe9d85a5f	redirects: Refactor redirect code to use central helper function. This commit introduces two new functions in 'url_encoding.py' which centralize two common patterns for constructing redirect URLs. It also migrates the files using those patterns to use the new functions.	2020-03-25 16:39:17 -07:00
Graham Bleaney	5dca599481	export: Harden s3 export against directory traversal. This commit modifies 'zerver/lib/export.py' to raise an exception in the presence of a suspected attempt at directory traversal.	2020-03-25 16:39:17 -07:00
Emilio López	d3c841d587	email_mirror: also check for Envelope-To After subscribing a stream email address to a Mailman email list and receiving a message from it (using the polling configuration with an Exim + Dovecot mailserver), the following error message is emitted by Zulip: Logger zerver.lib.email_mirror, from module zerver.lib.email_mirror line 77: Error generated by Anonymous user (not logged in) on zulip deployment Sender: "Foo Bar" <foo@example.com> To: No recipient found Missing recipient in mirror email This is because the To: header on the received email corresponds to the email list, and there are no other headers to indicate the final recipient, apart from the "Envelope-To" header added by Exim. To resolve this problem, the commit adds "Envelope-To" to the list of headers to check for a match.	2020-03-25 16:28:46 -07:00
Vishnu KS	f8ddab58ba	billing: Downgrade plan to Limited during realm deactivation. The realm would be instantly downgraded to Limited plan when deactivated. Any extra users that were added in the final month would not be charged.	2020-03-25 10:54:10 -07:00
Tim Abbott	85c9ffd91c	message: Validate propagate_mode parameters. This improves the error handling for invalid values of the propagate_mode parameter to our message editing endpoints. Previously, invalid values would just work like change_one rather than doing nothing.	2020-03-24 12:36:45 -07:00
Anders Kaseorg	39f9abeb3f	python: Convert json.loads(f.read()) to json.load(f). Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-03-24 10:46:32 -07:00
Tim Abbott	180d8abed6	messages: Fix unlikely exception when trying to delete a message.	2020-03-22 21:35:27 -07:00
Tim Abbott	481d351cee	events: Fix buggy apply_events handling of starred_messages. The previous starred_messages race handling did not correctly consider the possibility that an event queue might have been registered without starred_messages.	2020-03-22 21:30:23 -07:00
Mateusz Mandera	27c19b081b	rate_limit: Remove inaccurate docstring on clear_history methods.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	b9e5103d0c	rate_limit: Refactor RateLimiterBackend to operate on keys and rules. Instead of operating on RateLimitedObjects, and making the classes depend on each too strongly. This also allows getting rid of get_keys() function from RateLimitedObject, which was a redis rate limiter implementation detail. RateLimitedObject should only define their own key() function and the logic forming various necessary redis keys from them should be in RedisRateLimiterBackend.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	8069133f88	rate_limit: Remove __str__ methods of RateLimitedObjects. These were clunky from the start and are no longer used, as keys are now used directly for logging purposes.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	4e9f77a6c4	rate_limit: Adjust keys() of some RateLimitedObjects. type().__name__ is sufficient, and much readable than type(), so it's better to use the former for keys. We also make the classes consistent in forming the keys in the format type(self).__name__:identifier and adjust logger.warning and statsd to take advantage of that and simply log the key().	2020-03-22 18:42:35 -07:00
Mateusz Mandera	2c6b1fd575	rate_limit: Rename key_fragment() method to key().	2020-03-22 18:42:35 -07:00
Mateusz Mandera	9c9f8100e7	rate_limit: Add the concept of RateLimiterBackend. This will allow easily swapping and using various implementations of rate-limiting, and separate the implementation logic from RateLimitedObjects.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	85df6201f6	rate_limit: Move functions called by external code to RateLimitedObject.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	3b5b19fde8	tornado: Log shard id in all logs coming from tornado processes. This will make it easier to investigate using logs which requests are being processed by which Tornado process.	2020-03-22 18:26:35 -07:00
Steve Howell	8c1244d0b4	tests: Kill off find_one() helper. This was only recently added. Using tuple assignment raises the same errors, so the indirection probably isn't worth it.	2020-03-20 13:40:20 -07:00
Steve Howell	ef772ee12f	bot events: Prevent duplicate add-bot notifications. We don't need `do_create_user` to send a partial event here for bots. The only caller to `do_create_user` that actually creates bots (apart from some tests that just need data setup) is `add_bot_backend`, which sends the more complete event including bot "extras" like service info. The modified event tests show the simplification here (2 events instead of 3). Also, the bot tests now use tuple unpacking, which will force a ValueError if we duplicate events again.	2020-03-20 13:40:19 -07:00
Steve Howell	f647587675	bulk_create: Handle realms that hide delivery emails.	2020-03-19 16:04:05 -07:00
Steve Howell	ecbbc3e365	performance: Simplify bulk_create_users(). We were going back to the database to get all the users in the realm, when we had them right there already. I believe this is a legacy of us running on a very old version of Django (back in early days), where `bulk_create` didn't give you back ids in a nice way. In the interim we added the `RealmAuditLog` code, which does take advantage of the existing profiles (and proves we can rely on them). But meanwhile we were still doing a query to get all N users in the realm. With `selected_related`! To be fair, bulk_create_users() is by its very nature a pretty infrequent operation. This change is more motivated by code cleanup. Now we just loop through user_ids for the Recipient/Subscriber foreign key rows. I also removed some fairly convoluted code mapping emails to user_ids and just work in user_id space.	2020-03-19 16:04:05 -07:00
Steve Howell	1306239c16	tests: Use email/delivery_email more explicitly. We try to use the correct variation of `email` or `delivery_email`, even though in some databases they are the same. (To find the differences, I temporarily hacked populate_db to use different values for email and delivery_email, and reduced email visibility in the zulip realm to admins only.) In places where we want the "normal" realm behavior of showing emails (and having `email` be the same as `delivery_email`), we use the new `reset_emails_in_zulip_realm` helper. A couple random things: - I fixed any error messages that were leaking the wrong email - a test that claimed to rely on the order of emails no longer does (we sort user_ids instead) - we now use user_ids in some place where we used to use emails - for IRC mirrors I just punted and used `reset_emails_in_zulip_realm` in most places - for MIT-related tests, I didn't fix email vs. delivery_email unless it was obvious I also explicitly reset the realm to a "normal" realm for a couple tests that I frankly just didn't have the energy to debug. (Also, we do want some coverage on the normal case, even though it is "easier" for tests to pass if you mix up `email` and `delivery_email`.) In particular, I just reset data for the analytics and corporate tests.	2020-03-19 16:04:03 -07:00
Steve Howell	42ee2f5e86	tests: Fix test coverage on recent commit. I guess `test_classes` has 100% line coverage enforcement, which is a bit tricky for error handling. This fixes that, as well as making the name snake_case and improving the format of the errors.	2020-03-19 11:37:31 -04:00
Steve Howell	80acbb9fdf	Clean up `test_get_all_profiles_avatar_urls`. This test was using the anti-pattern of doing an assertion inside a conditional. I added the `findOne` helper to make it easier to write robust tests for scenarios like this.	2020-03-19 10:34:35 -04:00
Steve Howell	ca74cd6e37	bug fix: Fix unread counts for certain API messages. If I send a message from a normal Zulip client, it is considered to be "read" by me. But if I send it via an API program (using my human account), the message is not immediately "read" by me. Now we handle this correctly in `get_raw_unread_data`. The symptom of this was that these messages would get "stuck" in "Private Messages" narrows until the next time you reloaded your app.	2020-03-17 16:26:42 -07:00
Mateusz Mandera	5e47f2975e	actions: Optimize query in get_occupied_streams. Using an Exists subquery to avoid scanning the entire Subscription table seems to speed things up greatly. Set up with: ./manage.py populate_db --extra_users 2000 --extra-streams 1000 Tested on my computer, the original function was taking ~1.2seconds, the optimized version only ~0.05-0.06. Likely fixes #13874; we can re-open if after production testing we feel more work is warranted.	2020-03-17 05:44:05 -07:00
Mateusz Mandera	884ff425da	cache: Remove dead code for caching recipients. With recipient column denormalized into all three of Stream, UserProfile and Huddle, there is no more use for this caching.	2020-03-17 05:41:11 -07:00
Mateusz Mandera	b4ce167a88	models: Add recipient foreign key to Huddle. This follows the already tested approach from `8acfa17fe6`.	2020-03-17 05:41:11 -07:00
Steve Howell	fcc5ae5247	invites: Fix regression w/email vs. delivery_email. In `220c2a5ff3` I introduced a query to find invites by delivery_email but was still using email as the key. For most realms `email` and `delivery_email` are synonymous, so this temporary bug would not affect them. For realms that restrict emails, the invite would have probably failed for other reasons, but the symptom would have been less clear.	2020-03-12 10:13:08 -04:00
Steve Howell	1b16693526	tests: Limit email-based logins. We now have this API... If you really just need to log in and not do anything with the actual user: self.login('hamlet') If you're gonna use the user in the rest of the test: hamlet = self.example_user('hamlet') self.login_user(hamlet) If you are specifically testing email/password logins (used only in 4 places): self.login_by_email(email, password) And for failures uses this (used twice): self.assert_login_failure(email)	2020-03-11 17:10:22 -07:00
Steve Howell	c235333041	test performance: Pass in users to api_* helpers. This reduces query counts in some cases, since we no longer need to look up the user again. In particular, it reduces some noise when we count queries for O(N)-related tests. The query count is usually reduced by 2 per API call. We no longer need to look up Realm and UserProfile. In most cases we are saving these lookups for the whole tests, since we usually already have the `user` objects for other reasons. In a few places we are simply moving where that query happens within the test. In some places I shorten names like `test_user` or `user_profile` to just be `user`.	2020-03-11 14:18:29 -07:00
Steve Howell	626ad0078d	tests: Add uuid_get and uuid_post. We want a clean codepath for the vast majority of cases of using api_get/api_post, which now uses email and which we'll soon convert to accepting `user` as a parameter. These apis that take two different types of values for the same parameter make sweeps like this kinda painful, and they're pretty easy to avoid by extracting helpers to do the actual common tasks. So, for example, here I still keep a common method to actually encode the credentials (since the whole encode/decode business is an annoying detail that you don't want to fix in two places): def encode_credentials(self, identifier: str, api_key: str) -> str: """ identifier: Can be an email or a remote server uuid. """ credentials = "%s:%s" % (identifier, api_key) return 'Basic ' + base64.b64encode(credentials.encode('utf-8')).decode('utf-8') But then the rest of the code has two separate codepaths. And for the uuid functions, we no longer have crufty references to realm. (In fairness, realm will also go away when we introduce users.) For the `is_remote_server` helper, I just inlined it, since it's now only needed in one place, and the name didn't make total sense anyway, plus it wasn't a super robust check. In context, it's easier just to use a comment now to say what we're doing: # If `role` doesn't look like an email, it might be a uuid. if settings.ZILENCER_ENABLED and role is not None and '@' not in role: # do stuff	2020-03-11 14:18:29 -07:00
Steve Howell	00dc976379	tests: Use users for common_subscribe_to_streams. We also use users for get_streams().	2020-03-11 14:18:29 -07:00
Mateusz Mandera	89394fc1eb	middleware: Use request.user for logging when possible. Instead of trying to set the _requestor_for_logs attribute in all the relevant places, we try to use request.user when possible (that will be when it's a UserProfile or RemoteZulipServer as of now). In other places, we set _requestor_for_logs to avoid manually editing the request.user attribute, as it should mostly be left for Django to manage it. In places where we remove the "request._requestor_for_logs = ..." line, it is clearly implied by the previous code (or the current surrounding code) that request.user is of the correct type.	2020-03-09 13:54:58 -07:00
Mateusz Mandera	0255ca9b6a	middleware: Log user.id/realm.string_id instead of _email.	2020-03-09 13:54:58 -07:00
Tim Abbott	5835023021	tests: Use user IDs internally in send message helpers. This uses the better, modern, user ID based API for sending messages internally in the test suite, something that's convenient to do as a follow-up to the migration to pass UserProfile objects to these functions.	2020-03-07 18:31:13 -08:00
Steve Howell	5e2a32c936	tests: Use users in send_*_message. This commit mostly makes our tests less noisy, since emails are no longer an important detail of sending messages (they're not even really used in the API). It also sets us up to have more scrutiny on delivery_email/email in the future for things that actually matter. (This is a prep commit for something along those lines, kind of hard to explain the full plan.)	2020-03-07 18:30:13 -08:00
Vishnu KS	1c6435d4cc	validator: Optionally record a type_structure attribute. We plan to use these records to check and record the schema of Zulip's events for the purposes of API documentation. Based on an original messier commit by tabbott. In theory, a nicer version of this would be able to work directly off the mypy type system, but this will be good enough for our use case.	2020-03-06 17:07:14 -08:00

1 2 3 4 5 ...

4915 Commits