zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	a054f57af6	message: Bundle message stripping, validation, and truncation. We always want to do these at the same time. Previously, message editing did too much stripping (fixes #16837) and failed to check for NUL bytes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-12-18 17:44:13 -08:00
palash	34003fc7f6	email_mirror: Change root logger to zerver.lib.email_mirror.	2020-09-12 10:53:56 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
Emilio López	7b35234c7b	email_mirror: Fix exception handling unstructured headers. This commit rewrites the way addresses are collected. If the header with the address is not an AddressHeader (for instance, Delivered-To and Envelope-To), we take its string representation. Fixes: #15864 ("Error in email_mirror - _UnstructuredHeader has no attribute addresses").	2020-07-22 12:11:25 -07:00
Anders Kaseorg	1ed2d9b4a0	logging: Use logging.exception and exc_info for unexpected exceptions. logging.exception() and logging.debug(exc_info=True), etc. automatically include a traceback. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	a803e68528	email-mirror-postfix: Handle 8-bit messages correctly. Since JSON can’t represent bytes, we encode them with base64. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 20:24:06 -07:00
Anders Kaseorg	bff3dcadc8	email: Migrate to new Python ≥ 3.3 email API. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 20:24:06 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
Anders Kaseorg	8dd83228e7	python: Convert "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, but with the NamedTuple changes reverted (see commit `ba7906a3c6`, #15132). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 15:31:20 -07:00
Anders Kaseorg	0bcd4a6281	email_mirror: Avoid parameters that shadow names in their own types. They confuse semgrep, and also, like, people in general. https://github.com/returntocorp/semgrep/issues/923 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-05 09:35:11 -07:00
Anders Kaseorg	bdc365d0fe	logging: Pass format arguments to logging. https://docs.python.org/3/howto/logging.html#optimization Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-02 10:18:02 -07:00
Anders Kaseorg	fead14951c	python: Convert assignment type annotations to Python 3.6 style. This commit was split by tabbott; this piece covers the vast majority of files in Zulip, but excludes scripts/, tools/, and puppet/ to help ensure we at least show the right error messages for Xenial systems. We can likely further refine the remaining pieces with some testing. Generated by com2ann, with whitespace fixes and various manual fixes for runtime issues: - invoiced_through: Optional[LicenseLedger] = models.ForeignKey( + invoiced_through: Optional["LicenseLedger"] = models.ForeignKey( -_apns_client: Optional[APNsClient] = None +_apns_client: Optional["APNsClient"] = None - notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - signup_notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + signup_notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - author: Optional[UserProfile] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) + author: Optional["UserProfile"] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) - bot_owner: Optional[UserProfile] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) + bot_owner: Optional["UserProfile"] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) - default_sending_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) - default_events_register_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_sending_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_events_register_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) -descriptors_by_handler_id: Dict[int, ClientDescriptor] = {} +descriptors_by_handler_id: Dict[int, "ClientDescriptor"] = {} -worker_classes: Dict[str, Type[QueueProcessingWorker]] = {} -queues: Dict[str, Dict[str, Type[QueueProcessingWorker]]] = {} +worker_classes: Dict[str, Type["QueueProcessingWorker"]] = {} +queues: Dict[str, Dict[str, Type["QueueProcessingWorker"]]] = {} -AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional[LDAPSearch] = None +AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional["LDAPSearch"] = None Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-22 11:02:32 -07:00
Udit107710	db30cf470c	refactor: Making email_mirror independent of actions. Moved truncate_body, truncate_content and truncate_topic to message.py.	2020-04-18 16:58:29 -07:00
Emilio López	d3c841d587	email_mirror: also check for Envelope-To After subscribing a stream email address to a Mailman email list and receiving a message from it (using the polling configuration with an Exim + Dovecot mailserver), the following error message is emitted by Zulip: Logger zerver.lib.email_mirror, from module zerver.lib.email_mirror line 77: Error generated by Anonymous user (not logged in) on zulip deployment Sender: "Foo Bar" <foo@example.com> To: No recipient found Missing recipient in mirror email This is because the To: header on the received email corresponds to the email list, and there are no other headers to indicate the final recipient, apart from the "Envelope-To" header added by Exim. To resolve this problem, the commit adds "Envelope-To" to the list of headers to check for a match.	2020-03-25 16:28:46 -07:00
Mateusz Mandera	8069133f88	rate_limit: Remove __str__ methods of RateLimitedObjects. These were clunky from the start and are no longer used, as keys are now used directly for logging purposes.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	4e9f77a6c4	rate_limit: Adjust keys() of some RateLimitedObjects. type().__name__ is sufficient, and much readable than type(), so it's better to use the former for keys. We also make the classes consistent in forming the keys in the format type(self).__name__:identifier and adjust logger.warning and statsd to take advantage of that and simply log the key().	2020-03-22 18:42:35 -07:00
Mateusz Mandera	2c6b1fd575	rate_limit: Rename key_fragment() method to key().	2020-03-22 18:42:35 -07:00
Mateusz Mandera	9c9f8100e7	rate_limit: Add the concept of RateLimiterBackend. This will allow easily swapping and using various implementations of rate-limiting, and separate the implementation logic from RateLimitedObjects.	2020-03-22 18:42:35 -07:00
Mateusz Mandera	85df6201f6	rate_limit: Move functions called by external code to RateLimitedObject.	2020-03-22 18:42:35 -07:00
Steve Howell	28a8ffbc4c	email_mirror: Use internal_send_stream_message(). This is just a refactoring to the more modern API for sending internal messages. To make this work we now plumb the email_gateway flag through `internal_send_stream_message` instead of `internal_send_message`. We also change `send_zulip` to have its callers pass in a full UserProfile object (which one of them already had).	2020-02-10 15:45:13 -08:00
Mateusz Mandera	8bd3752d13	email_mirror: Handle encoded attachment filenames.	2020-01-30 13:03:47 -08:00
Mateusz Mandera	49b76318c6	email_mirror: Extract handle_header_content function.	2020-01-30 13:03:47 -08:00
Mateusz Mandera	9dcf677bf9	email_mirror: Parse encoded From headers with show_sender=True.	2020-01-29 12:27:35 -08:00
Mateusz Mandera	d37e6ef921	email_mirror: Use plaintext if html body empty with prefer-html option. If an email is sent with the .prefer-html option, but it has no html body, it's better to fall back to plaintext content instead of treating it as a user error.	2020-01-16 15:25:27 -08:00
Mateusz Mandera	0c9c218e91	email_mirror: Add prefer-html and prefer-text address options. Closes #13484. These options tell zulip whether to prefer the plaintext or html version of the email message. prefer-text is the default behavior, so including the option doesn't change anything as of now, but we're adding it to prepare to potentially change the default behavior in the future.	2020-01-16 15:25:19 -08:00
Mateusz Mandera	0beae44081	email_mirror: Use .walk() to search all MIME parts for attachments. Fixes #13416 We used to search only one level in depth through the MIME structure, and thus would miss attachments that were nested deeper (which can happen with some email clients). We can take advantage of message.walk() to iterate through each MIME part.	2020-01-14 15:37:39 -08:00
Mateusz Mandera	1561d144e0	email_mirror: Insert a new line before attachment links.	2020-01-14 15:37:39 -08:00
Mateusz Mandera	d5ac1afce8	email_mirror: Check address usability in get_missed_message_address.	2020-01-12 20:43:51 -08:00
Mateusz Mandera	89046ea1a9	email_mirror: Give extract_and_validate a more descriptive name.	2020-01-12 11:30:18 -08:00
Mateusz Mandera	90a69ab24f	email_mirror: Reuse exception messages in mirror_email_message.	2020-01-12 11:30:18 -08:00
Mateusz Mandera	b87cf22b33	email_mirror: Move send_to_mm_address code to process_missed_message. process_missed_message did nothing other than calling send_to_missed_message_address with the same arguments, so there's no reason to have these as separate functions.	2020-01-07 13:03:32 -08:00
Mateusz Mandera	c011d2c6d3	email_mirror: Migrate missed message addresses from redis to database. Addresses point 1 of #13533. MissedMessageEmailAddress objects get tied to the specific that was missed by the user. A useful benefit of that is that email message sent to that address will handle topic changes - if the message that was missed gets its topic changed, the email response will get posted under the new topic, while in the old model it would get posted under the old topic, which could potentially be confusing. Migrating redis data to this new model is a bit tricky, so the migration code has comments explaining some of the compromises made there, and test_migrations.py tests handling of the various possible cases that could arise.	2020-01-07 13:03:22 -08:00
Mateusz Mandera	1c5461663f	users: Eliminate some unnecessary get_personal_recipient calls.	2019-12-09 15:24:35 -08:00
Tim Abbott	6618cec9db	logging: Switch various logging code paths to use user IDs. This fixes EMAIL_ADDRESS_VISIBILITY_ADMINS support as well as being more reliable/stable over time.	2019-11-15 17:24:01 -08:00
Mateusz Mandera	3271235200	email_mirror: Ignore missed message email if the user isn't active.	2019-09-20 17:58:10 -07:00
Mateusz Mandera	f1b135bd16	email_mirror: Rename include-quotations to include-quotes.	2019-07-20 15:53:43 -07:00
Mateusz Mandera	58754830fd	email_mirror: Rename "include-footers" option to "include-footer".	2019-07-08 20:10:21 -07:00
Mateusz Mandera	569d79b9d8	email_mirror: Add support for "+include-quotations" in address. We add an option to disable the stripping of quotations from the email body, if "+include-quotations" token is included in the email address.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	e4138c5463	email_mirror: Add support for "+include-footers" in address. In addition to the "+show-sender" option, we now add "+include-footers" which disables stripping of the footer from the email body if this token is included in the email address.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	a5aa4adb54	email_mirror: Add general support for optional tokens in the address. To enable a comfortable way of adding more optional tokens in the address (like current '+show-sender') we change decode_email_address to return a general dictionary containing options specified through adding these optional tokens in the To: address. For now, we only have "+show-sender", but more can be easily added using this change.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	a0efd76f4e	email_mirror: Rewrite log_and_report and cover it with tests. log_and_report and its helper functions were mostly old code no longer well adapted to how email mirror works currently, as well as having no test coverage. We rewrite this part of the email to report errors in a similar manner, and add tests for it. We're able to get rid of the clunky and now useless debug_info dictionary in process message, as log_and_report only needs the recipient email in its third argument.	2019-05-20 19:35:32 -07:00
Mateusz Mandera	2adcdd0c25	email_mirror: Don't pass debug_info to process_stream_message. The only place in which process_stream_message used debug_info was to set the 'stream' key, which would only be used if ZulipEmailForwardError was raised after this line in the code - which is impossible, because after that line only send_zulip (which doesnt raise this exception) and logger.info get called, then process_stream_message successfully returns and then process_message succesfully returns as well. So this debug_info code wasn't doing anything. We remove it.	2019-05-20 19:35:32 -07:00
Mateusz Mandera	40f5755546	email_mirror: Handle case of unspecified charset in Content-Type header. If the text part of an email message didn't specify the charset in the Content-Type header, the text content wouldn't be found. We fix this, by assuming us-ascii charset in those cases, as specified by RFC6657: https://tools.ietf.org/html/rfc6657	2019-05-09 09:57:40 -07:00
Mateusz Mandera	c1ceba9037	rate_limiter: Move email_mirror limiter to use rate_limit_entity. We change the rate limiting code in the email mirror to use the new, general rate_limit_entity function.	2019-05-01 12:54:32 -07:00
Anders Kaseorg	643bd18b9f	lint: Fix code that evaded our lint checks for string % non-tuple. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-23 15:21:37 -07:00
Mateusz Mandera	c7c1dbec60	email_mirror: Raise ZulipEmailForwardError if email pattern not recognised. With the previous commit, fixes #1836. As specified in the issue above, we make get_email_gateway_message_string_from_address raise an exception if it doesn't recognise the email gateway address pattern. Then, we make appropriate adjustments in the codepaths which call this function.	2019-03-21 15:25:57 -07:00
Mateusz Mandera	e32c444ecf	email_mirror: Move some helper functions out of actions.py. These functions don't really belong in actions.py, so we move them out, into email_mirror_helpers.py. They can't go directly into email_mirror.py or we'd get circular imports resulting in ImportError.	2019-03-21 15:25:57 -07:00
Mateusz Mandera	1901775383	email_mirror: Add realm-based rate limiting. Closes #2420 We add rate limiting (max X emails withing Y seconds per realm) to the email mirror. By creating RateLimitedRealmMirror class, inheriting from RateLimitedObject, and rate_limit_mirror_by_realm function, following a mechanism used by rate_limit_user, we're able to have this implementation mostly rely on the already existing, and proven over time, rate_limiter.py code. The rules are configurable in settings.py in RATE_LIMITING_MIRROR_REALM_RULES, analogically to RATE_LIMITING_RULES. Rate limit verification happens in the MirrorWorker in queue_processors.py. We don't rate limit missed message emails, as due to using one time addresses, they're not a spam threat. test_mirror_worker is adapted to the altered MirrorWorker code and a new test - test_mirror_worker_rate_limiting is added in test_queue_worker.py to provide coverage for these changes.	2019-03-18 11:16:58 -07:00

1 2 3

140 Commits