zulip

Commit Graph

Author	SHA1	Message	Date
Mateusz Mandera	49b76318c6	email_mirror: Extract handle_header_content function.	2020-01-30 13:03:47 -08:00
Mateusz Mandera	9dcf677bf9	email_mirror: Parse encoded From headers with show_sender=True.	2020-01-29 12:27:35 -08:00
Mateusz Mandera	d37e6ef921	email_mirror: Use plaintext if html body empty with prefer-html option. If an email is sent with the .prefer-html option, but it has no html body, it's better to fall back to plaintext content instead of treating it as a user error.	2020-01-16 15:25:27 -08:00
Mateusz Mandera	0c9c218e91	email_mirror: Add prefer-html and prefer-text address options. Closes #13484. These options tell zulip whether to prefer the plaintext or html version of the email message. prefer-text is the default behavior, so including the option doesn't change anything as of now, but we're adding it to prepare to potentially change the default behavior in the future.	2020-01-16 15:25:19 -08:00
Mateusz Mandera	0beae44081	email_mirror: Use .walk() to search all MIME parts for attachments. Fixes #13416 We used to search only one level in depth through the MIME structure, and thus would miss attachments that were nested deeper (which can happen with some email clients). We can take advantage of message.walk() to iterate through each MIME part.	2020-01-14 15:37:39 -08:00
Mateusz Mandera	1561d144e0	email_mirror: Insert a new line before attachment links.	2020-01-14 15:37:39 -08:00
Mateusz Mandera	d5ac1afce8	email_mirror: Check address usability in get_missed_message_address.	2020-01-12 20:43:51 -08:00
Mateusz Mandera	89046ea1a9	email_mirror: Give extract_and_validate a more descriptive name.	2020-01-12 11:30:18 -08:00
Mateusz Mandera	90a69ab24f	email_mirror: Reuse exception messages in mirror_email_message.	2020-01-12 11:30:18 -08:00
Mateusz Mandera	b87cf22b33	email_mirror: Move send_to_mm_address code to process_missed_message. process_missed_message did nothing other than calling send_to_missed_message_address with the same arguments, so there's no reason to have these as separate functions.	2020-01-07 13:03:32 -08:00
Mateusz Mandera	c011d2c6d3	email_mirror: Migrate missed message addresses from redis to database. Addresses point 1 of #13533. MissedMessageEmailAddress objects get tied to the specific that was missed by the user. A useful benefit of that is that email message sent to that address will handle topic changes - if the message that was missed gets its topic changed, the email response will get posted under the new topic, while in the old model it would get posted under the old topic, which could potentially be confusing. Migrating redis data to this new model is a bit tricky, so the migration code has comments explaining some of the compromises made there, and test_migrations.py tests handling of the various possible cases that could arise.	2020-01-07 13:03:22 -08:00
Mateusz Mandera	1c5461663f	users: Eliminate some unnecessary get_personal_recipient calls.	2019-12-09 15:24:35 -08:00
Tim Abbott	6618cec9db	logging: Switch various logging code paths to use user IDs. This fixes EMAIL_ADDRESS_VISIBILITY_ADMINS support as well as being more reliable/stable over time.	2019-11-15 17:24:01 -08:00
Mateusz Mandera	3271235200	email_mirror: Ignore missed message email if the user isn't active.	2019-09-20 17:58:10 -07:00
Mateusz Mandera	f1b135bd16	email_mirror: Rename include-quotations to include-quotes.	2019-07-20 15:53:43 -07:00
Mateusz Mandera	58754830fd	email_mirror: Rename "include-footers" option to "include-footer".	2019-07-08 20:10:21 -07:00
Mateusz Mandera	569d79b9d8	email_mirror: Add support for "+include-quotations" in address. We add an option to disable the stripping of quotations from the email body, if "+include-quotations" token is included in the email address.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	e4138c5463	email_mirror: Add support for "+include-footers" in address. In addition to the "+show-sender" option, we now add "+include-footers" which disables stripping of the footer from the email body if this token is included in the email address.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	a5aa4adb54	email_mirror: Add general support for optional tokens in the address. To enable a comfortable way of adding more optional tokens in the address (like current '+show-sender') we change decode_email_address to return a general dictionary containing options specified through adding these optional tokens in the To: address. For now, we only have "+show-sender", but more can be easily added using this change.	2019-06-02 10:50:59 -07:00
Mateusz Mandera	a0efd76f4e	email_mirror: Rewrite log_and_report and cover it with tests. log_and_report and its helper functions were mostly old code no longer well adapted to how email mirror works currently, as well as having no test coverage. We rewrite this part of the email to report errors in a similar manner, and add tests for it. We're able to get rid of the clunky and now useless debug_info dictionary in process message, as log_and_report only needs the recipient email in its third argument.	2019-05-20 19:35:32 -07:00
Mateusz Mandera	2adcdd0c25	email_mirror: Don't pass debug_info to process_stream_message. The only place in which process_stream_message used debug_info was to set the 'stream' key, which would only be used if ZulipEmailForwardError was raised after this line in the code - which is impossible, because after that line only send_zulip (which doesnt raise this exception) and logger.info get called, then process_stream_message successfully returns and then process_message succesfully returns as well. So this debug_info code wasn't doing anything. We remove it.	2019-05-20 19:35:32 -07:00
Mateusz Mandera	40f5755546	email_mirror: Handle case of unspecified charset in Content-Type header. If the text part of an email message didn't specify the charset in the Content-Type header, the text content wouldn't be found. We fix this, by assuming us-ascii charset in those cases, as specified by RFC6657: https://tools.ietf.org/html/rfc6657	2019-05-09 09:57:40 -07:00
Mateusz Mandera	c1ceba9037	rate_limiter: Move email_mirror limiter to use rate_limit_entity. We change the rate limiting code in the email mirror to use the new, general rate_limit_entity function.	2019-05-01 12:54:32 -07:00
Anders Kaseorg	643bd18b9f	lint: Fix code that evaded our lint checks for string % non-tuple. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-23 15:21:37 -07:00
Mateusz Mandera	c7c1dbec60	email_mirror: Raise ZulipEmailForwardError if email pattern not recognised. With the previous commit, fixes #1836. As specified in the issue above, we make get_email_gateway_message_string_from_address raise an exception if it doesn't recognise the email gateway address pattern. Then, we make appropriate adjustments in the codepaths which call this function.	2019-03-21 15:25:57 -07:00
Mateusz Mandera	e32c444ecf	email_mirror: Move some helper functions out of actions.py. These functions don't really belong in actions.py, so we move them out, into email_mirror_helpers.py. They can't go directly into email_mirror.py or we'd get circular imports resulting in ImportError.	2019-03-21 15:25:57 -07:00
Mateusz Mandera	1901775383	email_mirror: Add realm-based rate limiting. Closes #2420 We add rate limiting (max X emails withing Y seconds per realm) to the email mirror. By creating RateLimitedRealmMirror class, inheriting from RateLimitedObject, and rate_limit_mirror_by_realm function, following a mechanism used by rate_limit_user, we're able to have this implementation mostly rely on the already existing, and proven over time, rate_limiter.py code. The rules are configurable in settings.py in RATE_LIMITING_MIRROR_REALM_RULES, analogically to RATE_LIMITING_RULES. Rate limit verification happens in the MirrorWorker in queue_processors.py. We don't rate limit missed message emails, as due to using one time addresses, they're not a spam threat. test_mirror_worker is adapted to the altered MirrorWorker code and a new test - test_mirror_worker_rate_limiting is added in test_queue_worker.py to provide coverage for these changes.	2019-03-18 11:16:58 -07:00
Mateusz Mandera	a64a075ff1	email_mirror: Ignore stream_name part of receiving address. To prepare for changing how the stream name gets encoded into mirror email addresses while making sure old addresses keep working, we ignore the stream_name part when receiving emails into the mirror and we only look at the email_token to identify into which stream to mirror the email.	2019-03-18 11:06:51 -07:00
Tim Abbott	50dc317466	notifications: Rename notifications.py to email_notifications.py. This library is entirely about email notifications specifically, and this rename should help make the codebase more readable.	2019-03-15 11:02:17 -07:00
Mateusz Mandera	edcb6d57fc	email_mirror: Don't remove quotations from forwarded messages. Addresses point 2 of #10612. We use a regex to detect if a form of FWD indicator is present at the beginning of the subject, which means the message has been forwarded. remove_quotations argument is added to a couple of functions where it's necessary. In filter_footer, the criteria for a line to be a possible beginning of a footer is changed to line.strip() == "--", instead of line.strip().startswith("--"), because the former would remove quotations from plaintext emails. This change makes sense, because RFC 3676 specifies ""-- " as the separator line between the body and the signature of a message": https://tools.ietf.org/html/rfc3676	2019-03-09 15:36:17 -08:00
Mateusz Mandera	0633f268fb	email_mirror: Move subject processing into process_stream_message. We remove the 'subject' argument of process_stream_message and make subject processing happen inside the function, as it's a more appropriate place than the general process_message function and is needed to have a good way of disabling removing quotations in forwarded emails sent into the mirror.	2019-03-09 15:36:17 -08:00
Mateusz Mandera	dbff533e09	email_mirror: Add the sender at the start of stream message. Fixes part 3 of #10612. When sending an email to the email mirror to a stream address, if "+show-sender" is added in the address, the stream message will now include "From: <sender>" at the top.	2019-03-07 14:28:33 -08:00
Tim Abbott	7099d01641	email mirror: Fix missing variable for logging.	2019-02-13 13:16:55 -08:00
Eeshan Garg	179b747769	streams: Refactor multi-option helpers into separate functions. For internal stream messages, most of the time, we have access to a Stream object. For the few corner cases where we don't, it is a much cleaner approach to have a separate function that accepts a stream name than having one multi-option helper that accepts both names and objects.	2019-02-12 11:10:26 -08:00
Eeshan Garg	3470e541c8	internal_send_stream_message: Support accepting a Stream object. If the caller has access to a Stream object, it is wasteful to query a database for a stream by ID or name. In addition, not having to go through stream names eliminates various classes of possible bugs involved with re-fetching the Stream object by name.	2019-02-08 08:59:03 -08:00
Anders Kaseorg	f0ecb93515	zerver core: Remove unused imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:41:24 -08:00
Tim Abbott	a92a5f19f0	email_mirror: Handle case where email body is empty. This provides logging that makes clear this situation is a user error.	2019-01-15 11:30:15 -08:00
Tim Abbott	14b2ed649b	email_mirror: Don't email errors for emails missing body type. This lowers the severity on content type errors to not send spammy error emails, and instead just log a warning.	2019-01-15 11:30:15 -08:00
Tim Abbott	828577c3b2	email_mirror: Fix parsing of unicode in subject headers. Previously, we had some hand-written logic for parsing the subject line of the email's headers and turning it into a Python string using each of the valid encodings for an email. That logic was buggy, and sometimes resulted in a bytes object being passed into the `send_zulip`, which would eventually throw an exception. The fix for this is to use the Python standard library make_header method for handling internationalized email headers. https://stackoverflow.com/questions/7331351/python-email-header-decoding-utf-8	2019-01-07 10:21:04 -08:00
Mateusz Mandera	87c95c6f41	email_mirror: Strip RE and FWD from email subject. Fixes part 1 of #10612. We use a regex to remove RE:, FWD: (and similar variations) from email subjects. Unit test is included, we add subjects.json in fixtures containing various subjects to try the stripping on.	2019-01-05 15:59:19 -08:00
Tom Daff	f2e06128c6	email_mirror: Add email address parsing. When trying to find the email gateway address, use the `email.util.getaddresses` function to deal with cases where multiple recipients are included in the email header or the stream address appears as an angle-addr with a name given (e.g. if someone added it to their address book). Added some other headers where the required address may appear: "Resent" headers are sometimes used for forwarding, and streams may also be found in CC. There is no way to find the address if the email was recieved as a BCC.	2019-01-03 13:34:20 -08:00
Tim Abbott	adf27aae4c	python: Remove now-unnecessary str_utils library. This library was absolutely essential as part of our Python 2->3 migration process, but all of its calls should be either no-ops or encode/decode operations. Note also that the library has been wrong since the incorrect refactoring in `1f9244e060`. Fixes #10807.	2018-11-27 11:57:54 -08:00
Marco Burstein	6f569719c9	integrations: Change the truncation marker for long messages. Change the truncation marker from `...` to `\n[message truncated]` when receiving messages from the API or through e-mail. Also, update tests to account for the new change. Fix #10871.	2018-11-26 11:09:39 -08:00
Steve Howell	af1acf9239	Rename constant to MAX_TOPIC_NAME_LENGTH.	2018-11-07 10:03:53 -08:00
Steve Howell	2fd0cfe708	Use topic_name() helper in more places.	2018-11-07 10:03:53 -08:00
Tim Abbott	704967faa4	email_mirror: Don't import talon unless we're using it. Talon is an expensive import; on my system, deferring this import saves 28ms on the import time for Zulip.	2018-10-17 11:25:38 -07:00
Tim Abbott	75376a3fc5	email_mirror: Limit message length using defined constants. Previously, we had the somewhat arbitrary limit of 2K characters (which some users complained about), as well as the constant 60 for the topic.	2018-09-21 10:39:57 -07:00
Aditya Bansal	1f9244e060	zerver/lib: Change use of typing.Text to str.	2018-05-10 14:19:49 -07:00
Tim Abbott	2cdd367d49	email_mirror: Fix handling of empty topic. Also fixs some corner cases around pure-whitespace topics, and migrates from the years-obsolete "no subject". Fixes #9207.	2018-04-26 10:21:29 -07:00
Puneeth Chaganti	4ce8f2aaa2	upload: Rename upload_message_image to upload_message_file. Tweaked by tabbott to also fix a Slack import comment.	2018-03-30 13:38:31 -07:00
rht	9161f8c39b	zerver/lib: Remove u prefix from strings.	2018-02-05 12:12:58 -08:00
neiljp (Neil Pilgrim)	25d5a2ee1c	requirements: Upgrade mypy to 0.560. Fixes #7835.	2017-12-20 18:09:36 -08:00
Tim Abbott	5306a9634d	email_mirror: Rewrite to not use internal_send_message. This was causing problems with the fact that `get_system_bot` now only works for actual system bot users.	2017-11-26 17:14:23 -08:00
Robert Hönig	0e0a8a2b14	queue processor tests: Call consume by default. This significantly improves the API for queue_json_publish to not be overly focused on what the behavior of this function should be in our unit tests.	2017-11-26 11:45:34 -08:00
rht	561ba33f69	zerver/lib: Use python 3 syntax for typing. Split by tabbott from a larger commit; this covers a batch of files with no open PRs touching them.	2017-11-21 20:45:52 -08:00
rht	5ee40bf718	Remove usage of six.moves.binary_type.	2017-11-09 10:00:00 -08:00
Harshit Bansal	65838bb825	email_gateway: Disable code block processor for email gateway. Generally emails are not written with markdown in mind and hence sometimes render in strange ways. This commit fixes a particular issue that was causing whitespace before paragraphs to be treated as code block due to which email content was being rendered in a box that scrolls in right direction a lot. Fixes: #7045.	2017-11-09 09:56:35 -08:00
rht	fef7d6ba09	zerver/lib: Remove u prefix from strings. License: Apache-2.0 Signed-off-by: rht <rhtbot@protonmail.com>	2017-11-03 15:34:37 -07:00
Steve Howell	8b012c6210	Extract get_personal_recipient().	2017-10-28 17:57:39 -07:00
Tim Abbott	e5df05fd35	tests: Suppress logging spam in email mirror tests.	2017-10-27 16:06:03 -07:00
derAnfaenger	18e5bcbbb1	tests: Enable call_consume_in_tests for email mirror queue.	2017-10-26 14:53:27 -07:00
Tim Abbott	c61b6d06e5	email_mirror: Strip content before checking for empty emails. This may fix an exception we were getting of the form: "Error queueing internal message by emailgateway@zulip.com: Message must not be empty".	2017-10-18 21:13:03 -07:00
Tim Abbott	069f681bc6	email_mirror: Filter out null characters in email bodies. They're rarely useful, usually displayed invisibly in most tools anyway, and this helps make sure the message makes it into Zulip rather than being rejected.	2017-10-03 15:32:05 -07:00
Tim Abbott	167c3570f8	email_mirror: Extract construct_message_body.	2017-10-03 15:32:05 -07:00
rht	035ed93111	zerver/lib: remove `import six`.	2017-09-27 19:10:28 -07:00
rht	f43e54d352	zerver/lib: Remove absolute_import.	2017-09-27 10:00:39 -07:00
Greg Price	dfbd80f302	email_mirror: Convert subjects back to str from Redis's bytes. Redis and the Redis client know nothing but bytes. When we take a `bytes` object it returns and pass it down as `subject` here, it causes an exception deep inside message processing if the realm has any filters, when `bugdown.subject_links` attempts to search the subject for the filters, which are of course `str` patterns. For symmetry, make the conversion to bytes on the storing side explicit too.	2017-08-25 16:14:33 -07:00
Rishi Gupta	3bc74113ad	utils: Cast generate_random_token to str. Having this be Text is forcing various URLs, emails, etc to be type annotated as Text.	2017-07-17 23:18:47 -07:00
Tim Abbott	786b339b96	email_mirror: Fix exception for emails with no valid content type. If a broken email shows up with no text or email content-type, we were attempting to return an undefined variable.	2017-07-13 22:19:49 -07:00
Rishi Gupta	1291ac9eff	emails: Groundwork to personalize reply-to name of missed message emails.	2017-07-05 15:33:01 -07:00
Rishi Gupta	a8af0b6d91	email_mirror: Change missed message noreply to not have a display name. It's better to see "noreply@..." when replying to a message that you can't reply to than "Zulip".	2017-07-04 14:25:01 -07:00
Eklavya Sharma	1d8c316ff0	mypy: Make email_mirror pass --strict-optional check.	2017-05-24 18:49:54 -07:00
Konstantin Gukov	dd76222a3f	Fetch system bots using new get_system_bot function. This eliminate a bunch of uninteresting calls to get_user_profile_by_email.	2017-05-23 10:30:40 -07:00
Aditya Bansal	cbaace87cd	pep8: Add compliance with rule E261 to email_mirror.py.	2017-05-07 23:21:50 -07:00
K.Kanakhin	e3e52e7284	email-mirror: Move postfix email mirror integration to separate script. This fixes a performance problem where we were previously starting up a full Django process (~0.7s even on a fast machine) every time a new email came in, potentially allowing users to accidentally DoS a Zulip server. Now, we just post over HTTPS, allowing the existing thread pool support to do its job. - Add script wrapper to communicate postfix pipe with django web server over HTTP(S). It uses shared_secret authentication mode. - Add django view to process messages from email mirror server. - Clean management command `email-mirror`. Left just functional for cron email processing. - Add routes for new tornado view. - Change pipe script in master process postfix config template based on updated script. - Add tests. Tweaked by tabbott to adjust the directory and set better defaults. Fixes #2421.	2017-04-24 21:24:23 -07:00
Tim Abbott	9866124b78	mypy: Fix some new errors flagged by latest mypy master. Mostly list -> List bugs in annotations.	2017-03-19 21:03:45 -07:00
Rishi Gupta	5dc683ba8d	Use Realm.string_id instead of Realm.domain when logging.	2017-03-13 09:42:14 -07:00
Tim Abbott	75e81253f2	mypy: Work around several new mypy bugs in 0.501.	2017-03-04 15:33:39 -08:00
Raghav Jajodia	a3a03bd6a5	mypy: Added Dict, List and Set imports. Fixed mypy errors associated with the upgrade.	2017-03-04 14:33:44 -08:00
PhilSk	53f3d84af2	attachment: Add 'size' field tracking size of uploaded files. This tracking will make it possible in the future to limit the total size of uploads on a per-user or per-organization basis. Fixes #3774.	2017-03-01 15:58:21 -08:00
Tim Abbott	4e171ce787	lint: Clean up E126 PEP-8 rule.	2017-01-23 22:06:13 -08:00
Tim Abbott	99c5563bc6	internal_send_message: Make realm argument mandatory. A lot of care has been taken to ensure we're using the realm that the message is being sent into, not the realm of the sender, to correctly handle the logic for cross-realm bot users such as the notifications bot.	2017-01-21 21:37:30 -08:00
Robert Hönig	0917493588	mypy: Convert zerver/lib to use typing.Text.	2016-12-25 10:33:45 -08:00
Tim Abbott	4a4664d268	mypy: Remove a bunch of now-unnecessary type: ignore annotations. Since mypy and typeshed have advanced a lot over the last several months, we no longer need these `type: ignore` annotations.	2016-10-17 11:48:34 -07:00
Steve Howell	f0eaee68e4	bug: Fix traceback in get_missed_message_token_from_address(). If you supplied an unrecognizable address to our email system, or you had EMAIL_GATEWAY_PATTERN configured wrong, the get_missed_message_token_from_address() used to crash hard and cryptically with a traceback saying that you can't call startswith() on a None object. Now we throw a ZulipEmailForwardError exception. This will still lead to a traceback, but it should be easier to diagnose the problem.	2016-09-22 13:41:26 -07:00
Steve Howell	dbbc64dbfe	bug: Fix code that mis-identifies missed message formats. In our email mirror, we have a special format for missed message emails that uses a 32-bit randomly generated token that we put into redis that is then prefixed with "mm" for a total of 34 characters. We had a bug where we would mis-classify emails like mmcfoo@example.com as being these system-generated emails that were part of the redis setup. It's actually a little unclear how the bug in the library function would have manifested from the user's point of view, but it was definitely buggy code, and it's possibly related in a subtle way to an error report we got from a customer where only one of their users, who happened to have a name like mmcfoo, was having problems with the mirror.	2016-09-22 13:41:26 -07:00
Steve Howell	0b7cac04d4	email mirror: Extract is_mm_32_format().	2016-09-22 13:41:26 -07:00
Tim Abbott	4423222d92	email_mirror: Add successful processing logging.	2016-09-08 16:54:10 -07:00
Tim Abbott	55d7947b76	email_mirror: Use ERROR_BOT for error reporting. This has the nice side effect of getting rid of the now-unnecessary ADMIN_DOMAIN from this codepath -- we really just want whichever realm ERROR_BOT is in.	2016-08-22 21:28:01 -07:00
Umair Khan	2c07f1b19a	Use NOREPLY_EMAIL_ADDRESS if email gateway not enabled. This fixes a regression where missed message emails would not be sent at all in the event that EMAIL_GATEWAY_PATTERN was unset. The overall experience still isn't great, but it's better than crashing. Fixes: #1411 [commit message expanded by tabbott]	2016-07-31 20:38:18 -07:00
Eklavya Sharma	e6502710b6	Change exception.message to str(exception). The 'message' attribute in Exception has been deprecated. It has been removed in python 3.	2016-07-13 16:00:46 -07:00
Umair Khan	395e053ce3	Revert "Revert "Extract reply from email."" This reverts commit `f1ba3ded42`.	2016-07-13 11:24:18 -07:00
Eklavya Sharma	f1ba3ded42	Revert "Extract reply from email." This reverts commit `f1f48f305e`. The use of sklearn unfortunately caused a substantial slowdown to the Zulip provisioning process, which didn't seem worth it for a relatively minor feature.	2016-07-10 11:30:30 -07:00
Umair Khan	f1f48f305e	Extract reply from email.	2016-07-08 10:58:25 -07:00
Tim Abbott	98db1d996f	email_mirror: Fix some indentation issues.	2016-07-07 10:02:08 -07:00
Eklavya Sharma	5e81a4d93f	zerver/lib/email_mirror.py: Improve annotation in python 3. Add asserts and if statements to help mypy.	2016-07-07 10:01:30 -07:00
Eklavya Sharma	4f221c21a0	zerver/lib/email_mirror.py: Improve subject extraction. Improve subject extraction in process_message.	2016-07-07 09:55:23 -07:00
medullaskyline	e2eb4e0b7e	Annotate zerver/lib/email_mirror.py. [With some fixes from @sharmaeklavya2].	2016-06-20 15:58:40 -07:00
Eklavya Sharma	c654c4032d	zerver/models.py: Annotate get_display_recipient. get_display_recipient's annotation clashes with other wrong annotations. Fix those wrong annotations. Since get_display_recipient returns a Union, use isinstance checks and casts to make mypy checks succeed.	2016-06-12 23:34:57 +05:30
Tim Abbott	54022ac204	Fix unnecessary whitespace between , and ).	2016-05-04 14:16:53 -07:00

1 2 3 4

167 Commits