zulip

Commit Graph

Author	SHA1	Message	Date
Steve Howell	c6448263c3	refactor: Add MentionBackend. We will eventually use this to avoid redundant queries. The diff is slightly noisy here, but there are no logic changes.	2021-12-30 11:28:15 -08:00
Steve Howell	ea252ab53e	refactor: Convert FullNameInfo to a dataclass. As part of this we no longer query for email, which is a vestige of when we used emails to identify users on the frontend.	2021-12-30 11:28:15 -08:00
Steve Howell	f5fc348786	mypy: Add explicit types for dbdata references. When our handlers specifically reference self.md.zulip_db_data, we now use an explicit type. We probably want a more robust solution here, such as a semgrep rule.	2021-12-30 11:28:15 -08:00
Steve Howell	df84892aad	markdown: Convert DbData to a dataclass.	2021-12-30 11:28:15 -08:00
Steve Howell	4e551f8279	refactor: Introduce get_stream_name_map. We only need a name -> id map, and the FullNameInfo type was a lie.	2021-12-30 11:28:15 -08:00
Steve Howell	c04a8097f3	mypy: Add EmojiInfo type. We now serialize still_url as None for non-animated emojis, instead of omitting the field. The webapp does proper checks for falsiness here. The mobile app does not yet use the field (to my knowledge). We bump the API version here. More discussion here: https://chat.zulip.org/#narrow/stream/378-api-design/topic/still_url/near/1302573	2021-12-30 11:28:14 -08:00
Alex Vandiver	6a40c17ccf	markdown: CSS-escape preview links. This adds `soupsieve` as an explicit dependency, but intentionally does not adjust the provision version, as it was already an indirect dependency.	2021-10-26 18:17:23 -07:00
Alex Vandiver	52f74bbd9b	markdown: Run URL preview links through camo. Not proxying these requests through camo is a security concern. Furthermore, on the desktop client, any embed image which is hosted on a server with an expired or otherwise invalid certificate will trigger a blocking modal window with no clear source and a confusing error message; see zulip/zulip-desktop#1119. Rewrite all `message_embed_image` URLs through camo, if it is enabled.	2021-10-26 18:17:23 -07:00
Anders Kaseorg	58920affd4	python: Remove re.UNICODE flag (redundant in Python 3). https://docs.python.org/3/library/re.html#re.A Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-22 13:42:29 -07:00
Alex Vandiver	9381a3bd45	linkifiers: Support URL percent-encoded bytes. Supporting URL percent-encoded bytes is possible using `%%20`, but this is not necessarily very understandable to end-users, even those that understand percent encoding. Allow `%20` in linkifier URL format strings, and transform them into `%%20` in the pattern just before they are applied in markdown translation. Care must be taken here, such that already-escaped `%`s are not escaped an extra time. We do this before rendering, and not before storage, as a simplification; the JS-side linkifier at present only understands `%(foo)s` and thus needs no changes, and to avoid an un-escaping pass before showing in the admin UI.	2021-10-22 13:00:20 -07:00
Anders Kaseorg	4839b7ed27	url_preview: Interpret og:image relative to full page URL. og:image is supposed to be an absolute URL, but some sites incorrectly provide a relative URL. In this case, it makes more sense to interpret it relative to the full page URL after redirects, rather than relative to just the domain part of the page URL before redirects. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-21 12:20:37 -07:00
Alex Vandiver	db934be064	CVE-2021-41115: Use re2 for user-supplied linkifier patterns. Zulip attempts to validate that the regular expressions that admins enter for linkifiers are well-formatted, and only contain a specific subset of regex grammar. The process of checking these properties (via a regex!) can cause denial-of-service via backtracking. Furthermore, this validation itself does not prevent the creation of linkifiers which themselves cause denial-of-service when they are executed. As the validator accepts literally anything inside of a `(?P<word>...)` block, any quadratic backtracking expression can be hidden therein. Switch user-provided linkifier patterns to be matched in the Markdown processor by the `re2` library, which is guaranteed constant-time. This somewhat limits the possible features of the regular expression (notably, look-head and -behind, and back-references); however, these features had never been advertised as working in the context of linkifiers. A migration removes any existing linkifiers which would not function under re2, after printing them for posterity during the upgrade; they are unlikely to be common, and are impossible to fix automatically. The denial-of-service in the linkifier validator was discovered by @erik-krogh and @yoff, as GHSL-2021-118.	2021-10-04 21:26:24 +00:00
Tim Abbott	545911b051	markdown: Remove useless locless_schemes check. This check was copied from upstream python-markdown's "safe mode" before they removed that feature. The upstream history is that they introduced this check in `2db5d1c8e4`, which was not a complete security check, and then added the immediately following check (with an allowlist of schemes) in `0b4ffbb60e`. Their first, incomplete check provides no security benefit and makes the code hard to reason about, so we remove it.	2021-09-09 09:03:40 -07:00
rht	c24ab8c4d3	markdown: Expand list of safelisted URL schemes to match HTML spec.	2021-09-09 09:03:40 -07:00
Anders Kaseorg	66ad6a4583	docs: Inline code spans are not blocks. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-07 16:12:39 -07:00
Anders Kaseorg	646c04eff2	Rename default branch to ‘main’. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-06 12:56:35 -07:00
Alex Vandiver	4d428490fd	outgoing_http: Use OutgoingSession subclasses in more places. This adds the X-Smokescreen-Role header to proxy connections, to track usage from various codepaths, and enforces a timeout. Timeouts were kept consistent with their previous values, or set to 5s if they had none previously.	2021-09-01 05:34:13 -07:00
Priyansh Garg	1e51c23494	markdown: Remove unnecessary checks for zulip_message. This commits removes some unnecessary checks for `self.md.zulip_message`, which were put there historically, as earlier we used to add the additional properties like mentions_user_ids, alert_words, etc. to Message dict only. These were later moved to MessageRenderingResult class in commit `75cea329b` but the checks weren't removed. This is important because while rendering the messages imported from other chat tools (like Rocket.Chat), the Message dict is not passed to the markdown, due to which the checks for `self.md.zerver_message` fails and hence, things like user mentions, stream/topic mentions are not rendered in the imported messages properly.	2021-08-31 16:53:42 -07:00
Anders Kaseorg	4206e5f00b	python: Remove locally dead code. These changes are all independent of each other; I just didn’t feel like making dozens of commits for them. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-19 01:51:37 -07:00
Anders Kaseorg	806494da06	markdown: Stream and parse incrementally in fetch_open_graph_image. This way we can stop reading as soon as we get to the body. Also, send an Accept header, check that the request was actually successful, use lxml.etree.iterparse instead of a broken hand-rolled state machine, and support XHTML, all for negative 28 lines of code. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-05 09:17:32 -07:00
Priyansh Garg	0a875c1c4c	markdown: Fix jpeg extension in `IMAGE_EXTENSIONS`.	2021-08-05 08:54:02 -07:00
Anders Kaseorg	42fa62e563	Revert "time_widget: Make the generated time string more readable." This reverts commit `1965584eec`. This syntax has a bad interaction with table syntax and needs to be rethought. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-03 16:45:31 -07:00
Ganesh Pawar	1965584eec	time_widget: Make the generated time string more readable. Before: <time:2021-07-14T00:14:00-07:00> After: <time:2021-07-14\|00:14:00\|UTC-07:00> Fixes #19205	2021-08-02 23:17:01 -07:00
Anders Kaseorg	3665deb93a	python: Remove unnecessary intermediate lists. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	162e9d6c0b	fenced_code: Optimize FENCE_RE to fix cubic worst-case complexity. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-22 16:40:44 -07:00
Anders Kaseorg	c56440ded0	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-05 12:23:06 -07:00
Priyansh Garg	94a2be06f3	markdown: Use a shared variable for IMAGE_EXTENSION.	2021-07-02 11:22:55 -07:00
akshatdalton	44a298b671	minor: Use `OUTER_CAPTURE_GROUP` variable instead of string value.	2021-06-25 17:43:27 -07:00
akshatdalton	490f6b6880	markdown: Extract regex in local variables.	2021-06-25 17:43:01 -07:00
PIG208	75cea329b4	markdown: Refactor out additional properties added to Message. This adds a new class called MessageRenderingResult to contain the additional properties we added to the Message object (like alert_words) as well as the rendered content to ensure typesafe reference. No behavioral change is made except changes in typing. This is a preparatory change for adding django-stubs to the backend. Related: #18777	2021-06-24 18:14:53 -07:00
akshatdalton	c507931ac8	refactor: Export non-markdown logic in mention.py.	2021-06-14 13:26:30 -07:00
Wesley Aptekar-Cassels	d5ba94082a	markdown: Increase max rendered message length to 1MB. This should help with #17425, where messages with lots of LaTeX are lost, due to the large expansion factor. This isn't a total fix for this - large messages with lots of LaTeX can still end up larger than 1MB, and rendering could timeout, but this fix should help significantly. 1MB is still small enough that I don't expect we'll run into any DOS problems - my testing didn't show any problems rendering messages that contain ~1MB of LaTeX.	2021-06-03 10:10:35 -07:00
akshatdalton	7df62ebbaf	settings: Make `MAX_MESSAGE_LENGTH` a server-level setting. This will offer users who are self-hosting to adjust this value. Moreover, this will help to reduce the overall time taken to test `test_markdown.py` (since this can be now overridden with `override_settings` Django decorator). This is done as a prep commit for #18641.	2021-06-03 09:26:28 -07:00
akshatdalton	832c763c38	minor: Remove unnecessary `__init__` method in `InlineInterestingLinkProcessor`. Subclass `Treeprocessor` takes care of the `__init__` method.	2021-05-26 17:13:03 -07:00
Anders Kaseorg	bac96cae80	markdown: Fix Dropbox image previews. ?dl=1 causes Dropbox to send Content-Type: application/binary, which can’t be interpreted by Camo. Use ?raw=1 instead. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-25 13:42:29 -07:00
akshatdalton	503247ebfa	refactor: Add class `CompiledInlineProcessor` to de-duplicate code.	2021-05-23 14:30:22 -07:00
akshatdalton	78f26b6031	minor: Use `super` to initialize subclass.	2021-05-23 14:30:22 -07:00
akshatdalton	18203d8af3	markdown: Silence user group mention inside blockquotes.	2021-05-18 17:31:25 -07:00
akshatdalton	0245b590e9	markdown: Add support for user group silent mention. Prior to this, we only supported direct mention to the user groups. This commit extends that support to silent mention for the user groups. A related test case is also added. Fixes: #11711.	2021-05-18 17:31:25 -07:00
akshatdalton	f56fca308a	mention: Refactor `USER_GROUP_MENTIONS_RE` and simplify its related code path. Earlier, USER_GROUP_MENTIONS_RE was: r"(?<![^\s\'\"\(,:<])@(\[^\]+\)" For the syntax: foo, this was unnecessarily capturing it as foo* and the extraction of `foo` was done using another helper function: `extract_user_group`. This is now changed as: r"(?<![^\s\'\"\(,:<])@(\(?P<match>[^\]+)\*)" and extraction of `foo` can be done just by using the named capture group `match`. This change also helps to simplify its related code path.	2021-05-18 17:31:25 -07:00
akshatdalton	d5a36ac5e2	mention: Refactor `MENTIONS_RE` and simplify its related code path. Earlier, MENTIONS_RE was: r"(?<![^\s\'\"\(,:<])@(?P<silent>_?)(?P<match>\\[^\]+\\)" For the syntax: foo, this was unnecessarily capturing it as foo* and adding extra operation for the extraction of `foo`. This is now changed as: r"(?<![^\s\'\"\(,:<])@(?P<silent>_?)(\\(?P<match>[^\]+)\\*)" and extraction of `foo` can be done just by using the named capture group `match`. This change also helps to simplify its related code path.	2021-05-18 17:31:25 -07:00
akshatdalton	a9d89b3c56	minor: Convert `unicode_emoji_regex` to uppercase. Following the convention, we use uppercase for regex. Also, `unicode_emoji_regex` is given a conventional name ending with `*_RE`: `UNICODE_EMOJI_RE`.	2021-05-18 17:31:25 -07:00
akshatdalton	ffc4724287	minor: Convert `emoticon_regex` to uppercase. Following the convention, we use uppercase for regex. Also, `emoticon_regex` is given a conventional name ending with `*_RE`: `EMOTICON_RE`.	2021-05-18 17:31:25 -07:00
akshatdalton	9f6e6709d3	minor: Convert `user_group_mentions` to uppercase. Following the convention, we use uppercase for regex. Also, `user_group_mentions` is given a conventional name ending with `*_RE`: `USER_GROUP_MENTIONS_RE`.	2021-05-18 17:31:25 -07:00
akshatdalton	0a01b1b28e	minor: Convert `find_mentions` to uppercase. Following the convention, we use uppercase for regex. Also, `find_mentions` is given a conventional name ending with `*_RE`: `MENTIONS_RE`.	2021-05-18 17:31:25 -07:00
Ganesh Pawar	529f72fa3f	markdown: Add support for sms and tel links. Fixes #18390	2021-05-10 15:15:34 -07:00
akshatdalton	55f4996f16	markdown: Fix silent wildcard mentions bug. A message containing wildcard mention when quoted (which is turned into a silent mention) or message with silent wildcard mention notifies the users by sending desktop, sound, and missed message email notifications. This is clearly a bug which is fixed by this commit. Fixes: #18354.	2021-05-10 12:19:40 -07:00
Anders Kaseorg	d0c6f4f400	python: Strip leading and trailing spaces from docstrings. This is enforced by Black ≥ 21.4b0. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-07 22:42:39 -07:00
Anders Kaseorg	995389b4c1	markdown: Don’t apply further Markdown processing to KaTeX output. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-01 15:43:35 -07:00
Wesley Aptekar-Cassels	6b7a3fb74a	markdown: Rewrite all external images to use Camo. Requesting external images is a privacy risk, so route all external images through Camo. Tweaked by tabbott for better test coverage, more comments, and to fix bugs.	2021-04-30 10:36:16 -07:00
Anders Kaseorg	e3f2ffa681	docs: Capitalize “Markdown” consistently. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-26 09:31:08 -07:00
Anders Kaseorg	178736c8eb	docs: Fix spelling errors caught by codespell. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-26 09:31:08 -07:00
Tim Abbott	2e928a0853	markdown: Remove logic for creating markdown engines for all realms. This logic likely never ran due to a combination of bugs. * Running `maybe_update_markdown_engines` unconditionally meant that `if md_engine_key in md_engines` was likely always true. * Introduced in 65838bb: DEFAULT_MARKDOWN_KEY could never be in md_engines, so should we have ever reached that code path, we'd have tried to rebuild all markdown engines every time. And it also wasn't clearly helpful -- because we fetch all linkifiers for a realm on every request anyway, we don't really save database queries by doing a bulk fetch on startup, and doing so would likely result in a material regression to Zulip's overall startup time that we were creating markdown engines for large numbers of realms in bulk during process startup.	2021-04-13 09:18:18 -07:00
Abhijeet Prasad Bodas	52a86d9604	linkifiers: Use dictionaries for internal structures. This change does not affect the API in anyway. All internal code now uses dictionaries to denote a linkifier, instead of tuples.	2021-04-05 18:16:08 -07:00
Abhijeet Prasad Bodas	68fe912c63	refactor: Rename most of "filter" to "linkifier". After this only the database table, events, and API endpoints remain.	2021-04-05 18:14:07 -07:00
Abhijeet Prasad Bodas	f896a7667f	refactor: Update some uses of "filter" to "linkifier". This updates some comments and local variables which could be changed without breaking other stuff.	2021-04-05 18:14:07 -07:00
Anders Kaseorg	ceb7e2d2bd	Revert "markdown: Add support to shorten GitHub links." This reverts commit `9c6d8d9d81` (#16916). This feature has known bugs, and also wants some design changes to make it customizable like linkifiers, so we’re retargeting this to post-4.x. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-02 15:52:34 -07:00
Sumanth V Rao	e12f682e2e	markdown: Include text & url in `topic_links` parameter of our API. The linkifier code now includes both the shortened text and the expanded URL, sorted by the order of the occurrence in a topic. This list is passed back in the `topic_links` parameter of the /messages and the /events APIs. topic_links earlier vs now: earlier: ['https://www.google.com', 'https://github.com/zulip/zulip/32'] now: [{'url': 'https://www.google.com', 'text': 'https://www.google/com}, {'url': 'https://github.com/zulip/zulip/32', 'text': '#32'}] Similarly, the topic_links local echo logic in the frontend now returns back an object. Fixes: #17109.	2021-03-30 15:53:07 -07:00
m-e-l-u-h-a-n	1b8a5a3344	markdown: Refactor backend logic for handling user mention. Backend logic for handling user mention was cluttered because it was handled at two stages first in get_possible_mentions_info while fetching mention data based on the messsage and then later in UserMentionPattern which handles processing of text for mention. Ideally UserMentionPattern should depend on get_possible_mentions_info only for data but there was a shared logic between these two that made it hard to debug any possible bugs. Updates in this commit make both of these functions coherent in terms of logic and also add appropiate comments to improve readability of these functions. There was also a hidden bug that if a user A is mentioned in with @name\|id then @invalid\|id again mentioned A because of the way we handled mentions earlier. It is solved as a result of this refactor and appropiate test has been added for this. This has been tested manually as well as by adding new test to address missing case.	2021-03-28 16:52:48 -07:00
m-e-l-u-h-a-n	2699048208	markdown: Extend user mention syntax to support user_id for mentioning. Extend our markdown system to support mentioning of users by id also. Following these changes, it would be possible to mention users with @\|user_id and silently mention using @_\|user_id. Main intention for extending the mention syntax is to make it convenient for bots to mention a users using their ids. It is to be noted that previous syntax are also supported. Documentation tweaked by tabbott for better readability. The changes were tested manually in development server, and also by adding some new backend and frontend tests. Fixes: #17487.	2021-03-25 00:44:56 -07:00
akshatdalton	9c6d8d9d81	markdown: Add support to shorten GitHub links. We add support to shorten links and test their shortening in well-organized, clean manner that makes it trivial to extend the GitHub approach for GitLab and perhaps other services. We only shorten basic types of GitHub links (issue, PR, commit) that fit a set of simple common patterns; the default behaviour of Autolink is kept for everything else. Logic added in frontend and backend Markdown Processor is identical. This makes easy to extend the logic for other services like GitLab. Fixes #11895.	2021-03-25 00:39:44 -07:00
m-e-l-u-h-a-n	830c4acedc	markdown: Fix invalid mention bug for stream and stream topic mention. Modifies `StreamPattern` and `StreamTopicPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid stream. Due to this all the subsequent mention failed, even if they were valid. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. Due to above changes verbose_compile is no longer used for precompiling STREAM_LINK_REGEX, STREAM_TOPIC_LINK_REGEX as adds ^(.?) and (.?)$ which cause extra overhead of matching pattern which is not required. With new InlineProcessor these extra patterns at beggining and end are not required. So, StreamPattern and StreamTopicPattern now define their own __init__ method for precompiling the regex. Fixes #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
m-e-l-u-h-a-n	dadbba0c25	markdown: Fix invalid mention bug for user group mention. Modifies `UserGroupMentionPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid user group. Due to this all the subsequent user group mention failed, even if they were valid. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. This was reported as issue #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
m-e-l-u-h-a-n	c8979a5100	markdown: Fix invalid mention bug for user mention. Modifies `UserMentionPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid user. Due to this all the subsequent user mention failed. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. This was reported as issue #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
Anders Kaseorg	23088b5d78	markdown: Fix some Any annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-17 18:41:46 -07:00
Anders Kaseorg	9864907985	mypy: Correct typing.re imports to typing. Although typing.re exists in the standard library, mypy has never recognized it. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-17 18:41:46 -07:00
Anders Kaseorg	0a09c9dfd7	markdown: Re-enable typeshed stub for Python-Markdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-03-10 11:49:59 -08:00
Anders Kaseorg	b728727d9d	timeout: Remove unnecessary varargs support. Mypy can check it this way. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-15 17:05:28 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	ae0afa2390	markdown: Explode config dict. Commit `434094e599` (#11321) changed this from an Extension to a subclass of Markdown, so it no longer has any reason to use a config dict structured like that of an Extension. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-05 10:52:31 -05:00
Anders Kaseorg	4ca66e7278	timezone: Correct common_timezones dictionary. The changes are as follows: • Fix one day offset in all western zones. • Correct CST from -64800 to -21600 and CDT from -68400 to -18000. • Disambiguate PST in favor of -28000 over +28000. • Add GMT, UTC, WET, previously excluded for being at offset 0. • Add ACDT, AEDT, AKST, MET, MSK, NST, NZDT, PKT, which the previous code did not find. • Remove numbered abbreviations -12, …, +14, which are unnecessary. • Remove MSD and PKST, which are no longer used. Hardcode the dict and verify it with a test, so that future discrepancies won’t go silently unnoticed. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-01-27 15:23:15 -08:00
Anders Kaseorg	fbf8ce0305	markdown: Add types for extra Markdown members. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	b48bdc65b9	markdown: Fix AlertWordNotificationProcessor.run type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	9573f6dc00	markdown: Fix build_block_parser type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	060036dfd5	markdown: Merge build_engine into Markdown constructor. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
Anders Kaseorg	08c64f5cfa	markdown: Fix imports for compatibility with typeshed stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:27 -08:00
akshatdalton	620e9cbf72	markdown: Fix merging of separate quotations. Initally, when writing two or more quotes, having a blank line in between them, merges those quotes. This created confusion especially in "quote and reply". This commit fixes such issues. Now two or more quotes having a blank line in between them, will not get merged. This change is correct both for usability and for improving our compatibility with CommonMark. Fixes #14379.	2020-10-30 15:21:15 -07:00
Anders Kaseorg	1352f2f233	python: Replace manual quote_plus usage with urlencode. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	e513b75e86	markdown: Remove handler for old bug with incompatible twitter library. See commit `8b002040e0` and #86. The development environment bug that necessitated this handler has long been irrelevant. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:30:26 -07:00
akshatdalton	287c4ed2bb	markdown: Fix Youtube and Vimeo preview overriding markdown link titles bug. Initially markdown titles were overridden by Youtube and Vimeo preview titles. But now it will check if any markdown title is present to replace Youtube or Vimeo preview titles, if preview of linked websites is enabled. Fixes #16100	2020-10-19 12:06:13 -07:00
Anders Kaseorg	7f69c1d3d5	python: Catch specific exceptions from requests. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Aman Agrawal	1b5b82e712	RealmFilterPattern: Mark converted content as AtomicString. If multiple filters match the same string, we run into an infinite loop of converting string into urls. To fix it, we mark the matched string as atomic after first conversion.	2020-09-22 15:10:38 -07:00
Alex Vandiver	03c6a0f182	markdown: Skip other common file extensions in linking, sort.	2020-09-21 21:03:29 -07:00
Alex Vandiver	4361ce1246	markdown: Use tlds package to keep updated list of TLDs. Also remove a useage of "blacklist."	2020-09-21 21:03:29 -07:00
Anders Kaseorg	dfab09b17d	markdown: Replace hyperlink requirement with urllib.parse. The previous code only worked by accident and hyperlink 20.0.0 breaks it. >>> hyperlink.parse("example.com").replace(scheme="https") DecodedURL(url=URL.from_text('https:example.com')) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-13 15:37:28 -07:00
Anders Kaseorg	02725d32dd	python: Rewrite list() as []. Suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	a276eefcfe	python: Rewrite dict() as {}. Suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	ab120a03bc	python: Replace unnecessary intermediate lists with generators. Mostly suggested by the flake8-comprehension plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Alex Vandiver	5b74de7be7	markdown: Add another twitter code to retry-later. Error code 131 is documented to be an arbitrary server error on Twitter's side; add it to the retry list.	2020-08-18 10:32:24 -07:00
Alex Vandiver	092ed87ae3	markdown: Cache Twitter 403 responses that are semi-permanent. `03ca3afbc2` added more codes that are equivalent to 404's; this adds to the list of cache-as-None codes a couple which are equivalent to 403's. It does not comprise _all_ possible 403-like codes -- many of them are "the client is not OK," which is relevant to log as an error still.	2020-08-18 10:32:24 -07:00
Anders Kaseorg	768f9f93cd	docs: Capitalize Markdown consistently. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Alex Vandiver	2928bbc8bd	logging: Report stack_info on logging.exception calls. The exception trace only goes from where the exception was thrown up to where the `logging.exception` call is; any context as to where _that_ was called from is lost, unless `stack_info` is passed as well. Having the stack is particularly useful for Sentry exceptions, which gain the full stack trace. Add `stack_info=True` on all `logging.exception` calls with a non-trivial stack; we omit `wsgi.py`. Adjusts tests to match.	2020-08-11 10:16:54 -07:00
Alex Vandiver	90cdda9836	markdown: Link the twitter response code docs inline.	2020-07-31 10:35:41 -07:00
Alex Vandiver	03ca3afbc2	markdown: Treat more twitter codes as also permanent failures. Per the API documentation[1], the following codes all correspond to HTTP 404: - `34`: Sorry, that page does not exist. The specified resource was not found. - `144`: No status found with that ID. The requested Tweet ID is not found (if it existed, it was probably deleted) - `421`: This Tweet is no longer available. The Tweet cannot be retrieved. This may be for a number of reasons. - `422`: This Tweet is no longer available because it violated the Twitter Rules. The Tweet is not available in the API. Treat all of these identically. [1] https://developer.twitter.com/en/docs/basics/response-codes	2020-07-31 10:35:41 -07:00
Alex Vandiver	fc141af30e	markdown: Factor out twitter error code handling.	2020-07-31 10:35:41 -07:00
Vinit Singh	308cf8ac00	markdown: Inline Youtube previews instead of appending it to the end. This change makes our handling of youtube-url previews consistent with how we handle our inline images. This allows the previews to render next to the paragraph that links to the youtube video. Follow-up to PR #15773.	2020-07-22 16:11:17 -07:00
Rohitt Vashishtha	fb2946aaf6	Revert "markdown: Remove paragraphs that only contain a tweet link." This reverts commit `d3770153a6`. We do not show a link to the tweet in our preview, so we should revert to our previous behavior for now.	2020-07-17 14:30:22 -07:00
Rohitt Vashishtha	d3770153a6	markdown: Remove paragraphs that only contain a tweet link. This is similar to our behavior with image previews, and helps reduce clutter in the final rendered html. We add the string 'Tweet: ' to our existing tests so those tests remain the same.	2020-07-13 12:24:32 -07:00

1 2 3 4

167 Commits