zulip

Commit Graph

Author	SHA1	Message	Date
Anders Kaseorg	b0e569f07c	ruff: Fix SIM102 nested `if` statements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-01-23 11:18:36 -08:00
Trident Pancake	c6ea673cc9	markdown: Update max inline preview from 10 to 24. The max inline preview limit was previously increased to 10 by #20789. However, as issue #23624 shows, it's still causing confusion for users when they include more than 10 links. Bump this limit up to 24, which is a multiple of the 4 image preview per line logic.	2023-01-18 14:58:00 -05:00
Anders Kaseorg	17300f196c	ruff: Fix ISC003 Explicitly concatenated string. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-01-04 16:25:07 -08:00
Anders Kaseorg	2c5e114f8b	ruff: Fix ISC001 Implicitly concatenated string literals on one line. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-01-04 16:25:07 -08:00
Anders Kaseorg	b5cad938b8	ruff: Fix DTZ006 `datetime.datetime.fromtimestamp()` without `tz` argument. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-01-04 16:25:07 -08:00
Anders Kaseorg	f7e97b1180	ruff: Fix PLW0602 Using global but no assignment is done. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-01-04 16:25:07 -08:00
Zixuan James Li	a3a0103d86	markdown: Calculate linkifier precedence in topics. This uses the linkifier index among the list of linkifiers in the replacement as the priority to order the replacement order for patterns in the topic. This avoids having multiple overlapping matches that each produce a link. The linkifier with the lowest id will be prioritized when its pattern overlaps with another. Linkifiers are prioritized over raw URLs. Note that the same algorithm is used for local echoing and the backend markdown processor. Fixes #23715. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-12-13 15:16:20 -08:00
Zixuan James Li	4602c34108	markdown: Correctly retrieve indices for repeated matches. The same pattern being matched multiple times in a topic cannot be properly ordered using topic_name.find(match_text) and etc. when there are multiple matches of the same pattern in the topic. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-12-13 15:16:20 -08:00
Anders Kaseorg	e634e3276a	ruff: Fix PLC0414 Import alias does not rename original package. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-12-04 22:11:24 -08:00
Anders Kaseorg	73c4da7974	ruff: Fix N818 exception name should be named with an Error suffix. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-11-17 16:52:00 -08:00
Anders Kaseorg	924d530292	ruff: Fix N813 camelcase imported as lowercase. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-11-16 09:29:11 -08:00
Anders Kaseorg	2876ae8e48	ruff: Fix N803 argument name should be lowercase. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-11-16 09:29:11 -08:00
Anders Kaseorg	46955da3a0	ruff: Fix ANN204 missing return type annotation for __init__. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-11-16 09:29:11 -08:00
Alex Vandiver	5d42a0cb00	linkifiers: Support %20 in URLs for topic links. `9381a3bd45` added support for linkifier pattern URLs containing `%20`-style escapes, but only did so for the codepath which is used in the message body -- topic links did not understand them. Expand the support to include when they are substituted into topics.	2022-10-11 14:31:13 -07:00
Anders Kaseorg	8230324068	markdown: Store ZulipMarkdown in members with the right type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-10-06 15:15:10 -07:00
Anders Kaseorg	3cf91e9e45	markdown: Rename our Markdown subclass to ZulipMarkdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-10-06 15:15:10 -07:00
Anders Kaseorg	97be895cf0	markdown: Remove Optional from zulip_rendering_result type. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-10-06 15:15:10 -07:00
Anders Kaseorg	d01c99d2ee	markdown: Add missing None check in InlineInterestingLinkProcessor. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-10-06 15:15:10 -07:00
Anders Kaseorg	4a61e36def	CVE-2022-36048: Rewrite only specific local links to relative. Due to mismatches between the URL parsers in Python and browsers, it was possible to hoodwink rewrite_local_links_to_relative into generating links that browsers would interpret as absolute. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-08-24 16:29:09 -07:00
N-Shar-ma	ef044b8697	markdown: Update characters allowed before @ and stream mentions. Now the following characters are allowed before @-mentions and stream references (starting with #) for proper rendering - {, [, /. This commit makes the markdown rendering consistent with autocomplete (anything that is autocompleted is also rendered properly).	2022-08-06 19:29:39 -07:00
Mateusz Mandera	2299aa3382	docs: Remove some outdated references to thumbnailing.md doc. The doc was removed in `405bc8dabf`	2022-07-12 17:44:24 -07:00
Anders Kaseorg	8246ee7c57	mypy: Add links to specific mypy bugs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-07-05 17:54:58 -07:00
Anders Kaseorg	dc33a0ae67	markdown: Rewrite include plugin without markdown-include. markdown-include is GPL licensed. Also, rewrite it as a block processor, so that it works correctly inside indented blocks. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-06-26 17:36:31 -07:00
Anders Kaseorg	6331a314d4	Correctly hyphenate “non-”. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 22:10:31 -07:00
Anders Kaseorg	a2825e5984	python: Use Python 3.8 typing.{Protocol,TypedDict}. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Alex Vandiver	351bdfaf78	preview: Use cache only as a non-durable cache, not an IPC. The `get_link_embed_data` / `link_embed_data_from_cache` pair as introduced in `c93f1d4eda` uses the cache as a temporary store inside of the `embed_links` worker; this means that it must be durable storage, or the worker will stall and re-fetch the same links to preview them. Switch to plumbing through the fetched URL embed data as an parameter to the Markdown evaluation which uses them, rather than using the cache as an intermediary. This frees up the cache to be merely a non-durable cache. As a side-effect, this removes get_cache_with_key, and link_embed_data_from_cache which was its only callsite.	2022-04-15 14:48:12 -07:00
Alex Vandiver	327ff9ea0f	preview: Use a dataclass for the embed data. This is significantly cleaner than passing around `Dict[str, Any]` all of the time.	2022-04-15 14:48:12 -07:00
Alex Vandiver	661c333377	markdown: Use named parameters to add_a helper. This has enough parameters that it benefits from making which is which explicit.	2022-04-15 14:48:12 -07:00
Alex Vandiver	452a30305d	markdown: Clarify url parameter of "add_a" helper.	2022-04-15 14:48:12 -07:00
Alex Vandiver	1ac0035f8c	markdown: Allow whitespace overlaps in topic linkifiers. `prepare_linkifier_pattern`, as of `db934be064`, adds a match to the end of the regex, of either the end of string, or a non-word character -- this is in place of a negative look-ahead, which is no longer possible in re2. This causes the regex to consume trailing whitespace, and thus not be able to match twice in succession with `pattern.finditer` -- "#1234 #5678" fails to match because the space is consumed by the first match of the regex. Rather than use `pattern.finditer`, write own own version, which rewinds over the non-word character consumed after the match, if any. This allows the same "after" non-word character to also satisfy the "before" of the next match. Fixes #21502.	2022-03-22 15:40:03 -07:00
Anders Kaseorg	1629d6bfb3	python: Reformat with Black 22 (stable). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-18 18:03:13 -08:00
Anders Kaseorg	df304c40da	markdown: Use built-in hex formatting for unicode_emoji_to_codepoint. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-03 11:00:04 -08:00
Puneeth Chaganti	d55c137277	emoji: Add yellow_large_square and green_large_square emojis. Wordle has recently become a thing and it uses green, yellow and white (or black in dark mode) large square unicode characters to let people share their gameplay. Zulip converts the white and black large square unicode characters to emojis, but not the green and yellow ones. This causes the Wordle grid to be misaligned when shared on Zulip. This commit adds green and yellow large square emojis to our emoji list to fix the problem.	2022-02-02 16:26:31 -08:00
Puneeth Chaganti	6beb84b553	emoji: Use str.rjust to pad codepoint strings instead of a loop.	2022-02-02 16:26:30 -08:00
Puneeth Chaganti	0eeb74b3c2	emoji: Fix minor typo in unicode_emoji_to_codepoint comment.	2022-02-02 16:26:28 -08:00
Alex Vandiver	19f891968d	markdown: Increase the maximum number of image previews per message. The limit here is purely to prevent breakage in case of a pathological number of images in a single message; 5 images is entirely possible in a reasonable message, and causes user confusion when they are not expended. Increase the limit to 10 per message.	2022-01-14 11:30:07 -08:00
Steve Howell	4adcaf92f7	refactor: Attach get_stream_name_map to MentionData. This diff looks slightly noisy, but the main chunk of code that we moved here has the same logic as before, and it just gets realm_id from MentionBackend now, instead of having our markdown processor have to supply it. We basically want MentionData to be the gatekeeper of mention data, and then we delegate backend tasks to MentionBackend. Soon we will add a cache to MentionBacked, which will justify this change a bit more.	2021-12-30 11:28:15 -08:00
Steve Howell	c6448263c3	refactor: Add MentionBackend. We will eventually use this to avoid redundant queries. The diff is slightly noisy here, but there are no logic changes.	2021-12-30 11:28:15 -08:00
Steve Howell	ea252ab53e	refactor: Convert FullNameInfo to a dataclass. As part of this we no longer query for email, which is a vestige of when we used emails to identify users on the frontend.	2021-12-30 11:28:15 -08:00
Steve Howell	f5fc348786	mypy: Add explicit types for dbdata references. When our handlers specifically reference self.md.zulip_db_data, we now use an explicit type. We probably want a more robust solution here, such as a semgrep rule.	2021-12-30 11:28:15 -08:00
Steve Howell	df84892aad	markdown: Convert DbData to a dataclass.	2021-12-30 11:28:15 -08:00
Steve Howell	4e551f8279	refactor: Introduce get_stream_name_map. We only need a name -> id map, and the FullNameInfo type was a lie.	2021-12-30 11:28:15 -08:00
Steve Howell	c04a8097f3	mypy: Add EmojiInfo type. We now serialize still_url as None for non-animated emojis, instead of omitting the field. The webapp does proper checks for falsiness here. The mobile app does not yet use the field (to my knowledge). We bump the API version here. More discussion here: https://chat.zulip.org/#narrow/stream/378-api-design/topic/still_url/near/1302573	2021-12-30 11:28:14 -08:00
Alex Vandiver	6a40c17ccf	markdown: CSS-escape preview links. This adds `soupsieve` as an explicit dependency, but intentionally does not adjust the provision version, as it was already an indirect dependency.	2021-10-26 18:17:23 -07:00
Alex Vandiver	52f74bbd9b	markdown: Run URL preview links through camo. Not proxying these requests through camo is a security concern. Furthermore, on the desktop client, any embed image which is hosted on a server with an expired or otherwise invalid certificate will trigger a blocking modal window with no clear source and a confusing error message; see zulip/zulip-desktop#1119. Rewrite all `message_embed_image` URLs through camo, if it is enabled.	2021-10-26 18:17:23 -07:00
Anders Kaseorg	58920affd4	python: Remove re.UNICODE flag (redundant in Python 3). https://docs.python.org/3/library/re.html#re.A Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-22 13:42:29 -07:00
Alex Vandiver	9381a3bd45	linkifiers: Support URL percent-encoded bytes. Supporting URL percent-encoded bytes is possible using `%%20`, but this is not necessarily very understandable to end-users, even those that understand percent encoding. Allow `%20` in linkifier URL format strings, and transform them into `%%20` in the pattern just before they are applied in markdown translation. Care must be taken here, such that already-escaped `%`s are not escaped an extra time. We do this before rendering, and not before storage, as a simplification; the JS-side linkifier at present only understands `%(foo)s` and thus needs no changes, and to avoid an un-escaping pass before showing in the admin UI.	2021-10-22 13:00:20 -07:00
Anders Kaseorg	4839b7ed27	url_preview: Interpret og:image relative to full page URL. og:image is supposed to be an absolute URL, but some sites incorrectly provide a relative URL. In this case, it makes more sense to interpret it relative to the full page URL after redirects, rather than relative to just the domain part of the page URL before redirects. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-21 12:20:37 -07:00
Alex Vandiver	db934be064	CVE-2021-41115: Use re2 for user-supplied linkifier patterns. Zulip attempts to validate that the regular expressions that admins enter for linkifiers are well-formatted, and only contain a specific subset of regex grammar. The process of checking these properties (via a regex!) can cause denial-of-service via backtracking. Furthermore, this validation itself does not prevent the creation of linkifiers which themselves cause denial-of-service when they are executed. As the validator accepts literally anything inside of a `(?P<word>...)` block, any quadratic backtracking expression can be hidden therein. Switch user-provided linkifier patterns to be matched in the Markdown processor by the `re2` library, which is guaranteed constant-time. This somewhat limits the possible features of the regular expression (notably, look-head and -behind, and back-references); however, these features had never been advertised as working in the context of linkifiers. A migration removes any existing linkifiers which would not function under re2, after printing them for posterity during the upgrade; they are unlikely to be common, and are impossible to fix automatically. The denial-of-service in the linkifier validator was discovered by @erik-krogh and @yoff, as GHSL-2021-118.	2021-10-04 21:26:24 +00:00
Tim Abbott	545911b051	markdown: Remove useless locless_schemes check. This check was copied from upstream python-markdown's "safe mode" before they removed that feature. The upstream history is that they introduced this check in `2db5d1c8e4`, which was not a complete security check, and then added the immediately following check (with an allowlist of schemes) in `0b4ffbb60e`. Their first, incomplete check provides no security benefit and makes the code hard to reason about, so we remove it.	2021-09-09 09:03:40 -07:00

1 2 3 4

154 Commits