zulip

Commit Graph

Author	SHA1	Message	Date
Alex Vandiver	db934be064	CVE-2021-41115: Use re2 for user-supplied linkifier patterns. Zulip attempts to validate that the regular expressions that admins enter for linkifiers are well-formatted, and only contain a specific subset of regex grammar. The process of checking these properties (via a regex!) can cause denial-of-service via backtracking. Furthermore, this validation itself does not prevent the creation of linkifiers which themselves cause denial-of-service when they are executed. As the validator accepts literally anything inside of a `(?P<word>...)` block, any quadratic backtracking expression can be hidden therein. Switch user-provided linkifier patterns to be matched in the Markdown processor by the `re2` library, which is guaranteed constant-time. This somewhat limits the possible features of the regular expression (notably, look-head and -behind, and back-references); however, these features had never been advertised as working in the context of linkifiers. A migration removes any existing linkifiers which would not function under re2, after printing them for posterity during the upgrade; they are unlikely to be common, and are impossible to fix automatically. The denial-of-service in the linkifier validator was discovered by @erik-krogh and @yoff, as GHSL-2021-118.	2021-10-04 21:26:24 +00:00
Sahil Batra	88346949b5	messages: Do not allow mentioning system user groups. We do not allow mentioning system user groups for now because this can lead to circumventing the wildcard mention restrictions. It will be enabled once we add a setting to control that. This is implemented by just ignoring it as one of the mentioned user group even if the message content inlcudes the mention syntax for it and the message is sent normally. We still keep the for_mention parameter for accessing user group while sending email and push notifications as mentioning system user groups will be allowed in future. This commit also removes the test for email notifications for system user groups as we are not allowing mentioning them. This commit is only for backend change as we already exclude the system groups from mention typeaheads and other UI.	2021-09-09 11:25:33 -07:00
Sahil Batra	550d97a593	settings: Refactor callers of do_change_user_setting to pass acting_user.	2021-09-08 11:04:44 -07:00
Dinesh	9443e01a5d	refactor: Rename do_set_user_display_setting to do_set_user_setting.	2021-09-07 10:16:42 -07:00
Anders Kaseorg	646c04eff2	Rename default branch to ‘main’. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-09-06 12:56:35 -07:00
Anders Kaseorg	162e9d6c0b	fenced_code: Optimize FENCE_RE to fix cubic worst-case complexity. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-22 16:40:44 -07:00
Anders Kaseorg	fb3ddf50d4	python: Fix mypy no_implicit_reexport errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-16 14:02:31 -07:00
Anders Kaseorg	1ae56e466b	cache: Fix typing for post_save and post_delete flush handlers. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-16 13:14:04 -07:00
Mateusz Mandera	6ec5a9698f	test_markdown: Fix unnecessarily hard-coded user id.	2021-07-13 08:31:11 -07:00
PIG208	75cea329b4	markdown: Refactor out additional properties added to Message. This adds a new class called MessageRenderingResult to contain the additional properties we added to the Message object (like alert_words) as well as the rendered content to ensure typesafe reference. No behavioral change is made except changes in typing. This is a preparatory change for adding django-stubs to the backend. Related: #18777	2021-06-24 18:14:53 -07:00
akshatdalton	c507931ac8	refactor: Export non-markdown logic in mention.py.	2021-06-14 13:26:30 -07:00
Wesley Aptekar-Cassels	d5ba94082a	markdown: Increase max rendered message length to 1MB. This should help with #17425, where messages with lots of LaTeX are lost, due to the large expansion factor. This isn't a total fix for this - large messages with lots of LaTeX can still end up larger than 1MB, and rendering could timeout, but this fix should help significantly. 1MB is still small enough that I don't expect we'll run into any DOS problems - my testing didn't show any problems rendering messages that contain ~1MB of LaTeX.	2021-06-03 10:10:35 -07:00
akshatdalton	7df62ebbaf	settings: Make `MAX_MESSAGE_LENGTH` a server-level setting. This will offer users who are self-hosting to adjust this value. Moreover, this will help to reduce the overall time taken to test `test_markdown.py` (since this can be now overridden with `override_settings` Django decorator). This is done as a prep commit for #18641.	2021-06-03 09:26:28 -07:00
akshatdalton	6143cb6e73	test_markdown: Use assertTrue/assertFalse instead of assertEqual.	2021-06-02 17:20:45 -07:00
Anders Kaseorg	bac96cae80	markdown: Fix Dropbox image previews. ?dl=1 causes Dropbox to send Content-Type: application/binary, which can’t be interpreted by Camo. Use ?raw=1 instead. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-25 13:42:29 -07:00
Abhijeet Prasad Bodas	352634a851	tests: Consistently use assert_length helper. This helper does some nice things like printing out the data structure incase of failure.	2021-05-19 11:55:56 -07:00
akshatdalton	18203d8af3	markdown: Silence user group mention inside blockquotes.	2021-05-18 17:31:25 -07:00
akshatdalton	0245b590e9	markdown: Add support for user group silent mention. Prior to this, we only supported direct mention to the user groups. This commit extends that support to silent mention for the user groups. A related test case is also added. Fixes: #11711.	2021-05-18 17:31:25 -07:00
akshatdalton	55f4996f16	markdown: Fix silent wildcard mentions bug. A message containing wildcard mention when quoted (which is turned into a silent mention) or message with silent wildcard mention notifies the users by sending desktop, sound, and missed message email notifications. This is clearly a bug which is fixed by this commit. Fixes: #18354.	2021-05-10 12:19:40 -07:00
Anders Kaseorg	544bbd5398	docs: Fix capitalization mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-10 09:57:26 -07:00
Wesley Aptekar-Cassels	6b7a3fb74a	markdown: Rewrite all external images to use Camo. Requesting external images is a privacy risk, so route all external images through Camo. Tweaked by tabbott for better test coverage, more comments, and to fix bugs.	2021-04-30 10:36:16 -07:00
Arun Sankar	146b32d63a	test users: Add an escape char to a test username. Changed the name of the test-user cordelia from `Cordelia Lear` to `Cordelia, Lear's daughter`. This change will enable us to test users with escape characters in their names. I also updated the Node, Puppeteer, Backend tests and Fixtures to support this change.	2021-04-13 11:42:06 -07:00
Tim Abbott	2e928a0853	markdown: Remove logic for creating markdown engines for all realms. This logic likely never ran due to a combination of bugs. * Running `maybe_update_markdown_engines` unconditionally meant that `if md_engine_key in md_engines` was likely always true. * Introduced in 65838bb: DEFAULT_MARKDOWN_KEY could never be in md_engines, so should we have ever reached that code path, we'd have tried to rebuild all markdown engines every time. And it also wasn't clearly helpful -- because we fetch all linkifiers for a realm on every request anyway, we don't really save database queries by doing a bulk fetch on startup, and doing so would likely result in a material regression to Zulip's overall startup time that we were creating markdown engines for large numbers of realms in bulk during process startup.	2021-04-13 09:18:18 -07:00
Abhijeet Prasad Bodas	52a86d9604	linkifiers: Use dictionaries for internal structures. This change does not affect the API in anyway. All internal code now uses dictionaries to denote a linkifier, instead of tuples.	2021-04-05 18:16:08 -07:00
Abhijeet Prasad Bodas	68fe912c63	refactor: Rename most of "filter" to "linkifier". After this only the database table, events, and API endpoints remain.	2021-04-05 18:14:07 -07:00
Abhijeet Prasad Bodas	f896a7667f	refactor: Update some uses of "filter" to "linkifier". This updates some comments and local variables which could be changed without breaking other stuff.	2021-04-05 18:14:07 -07:00
Sumanth V Rao	e12f682e2e	markdown: Include text & url in `topic_links` parameter of our API. The linkifier code now includes both the shortened text and the expanded URL, sorted by the order of the occurrence in a topic. This list is passed back in the `topic_links` parameter of the /messages and the /events APIs. topic_links earlier vs now: earlier: ['https://www.google.com', 'https://github.com/zulip/zulip/32'] now: [{'url': 'https://www.google.com', 'text': 'https://www.google/com}, {'url': 'https://github.com/zulip/zulip/32', 'text': '#32'}] Similarly, the topic_links local echo logic in the frontend now returns back an object. Fixes: #17109.	2021-03-30 15:53:07 -07:00
Mateusz Mandera	f329878376	migrations: Subscription.is_user_active denormalization - step one. This adds the is_user_active with the appropriate code for setting the value correctly in the future. In the following commit a migration to backfill the value for existing Subscriptions will be added. To ensure correct user_profile.is_active handling also in tests, we replace all direct .is_active mutation with calls to appropriate functions.	2021-03-30 09:19:03 -07:00
shanukun	459710a897	refactor: Make acting_user a mandatory kwarg for do_set_realm_property.	2021-03-29 15:51:45 -07:00
m-e-l-u-h-a-n	1b8a5a3344	markdown: Refactor backend logic for handling user mention. Backend logic for handling user mention was cluttered because it was handled at two stages first in get_possible_mentions_info while fetching mention data based on the messsage and then later in UserMentionPattern which handles processing of text for mention. Ideally UserMentionPattern should depend on get_possible_mentions_info only for data but there was a shared logic between these two that made it hard to debug any possible bugs. Updates in this commit make both of these functions coherent in terms of logic and also add appropiate comments to improve readability of these functions. There was also a hidden bug that if a user A is mentioned in with @name\|id then @invalid\|id again mentioned A because of the way we handled mentions earlier. It is solved as a result of this refactor and appropiate test has been added for this. This has been tested manually as well as by adding new test to address missing case.	2021-03-28 16:52:48 -07:00
m-e-l-u-h-a-n	2699048208	markdown: Extend user mention syntax to support user_id for mentioning. Extend our markdown system to support mentioning of users by id also. Following these changes, it would be possible to mention users with @\|user_id and silently mention using @_\|user_id. Main intention for extending the mention syntax is to make it convenient for bots to mention a users using their ids. It is to be noted that previous syntax are also supported. Documentation tweaked by tabbott for better readability. The changes were tested manually in development server, and also by adding some new backend and frontend tests. Fixes: #17487.	2021-03-25 00:44:56 -07:00
m-e-l-u-h-a-n	830c4acedc	markdown: Fix invalid mention bug for stream and stream topic mention. Modifies `StreamPattern` and `StreamTopicPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid stream. Due to this all the subsequent mention failed, even if they were valid. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. Due to above changes verbose_compile is no longer used for precompiling STREAM_LINK_REGEX, STREAM_TOPIC_LINK_REGEX as adds ^(.?) and (.?)$ which cause extra overhead of matching pattern which is not required. With new InlineProcessor these extra patterns at beggining and end are not required. So, StreamPattern and StreamTopicPattern now define their own __init__ method for precompiling the regex. Fixes #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
m-e-l-u-h-a-n	dadbba0c25	markdown: Fix invalid mention bug for user group mention. Modifies `UserGroupMentionPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid user group. Due to this all the subsequent user group mention failed, even if they were valid. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. This was reported as issue #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
m-e-l-u-h-a-n	c8979a5100	markdown: Fix invalid mention bug for user mention. Modifies `UserMentionPattern` to inherit from InlineProcessor instead of Pattern. This change is done because Pattern stopped checking for matching patterns as soon as it found a match which was not a valid user. Due to this all the subsequent user mention failed. This bug was only present in backend renderring due to markdown.inlinepatterns.Pattern. This was reported as issue #17535. These changes were tested locally in dev server and by adding some new markdown tests to test these.	2021-03-23 01:28:30 -07:00
Mateusz Mandera	d91d3a05b9	tests: Use do_create_realm where possible. Using do_create_realm should be preferred over manual creation where possible, as it creates more realistic data.	2021-03-14 08:50:02 -07:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	5028c081cb	python: Merge concatenated string literals that Black would uglify. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	8ba95063d5	test_markdown: Construct FencedBlockPreprocessor with a real Markdown. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-11-10 15:54:28 -08:00
akshatdalton	620e9cbf72	markdown: Fix merging of separate quotations. Initally, when writing two or more quotes, having a blank line in between them, merges those quotes. This created confusion especially in "quote and reply". This commit fixes such issues. Now two or more quotes having a blank line in between them, will not get merged. This change is correct both for usability and for improving our compatibility with CommonMark. Fixes #14379.	2020-10-30 15:21:15 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	d81a93cdf3	requirements: Upgrade markdown to 3.3.1. Upstream has slightly changed the whitespace around stashes. Take this opportunity to clean up the extra blank lines we were outputting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-19 11:54:14 -07:00
Anders Kaseorg	6564540d15	docs: Fix some spelling errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Aman Agrawal	1b5b82e712	RealmFilterPattern: Mark converted content as AtomicString. If multiple filters match the same string, we run into an infinite loop of converting string into urls. To fix it, we mark the matched string as atomic after first conversion.	2020-09-22 15:10:38 -07:00
Sumanth V Rao	033351609d	markdown: Add data-codehilite-language attr for fenced code. When converting fenced code markdown, we add the language (if specified) in a data-attribute by tweaking the HTML generated. Doing so, allows the frontend to make use of this attr to display view-in-playground option for codeblocks. We use pygments to get the lexer subclass name and use that instead of directly using the language in the data-attribute. Doing so, helps us map different language aliases (like `js` and `javascript`) into a common variable (like `JavaScript`) - and avoids the client from dealing with multiple tags corresponding to the same language. The html structure for a message like this: ``` js ..content.. ``` would now be: <div class="codehilite" data-codehilite-language="JavaScript"> <pre>..content..</pre> </div> Tests and fixtures amended.	2020-09-14 21:25:19 -07:00
palash	f2f8034b76	test_markdown: Refactor mock.patch to assertLogs. Replaced mock.patch with assertLogs for testing log outputs in file zerver/tests/test_markdown.py	2020-09-12 11:04:51 -07:00
Anders Kaseorg	61d0417e75	python: Replace ujson with orjson. Fixes #6507. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:55:12 -07:00
Anders Kaseorg	768f9f93cd	docs: Capitalize Markdown consistently. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Anders Kaseorg	60a25b2721	docs: Fix spelling errors caught by codespell. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Mohit Gupta	7d574795f1	tests: Remove unnecessary print statments. This removes spam in test-backend output caused by print statement.	2020-07-22 17:12:28 -07:00

1 2

70 Commits