zulip

Commit Graph

Author	SHA1	Message	Date
Zixuan James Li	fd9a0f4274	typing: Apply trivial none-checks with assertions as necessary. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-06-23 19:25:48 -07:00
Anders Kaseorg	a2825e5984	python: Use Python 3.8 typing.{Protocol,TypedDict}. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Zixuan James Li	9d448e73d2	decorator: Remove cachify in favor of lru_cache. `cachify` is essentially caching the return value of a function using only the non-keyword-only arguments as the key. The use case of the function in the backend can be sufficiently covered by `functools.lru_cache` as an unbound cache. There is no signficant difference apart from `cachify` overlooking keyword-only arguments, and `functools.lru_cache` being conveniently typed. Signed-off-by: Zixuan James Li <359101898@qq.com>	2022-04-14 12:44:35 -07:00
Mateusz Mandera	d800ac33a0	push_notifications: Send user_uuid to the push bouncer. Fixes #18017. In previous commits, the change to the bouncer API was introduced to support this and then a series of migrations added .uuid to UserProfiles. Now the code for self-hosted servers that makes requests to the bouncer is changed to make use of it.	2022-03-14 17:47:30 -07:00
Anders Kaseorg	b0ce4f1bce	docs: Fix many spelling mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 18:51:06 -08:00
Anders Kaseorg	27977eddeb	export: Use tar -C to switch directories. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-12-17 08:01:53 -08:00
Tim Abbott	31842e1377	export: Fix empty realm_icons directory in single-user exports.	2021-12-10 12:05:34 -08:00
Steve Howell	9a39ca217f	user export: Show less info for recipients. For PM and huddles, show full names but no emails or other crufty fields.	2021-12-09 17:20:01 -08:00
Steve Howell	6a5c407b05	user export: Be more selective about exported messages.	2021-12-09 17:20:01 -08:00
Steve Howell	fa654fd7a0	user export: Ignore realm icon and logo. These are not considered to be "personal" info, even if you upload them, so we don't export them. Generally the only folks who upload these are admins, who can easily get them in other ways. In fact, anybody can get these via the app.	2021-12-09 17:20:01 -08:00
Steve Howell	8f991f8eb1	export: Make sure messages are sorted across files. We now ensure that all message ids are sorted BEFORE we split them into batches. We now do a few extra "slim" queries to get message ids up front. But, now, when we divide them into batches, we no longer run 2 or 3 different complicated queries in a loop. We just basically hydrate our message ids, so `write_message_partials` should be easy to reason about. This change also means that for tiny realms with < 1000 messages you will always have just one json file, since we aggregate the ids from the queries before batching.	2021-12-09 12:22:34 -08:00
Steve Howell	cef0e11816	export: Add get_id_list_gently_from_database. This is slightly overkill for the single-user use case, but for small queries it's barely any overhead, and it's a nice abstraction.	2021-12-09 12:22:34 -08:00
Steve Howell	8ea320812f	user exports: Chunkify messages in sorted order. This accomplishes a few things: * It extracts `chunkify` rather than having us clumsily track chunking-related stuff in a big loop that is doing other stuff. * It makes it so that all message ids in message-000001.json < message-000002.json. * It makes it easier for us to customize the messages we send to a single user (coming soon). BTW we probably have a slicker version of chunkify somewhere in our codebase, but I couldn't remember where.	2021-12-09 12:22:34 -08:00
Steve Howell	2a73964e16	user export: Add reactions. We may eventually try to attach these to the messages in the message-NNNNNN.json files, but for now they're fine in user.json.	2021-12-09 12:22:34 -08:00
Steve Howell	f810833df5	export: Improve export_usermessages_batch. We no longer jankily read our input file into an "output" variable. Instead, we do things in a type-safe way.	2021-12-09 08:36:40 -08:00
Steve Howell	5c1e8cb8dc	mypy: Add MessagePartial TypedDict.	2021-12-09 08:36:40 -08:00
Steve Howell	09c57a3f9f	export: Log more consistently and sort ids. Now all file writes go through our three helper functions, and we consistently write a single log message after the file gets written. I killed off write_message_exports, since all but one of its callers can call write_table_data, which automatically sorts data. In particular, our Message and UserMessage data will now be sorted by ids.	2021-12-09 08:36:40 -08:00
Steve Howell	6ec49951c6	minor: Avoid creating intermediate list for message_ids. This probably just postpones the list creation until Django builds the "IN" query, but semantically it's good to work in sets where we don't have any meaningful ordering of the list that gets used.	2021-12-08 16:12:54 -08:00
Steve Howell	f8ed099d3c	export: Sort table data for most tables. This affects most of our tables, but it excludes table(s) like Message that go through kind of unique codepaths.	2021-12-08 16:12:54 -08:00
Steve Howell	a1d3f12e53	refactor: Extract write_table_data(). The immediate benefit of this is stronger mypy checks (avoiding the ugly union caused by message files). The subsequent commit will add sorting. We have test coverage on all these lines insofar as if you comment out the lines, tests will explode (i.e. more than superficial line coverage).	2021-12-08 16:12:54 -08:00
Steve Howell	c76ca2d0df	export: Sort records.json files by path.	2021-12-08 16:12:54 -08:00
Steve Howell	2ef38e3d48	refactor: Extract write_records_json_file.	2021-12-08 16:12:54 -08:00
Steve Howell	b79cfc19ab	user export: Broaden query for RealmAuditLog. We now check acting_user as well as modified_user to see if a row pertains to our exported user.	2021-12-08 16:01:38 -08:00
Steve Howell	927b04368e	minor: Use virtual_parent for custom fetchers. The distinction here wasn't super meaningful due to the way we order our "elif" statements, but we want to reserver "normal_parent" for the majority of use cases, where you simply tell the Config what the "foreign_key" is.	2021-12-08 15:58:07 -08:00
Steve Howell	50120a9387	export: Remove config parameter for custom fetchers.	2021-12-08 15:58:07 -08:00
Steve Howell	54a3a423e5	mypy: Fix CustomFetch=Any hack.	2021-12-08 15:58:07 -08:00
Steve Howell	4128b52ac5	export: Rename custom fetchers.	2021-12-08 15:58:07 -08:00
Steve Howell	a2c4931316	exports: Use realm for RealmAuditLog in realm exports. For realm-wide exports, there is no reason to query inefficiently against a list of modified users. We move the Config out of the common child configs.	2021-12-08 15:58:07 -08:00
Steve Howell	8dd3c1038f	exports: Rename parent_key to include_rows. Even though Django usually treats foo__in and foo_id__in identically for filters where foo is a ForeignKey type, we want to insist on somewhat more consistent syntax, because we have the odd combo of type and type_id in Recipient, where type_id is kinda like a foreign key, but not a ForeignKey. So we assert for now that all our include_rows values end in "_id__in".	2021-12-08 15:58:07 -08:00
Steve Howell	02207f47d5	minor: Move code blocks to be alphabetical.	2021-12-08 15:58:07 -08:00
Steve Howell	aae9f1b6f5	export: Make Config errors more clear.	2021-12-08 15:58:07 -08:00
Steve Howell	6d09eab285	export: Export file images for single users. We don't have automated test coverage on this yet, but below are the results from manual testing. Note that we include the realm icon and logo even though they were not created by Cordelia. ./manage.py export_single_user cordelia@zulip.com $ (cd /tmp/zulip-export-4v3mo802/ && find .) . ./emoji ./emoji/2 ./emoji/2/emoji ./emoji/2/emoji/images ./emoji/2/emoji/images/3.jpg ./emoji/records.json ./messages-000001.json ./realm_icons ./realm_icons/2 ./realm_icons/2/night_logo.original ./realm_icons/2/night_logo.png ./realm_icons/2/icon.png ./realm_icons/2/icon.original ./realm_icons/records.json ./avatars ./avatars/2 ./avatars/2/c5125af0447f4d66ce34c1b32eac75ac27ebe0e7.original ./avatars/2/c5125af0447f4d66ce34c1b32eac75ac27ebe0e7.png ./avatars/records.json ./uploads ./uploads/2 ./uploads/2/68 ./uploads/2/68/xyEkC5dTIp8m42_6HJ3kBfdt ./uploads/2/68/xyEkC5dTIp8m42_6HJ3kBfdt/denver.jpg ./uploads/2/96 ./uploads/2/96/ol5WE6RTUntvuPDSpJUrYTim ./uploads/2/96/ol5WE6RTUntvuPDSpJUrYTim/denver.jpg ./uploads/records.json ./user.json	2021-12-07 11:16:52 -08:00
Steve Howell	b8d9143318	export: Validate emoji paths. (We lift the RealmEmoji query to be used by both local and S3 storage helpers.)	2021-12-07 11:16:52 -08:00
Steve Howell	ef6d9b10d2	refactor: Extract get_emoji_path.	2021-12-07 11:16:52 -08:00
Steve Howell	5a41904201	export: Add handle_system_bots flag. We will set this to False for single-user exports.	2021-12-07 11:16:52 -08:00
Steve Howell	0e19deb558	exports: Limit s3 upload exports with path_id checks.	2021-12-07 11:16:52 -08:00
Steve Howell	f6cbf931ae	refactor: Pass attachments to export_uploads_from_local. The next commit will use attachments in the s3 path.	2021-12-07 11:16:52 -08:00
Steve Howell	03f40a64d4	refactor: Pass valid_hashes to export_files_from_s3.	2021-12-07 11:16:52 -08:00
Steve Howell	15bc677f35	export: Pass users to export_avatars_from_local.	2021-12-07 11:16:52 -08:00
Steve Howell	42ecabe967	export: Add check_metadata flag.	2021-12-06 15:09:37 -08:00
Steve Howell	0166f13d83	s3 exports: Validate user metadata for all assets. This preps us to download assets for just a single user.	2021-12-06 15:09:37 -08:00
Steve Howell	946ab22bba	refactor: Lift users query to caller. This preps us to reuse this code for single users (after a few more subsequent changes).	2021-12-06 15:09:37 -08:00
Steve Howell	b0e5c1d3b9	export: Remove paranoid assertion. There are tactical reasons to remove this assertion. Basically, the reason it's safe to remove is that it's been around a long time and we would have seen this operationally. Also, the check to make sure that the S3 filename thingy matches the avatar hash is a much stronger check. We will soon restore a stronger version of this check that applies to all of our asset types (emojis/avatars/etc.).	2021-12-06 15:09:37 -08:00
Steve Howell	59951ae52b	refactor: Move metadata checks for s3 export. This technically broadens the check for user_profile_id, but we write that metadata on every record.	2021-12-06 15:09:37 -08:00
Steve Howell	a27a6a4548	refactor: Inline _check_key_metadata. This was only called in one place.	2021-12-06 15:09:37 -08:00
Steve Howell	db39948be5	refactor: Pass in flavor to export_files_from_s3.	2021-12-06 15:09:37 -08:00
Steve Howell	f4354c896b	refactor: Pass object_key into export_files_from_s3. This makes it easier to read the calling code and see the big picture of how the four asset types are organized. I also handle uploads first, to be similar to the local code. This code is well tested--you can modify any of the callers to pass in a wrong value of `object_key` and get a failing test.	2021-12-06 15:09:37 -08:00
Steve Howell	4088be6017	import/export: Add UserStatus table. (We support both realm and single-user exports.)	2021-12-06 13:27:25 -08:00
Steve Howell	45addcd506	cosmetic: Sort DATE_FIELDS in export.py source.	2021-12-06 13:27:25 -08:00
Steve Howell	f83907d3bb	export: Add MutedUser table. Note that the import was already implemented, but its test was flawed.	2021-12-06 13:27:25 -08:00
Steve Howell	946e8064a3	user exports: Add several tables to the tarball. We add the following tables to the user export: AlertWord CustomProfileFieldValue RealmAuditLog Service UserActivity UserActivityInterval UserCount UserGroup UserHotspot UserPresence UserTopic Except for UserCount, we achieve this by sharing code with the realm export via add_user_profile_child_configs. UserCount is handled slightly differently than realm exports due to which key we trigger off. It's possible that RealmAuditLog is incomplete for single users, since we may also want rows where they are the acting_user. This commit finds rows where they are the modified_user. For non-admins I believe it's rarely the case that they are the actor, and they will tend to be the modified user if the two fields are different at all. For admins it's arguable we want to see both changes they enacted as well as changes that affected them.	2021-11-25 08:36:43 -08:00
Shlok Patel	893c9bc896	export: Remove `--delete-after-upload` flag in realm export. For export realm following changes have been made: - `./manage.py export --upload` would delete `.tar.gz` and unpacked dir - `./manage.py export` would only delete `unpacked dir` Besides, we have removed `--delete-after-upload` as we have set it as the default. Fixes #20081	2021-11-03 11:14:02 -07:00
Mateusz Mandera	73a6f2a1a7	auth: Add support for using SCIM for account management.	2021-10-14 12:29:10 -07:00
Anders Kaseorg	1e5157b66c	user_groups: Add a recursive group membership model. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-10-13 10:08:06 -07:00
Abhijeet Prasad Bodas	4455dac424	models: Use default db table name for UserTopic. Also update the realm import/export code and tests accordingly.	2021-09-17 12:14:28 -07:00
Abhijeet Prasad Bodas	2aea944a7e	models: Rename UserTopic.date_muted to last_updated. This is a follow-up to #19388. We will in the future allow patch requests to change the visibility of an existing topic, so `last_updated` is better name for this field. This commit does not affect the API or events in any way, but only the database.	2021-09-17 12:14:28 -07:00
sahil839	7d64a9053b	models: Ensure every realm has a RealmUserDefault object. Because we create all realms with do_create_user (including in the test suite), we just need to change that function, add a migration for existing realms, and ensure the data import code path correctly creates these objects. Note that the import code path will create a RealmUserDefault row with default values if it is not present in the import data, which is important for importing data from other tools like Slack.	2021-09-09 10:28:44 -07:00
Anders Kaseorg	4206e5f00b	python: Remove locally dead code. These changes are all independent of each other; I just didn’t feel like making dozens of commits for them. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-19 01:51:37 -07:00
Anders Kaseorg	1bdb7b1141	mypy: Add boto3-stubs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-09 20:32:19 -07:00
Anders Kaseorg	bfdb2f4628	export: Fix error message generation in _check_key_metadata. There is no key.name. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-09 20:32:19 -07:00
Abhijeet Prasad Bodas	683c8507e4	models: Remove MutedTopic alias for UserTopic. Part of #19272	2021-07-28 10:25:54 -07:00
Mateusz Mandera	43329b6a34	bots: Pass realm to get_system_bot calls in export/import.	2021-07-26 15:33:13 -07:00
Tim Abbott	09b5bb7930	export: Improve error message for missing registrations.	2021-07-24 17:36:15 -07:00
PIG208	66b1a4e7ca	backend: Add None-checks with assertions and if-elses. This fixes a batch of mypy errors of the following format: 'Item "None" of "Optional[Something]" has no attribute "abc"'	2021-07-24 15:00:21 -07:00
sahil839	d7dfe80454	models: Add RealmUserDefault table for realm-level default of settings. This table will be used to store the realm-level default of display and notification settings for new users.	2021-07-14 14:35:04 -07:00
Abhijeet Prasad Bodas	1709428cff	models: Create MissedMessageEmailEntry table. This will be used to store the missedmessage events received during the waiting time for email notifications (which is currently 2 minutes, hardcoded). The change in `test_retention` is because we've set `on_delete=CASCADE` for the message field this table. The new query is like so: ``` DELETE FROM "zerver_missedmessageemailentry" WHERE "zerver_missedmessageemailentry"."message_id" IN ( 1545, 1546, 1547, 1548, 1549, 1550, 1551, 1552, 1553 ) ```	2021-07-13 17:21:37 -07:00
Anders Kaseorg	544bbd5398	docs: Fix capitalization mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-10 09:57:26 -07:00
Anders Kaseorg	6060d0d364	docs: Add missing space to compound verbs “log in”, “set up”, etc. Noun: backup, checkout, cleanup, login, logout, setup, shutdown, signup, timeout. Verb: back up, check out, clean up, log in, log out, set up, shut down, sign up, time out. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-04-26 09:31:08 -07:00
Mateusz Mandera	1a8ad796f8	models: Replace __id syntax with _id where possible. model__id syntax implies needing a JOIN on the model table to fetch the id. That's usually redundant, because the first table in the query simply has a 'model_id' column, so the id can be fetched directly. Django is actually smart enough to not do those redundant joins, but we should still avoid this misguided syntax. The exceptions are ManytoMany fields and queries doing a backward relationship lookup. If "streams" is a many-to-many relationship, then streams_id is invalid - streams__id syntax is needed. If "y" is a foreign fields from X to Y: class X: y = models.ForeignKey(Y) then object x of class X has the field x.y_id, but y of class Y doesn't have y.x_id. Thus Y queries need to be done like Y.objects.filter(x__id__in=some_list)	2021-04-22 14:53:00 -07:00
Sumanth V Rao	40228972b9	models/realm: Add a model for storing realm playground information. Tweaked exports.py to add the config object there so that our export tool can include the table when exporting. Also includes all the changes required to import the new table from the exported data. Helper function `get_realm_playgrounds` added to fetch all playgrounds in a realm. Tests amended.	2021-04-07 08:20:53 +05:30
Abhijeet Prasad Bodas	3bfcaa3968	mute user: Add backend infrastructure code. Adds backend code for the mute users feature. This is just infrastructure work (database interactions, helpers, tests, events, API docs etc) and does not involve any behavioral/semantic aspects of muted users. Adds POST and DELETE endpoints, to keep the URL scheme mostly consistent in terms of `users/me`. TODOs: 1. Add tests for exporting `zulip_muteduser` database table. 2. Add dedicated methods to python-zulip-api to be used in place of the current `client.call_endpoint` implementation.	2021-04-06 18:44:08 -07:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
ryanreh99	1c370a975c	refactor: Access a bucket by calling `zerver.lib.uploads.get_bucket`.	2020-10-28 21:52:08 -07:00
Anders Kaseorg	dd48dbd912	docs: Add spaces to “check out”, “log in”, “set up”, “sign up” as verbs. “Checkout”, “login”, “setup”, and “signup” are nouns, not verbs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Anders Kaseorg	bb4fc3c4c7	python: Prefer --flag=option over --flag option. For less inflation by Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	a610bd19a1	python: Simplify away various unnecessary lists and list comprehensions. Loosely inspired by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	ab120a03bc	python: Replace unnecessary intermediate lists with generators. Mostly suggested by the flake8-comprehension plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	1ded51aa9d	python: Replace list literal concatenation with * unpacking. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Tim Abbott	f94a8adf9e	export: Remove duplicate 'analytics' zerver_realm object. This fixes a harmless duplication of data in the Zulip data export format.	2020-08-14 15:45:11 -07:00
arpit551	7568f6f9a8	export: Renamed zerver_analytics to zerver_realm. While exporting analytics data we were using wrong table name 'zerver_analytics' in analytics config. Renamed it with correct table name 'zerver_realm'.	2020-08-14 15:45:11 -07:00
Anders Kaseorg	61d0417e75	python: Replace ujson with orjson. Fixes #6507. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:55:12 -07:00
Anders Kaseorg	03d2540899	export: Post-process authentication_methods BitHandler field to list. A BitHandler object is not JSON serializable, and orjson enforces this. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:47:13 -07:00
Anders Kaseorg	2cf2547b27	export: Add missing datetime fields for post-processing. datetime objects are not ordinarily JSON serializable. While both ujson and orjson have special cases to serialize datetime objects, they do it in different ways. So we want to fix the post-processing code to do its job. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:47:13 -07:00
Anders Kaseorg	60a25b2721	docs: Fix spelling errors caught by codespell. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Tim Abbott	6130a61be0	export: Only print .s with percent_callback to console. The S3 data export tool's upload code path uses this nice boto callback feature for showing a progress bar, which is nice for the management command. It's spammy/broken in production and the backend tests, so we change percent_callback to be a parameter passed in so that it can only be used in the contexts where it makes sense.	2020-07-30 13:14:53 -07:00
Hemanth V. Alluri	0e893b9045	models/drafts: Add a model for storing Draft messages. Also add a Draft object-to-dictionary conversion method. The following commits will provide an API around this model using which our clients can sync drafts across each other (if they so wish too). As of making this commit, we haven't finalized exactly how our clients will use this. See https://chat.zulip.org/#narrow/stream/2-general/topic/drafts For some of the discussion around this model and in general, around this feature. Signed-off-by: Hemanth V. Alluri <hdrive1999@gmail.com>	2020-07-28 17:18:35 -07:00
Steve Howell	318c55e030	export: Export AlertWord table.	2020-07-16 08:50:31 -07:00
Steve Howell	9662af842f	export: Remove stream sanity check. We also remove the post_process_data option. The sanity check is just overkill at this point, since the mechanism to find streams is very direct due to a recent commit.	2020-07-09 11:34:00 -07:00
Steve Howell	54c596cfc4	export: Just export all streams in a realm. Before this change we would only export streams that had actual subscribers, which is usually harmless, but it was mostly a relic of a one time migration that we did when we were cleaning up some dirty data in some of our very early databases (circa 2016). Now we work down the table hierarchy in a more natural way: - get Streams in Realm - get Recipients matching above Streams - get Subscriptions matching above Recipients Note that for per-user exports, I kept the same logic (users -> subscriptions -> recipients -> streams) we had before. One subtle detail here is that we make our final Config blocks--which build the final version of Recipient/Subscription--now hang off of realm_config. Fixes #15146.	2020-07-09 11:33:55 -07:00
Anders Kaseorg	74c17bf94a	python: Convert more percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Now including %d, %i, %u, and multi-line strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	57a80856a5	python: Convert more "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format. Now including %d, %i, %u, and multi-line strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:39:00 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Graham Bleaney	461d5b1a3e	pysa: Introduce sanitizers, models, and inline marking safe. This commit adds three `.pysa` model files: `false_positives.pysa` for ruling out false positive flows with `Sanitize` annotations, `req_lib.pysa` for educating pysa about Zulip's `REQ()` pattern for extracting user input, and `redirects.pysa` for capturing the risk of open redirects within Zulip code. Additionally, this commit introduces `mark_sanitized`, an identity function which can be used to selectively clear taint in cases where `Sanitize` models will not work. This commit also puts `mark_sanitized` to work removing known false postive flows.	2020-06-11 12:57:49 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
whoodes	cea7d713cd	requirements: Upgrade boto to boto3. Fixes: #3490 Contributors include: Author: whoodes <hoodesw@hawaii.edu> Author: zhoufeng1989 <zhoufengloop@gmail.com> Author: rht <rhtbot@protonmail.com>	2020-05-26 23:18:07 -07:00
Anders Kaseorg	bdc365d0fe	logging: Pass format arguments to logging. https://docs.python.org/3/howto/logging.html#optimization Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-02 10:18:02 -07:00
Wyatt Hoodes	5a58b7c549	data exports: Keep deleted export in UI table. It makes sense to keep a deleted export in the table, along with the time of deletion, for auditing reasons.	2020-04-30 13:00:59 -07:00
Wyatt Hoodes	82e7ad8e25	data exports: Handle pending and failed exports. Prior to this change, there were reports of 500s in production due to `export.extra_data` being a Nonetype. This was reproducible using the s3 backend in development when a row was created in the `RealmAuditLog` table, but the export failed in the `DeferredWorker`. This left an entry lying about that was never updated with an `extra_data` field. To fix this, we catch any exceptions in the `DeferredWorker`, and then update `extra_data` to encode the failure. We also fix the fact that we never updated the export UI table with pending exports. These changes also negated the use for the somewhat hacky `clear_success_banner` logic.	2020-04-30 13:00:59 -07:00

1 2 3 4 5 ...

361 Commits