zulip

Commit Graph

Author	SHA1	Message	Date
Mateusz Mandera	2811a1228f	import_util: Make build_message only take kwargs. build_message has a lot of arguments, so it's hard to verify correctness of callers that just try to get the order right. It's much clearer to be explicit via kwargs. mattermost.py and rocketchat.py already do this, so let's bring slack.py and gitter.py up to par.	2022-09-27 15:04:48 -07:00
Matt Keller	fd996c286e	slack: Filter out non-.json files for processing.	2022-09-23 09:59:34 -07:00
rht	a7cff0f091	Slack import: Translate to emoji name to codepoint using iamcal data. Because Slack emoji naming is different from Zulip's. According to https://emojipedia.org/slack/, Slack's emoji shortcodes are derived from https://github.com/iamcal/emoji-data. There are probably some deviations from that dataset, but this PR should at least catch the ones that are identical to iamcal's.	2022-09-17 12:04:07 -07:00
Florian Pritz	a276603766	rocketchat: Deduplicate and ignore huddle rooms with same users. If there are more than 1 room with the same set of users, the import will fail due to a unique constraint on the huddle_hash. Figuring out why and which room is causing this database error is kinda difficult. We deduplicate those cases here and simply merge the rooms together. Note however, that the deduplication does not work as expected so we simply ignore them all together for now and only raise an exception along some logging output. At least this way, it is pretty clear what is wrong and you do not have to wait to get a database error during the actual import. We also ignore empty huddle rooms since those are the duplicates that caused problems for me and if they are empty, ignoring them is easier than trying to get the merge to work. Not sure where those channels come from since we discovered this with production data. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	3677aabcbd	rocketchat: Ignore mention mapping failures. Not sure where those come from since we discovered this with production data. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	c308799133	rocketchat: Only set message content if it exists. Not sure where those come from since we discovered this with production data. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	1cc2764d45	rocketchat: Ignore reactions from non-existant users. Not sure where those come from since we discovered this with production data. Somehow there were reactions with usernames that were old and no longer existed. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	26fe028534	rocketchat: Truncate long stream names. These will lead to an error during import otherwise. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	3a27919b5b	rocketchat: Ignore rocketchat attachments without types. Not sure where those come from since we discovered this with production data. There only was a single instance of this in my entire batch of data in an old message from the time when we started using Rocket.Chat. This might be an old issue or it might require some special settings that were later changed. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	5ec8f4ef09	rocketchat: Ignore missing rocketchat attachments. Not sure where those come from since we discovered this with production data. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Florian Pritz	96fa0991f8	rocketchat: Handle long or invalid rocketchat attachment names. Signed-off-by: Florian Pritz <bluewind@xinu.at>	2022-09-09 16:57:24 -07:00
Mateusz Mandera	5bcf78e0cb	import: Fix timestamp check in long_term_idle_helper. This is supposed to be 60 days, but timestamps are in seconds.	2022-08-29 15:18:00 -07:00
Mateusz Mandera	d350406991	gitter: Make imported Realm start with only GitHub auth enabled. Users will only be able to login via GitHub, because imported users get GitHub's generated noreply email addresses - so this should be the only auth method enabled at first, to avoid confusion.	2022-08-29 11:10:18 -07:00
Mateusz Mandera	eed8800573	long_term_idle_helper: Change all_user_ids arg to an Iterator.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	4c7a9816ff	gitter: Soft deactivate appropriate imported users. We want to use the long_term_idle_helper logic for gitter imports just like we do for slack.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	75f26bb8ff	long_term_idle_helper: Take list of user_ids as arg instead of dicts. Only ["id"] is accessed on the dicts (representing the external tool users). Given that for some tools the id may be under a different name etc. due to different user dicts format, it's best to just pass those ids to the function so that it can stay generalized and not reliant on a specific user dict format.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	7ac31223e8	gitter: Extract get_user_from_message helper.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	c4c270380a	slack: Use get_timestamp_from_message helper function where relevant. get_timestamp_from_message was extracted in the previous commit. We can deduplicate and the code a bit cleaner by using it where appropriate instead of message["ts"].	2022-08-29 11:03:27 -07:00
Mateusz Mandera	9e56e71afe	long_term_idle_helper: Take timestamp_from_message callable arg. message["ts"] is slack-specific. For this to be a general util function it needs to take a callable that will grab a timestamp from the message dict (which has varying formats depending on what we're importing from).	2022-08-29 11:03:27 -07:00
Mateusz Mandera	a86aa13e57	gitter: Extract get_timestamp_from_message function.	2022-08-29 11:03:27 -07:00
Alex Vandiver	1b1faa3907	import_util: Factor out long_term_idle_helper.	2022-08-29 11:03:27 -07:00
Alex Vandiver	842cff5975	gitter: Some users (e.g. from matrix.org) may not have avatar URLs.	2022-08-29 11:03:27 -07:00
Alex Vandiver	e653bb2733	rocketchat: Handle PMs with only one recipient. These are either to a deleted user, or actually to the same user. In any case, treat them as self-messages.	2022-08-09 10:58:58 -07:00
Alex Vandiver	51421f378b	rocketchat: Skip mentions of unknown users. It is apparently possible to have a mention of a user who is not (or no longer?) in the `users.bson` table. Skip such mention for the purposes of Zulip import; there's nothing better for us to do.	2022-08-09 10:58:58 -07:00
Alex Vandiver	28a29e64a0	rocketchat: File upload chunks may exist without their metadata. This is likely an error somewhere in rocketchat's MongoDB "eventual consistency," but there is no problem with skipping the chunks at this step. In the one case where this was observed so far, the upload-id was not referenced in any message -- if it is referenced and has chunks, but has no metadata, we will fail later, at that reference.	2022-08-09 10:58:58 -07:00
Zixuan James Li	6ee0a979f3	import_util: Post-modify date fields with float values. We construct model instances in the import tool solely for the purpose of serializing them with the `model_to_dict` helper that returns a dictionary. Passing `float` to these models' DateTimeField is not accepted by the type checker. Modifying the dictionary instead avoids this typing issue. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-08-01 13:58:12 -07:00
Anders Kaseorg	b945aa3443	python: Use a real parser for email addresses. Now that we can assume Python 3.6+, we can use the email.headerregistry module to replace hacky manual email address parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-07-29 15:47:33 -07:00
Anders Kaseorg	7b4cfcddb3	import_util: Migrate from multiprocessing to ProcessPoolExecutor. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-07-29 15:27:09 -07:00
Zixuan James Li	9bfeebf064	user_profile: Fallback to "" for timezone upon creation. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-06-28 16:05:24 -07:00
Anders Kaseorg	f3254bb558	mattermost: Run html2text as a subprocess. html2text is GPL licensed. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-06-26 17:32:59 -07:00
Zixuan James Li	67fda5516f	import_utils: Fix wrong usage of model_to_dict. The argument `exclude` expects a `list` or `set` of field names, not a `str`. Signed-off-by: Zixuan James Li <p359101898@gmail.com>	2022-06-23 22:09:05 -07:00
Zixuan James Li	63e9ae8389	typing: Apply trivial fixes to adjust edge cases in typing. Add none-checks, rename variables (to avoid redefinition of the same variable with different types error), add necessary type annotations. This is a part of #18777. Signed-off-by: Zixuan James Li <359101898@qq.com>	2022-05-30 12:03:51 -07:00
Anders Kaseorg	a2825e5984	python: Use Python 3.8 typing.{Protocol,TypedDict}. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-27 12:57:49 -07:00
Mateusz Mandera	04fdf3e4d9	import_utils: Fix history_public_to_subscribers being set incorrectly. history_public_to_subscribers wasn't explicitly set when creating streams via build_stream, thus relying on the model's default of False. This lead to public streams being created with that value set to False, which doesn't make sense. We can solve this by inferring the correct value based on invite_only in the build_stream funtion itself - rather than needing to add a flag argument to it. This commit also includes a migration to fix public stream with the wrong history_public_to_subscribers value. Fixes #21784.	2022-04-27 12:08:01 -07:00
Alex Vandiver	2e50ead9d1	data_import: Fix bot email address de-duplication. `4815f6e28b` tried to de-duplicate bot email addresses, but instead caused duplicates to crash: ``` Traceback (most recent call last): File "./manage.py", line 157, in <module> execute_from_command_line(sys.argv) File "./manage.py", line 122, in execute_from_command_line utility.execute() File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/__init__.py", line 413, in execute self.fetch_command(subcommand).run_from_argv(self.argv) File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 354, in run_from_argv self.execute(args, cmd_options) File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 398, in execute output = self.handle(args, **options) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/management/commands/convert_slack_data.py", line 59, in handle do_convert_data(path, output_dir, token, threads=num_threads) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 1320, in do_convert_data ) = slack_workspace_to_realm( File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 141, in slack_workspace_to_realm ) = users_to_zerver_userprofile(slack_data_dir, user_list, realm_id, int(NOW), domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 248, in users_to_zerver_userprofile email = get_user_email(user, domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 406, in get_user_email return SlackBotEmail.get_email(user["profile"], domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 85, in get_email email_prefix += cls.duplicate_email_count[email] TypeError: can only concatenate str (not "int") to str ``` Fix the stringification, make it case-insensitive, append with a dash for readability, and add tests for all of the above.	2022-03-31 11:10:18 -07:00
Steve Howell	8f99894302	streams: Extract stream_color library. This is a pure code move.	2022-03-14 18:01:36 -07:00
Anders Kaseorg	b0ce4f1bce	docs: Fix many spelling mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 18:51:06 -08:00
Priyansh Garg	42f231c85c	data_import: Ignore Rocket.Chat livechat streams/messages. This resolves the issues reported in #20108, major chunk of which were due to the incomplete support for importing the livechat streams/messages in the tool. So, it's best not to import any livechat streams/messages for now until a complete support for importing the same is developed.	2021-11-07 09:50:55 -08:00
Priyansh Garg	17409a78be	data_import: Fix a few KeyError bugs in Rocket.Chat import tool. This commit fixes a few bugs in Rocket.Chat import tool as reported on CZO. Link: https://chat.zulip.org/#narrow/stream/9-issues/topic/Rocketchat.20Import	2021-11-03 16:50:56 -07:00
Priyansh Garg	0c2e4eec20	data_import: Import Rocket.Chat threads as separate topics.	2021-11-01 17:13:35 -07:00
Priyansh Garg	0db9b7287b	data_import: Import Rocket.Chat messages from direct discussions. This commit adds functionality to import messages from the Discussions having direct channels as their parent. As we don't have topics in the PMs, the messages are imported in interleaved form in the imported direct channels/PMs. This was completely unsupported earlier and would have resulted in an error.	2021-11-01 17:09:11 -07:00
Priyansh Garg	5f1e246230	data_import: Import wildcard mention data from Rocket.Chat.	2021-11-01 17:06:15 -07:00
Priyansh Garg	26f16b9eec	data_import: Separate logic for naming Rocket.Chat streams and topics.	2021-11-01 16:48:25 -07:00
rht	58b19761b8	slack import: Fix requests.get usage of get_slack_api_data. We also rewrite the tests using the `responses` module to avoid the problematic mocking that made this bug possible. Fixes #19833.	2021-10-07 11:46:23 -07:00
rht	d8e1409fe5	Slack import: Use Python ZipFile to unzip. This should handle the case when non-ASCII Unicode folder names are created on Windows. Fixes #19899.	2021-10-07 09:24:19 -07:00
Anders Kaseorg	4206e5f00b	python: Remove locally dead code. These changes are all independent of each other; I just didn’t feel like making dozens of commits for them. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-19 01:51:37 -07:00
Priyansh Garg	54452fef6c	data_import: Fix channel mentions in Rocket.Chat import. While the STREAM_LINK_REGEX and STREAM_TOPIC_LINK_REGEX identifies the stream and topic mentions in the content correctly (tested by printing out the matches), the stream/topic mentions are still not linked to the corresponding streams/topics for imported messages, as a `zulip_message` instance is required for linking these mentions to actual streams/topics (see `StreamPattern` class in `markdown/__init__.py`) which is not provided while processing the markdown for imported messages.	2021-08-09 06:38:26 -07:00
Priyansh Garg	aed4e48da7	data_import: Import attachments from Rocket.Chat.	2021-08-09 06:38:26 -07:00
Priyansh Garg	65e28907cb	data_import: Import custom emoji from Rocket.Chat.	2021-08-09 06:38:26 -07:00
Priyansh Garg	4815f6e28b	data_import: Make slack bot emails unique. Slack bot emails generated by us can be duplicate for two bots. If such a case occur, append a counter to the email to make it unique. For maintaining the counter of duplicate emails and the final email assigned to each bot, a class based approach is used with static variables and static (class) methods. This keeps all the data related to slack bot emails at the same place and easily accessible from anywhere inside the module (without defining any class object and passing it around). Fixes: #16793	2021-08-03 16:18:14 -07:00
Anders Kaseorg	5483ebae37	python: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	ad5f0c05b5	python: Remove default "utf8" argument for encode(), decode(). Partially generated by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	3665deb93a	python: Remove unnecessary intermediate lists. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
rht	1bbd36d181	slack_import: Remove obsolete SlackImportAttachment placeholder. This was introduced in `f4ad464d82`, and incompletely removed in e037c2f93e649c28a71c02559b5ae7a3333f42a8; here we finish removing it.	2021-08-02 13:13:28 -07:00
Priyansh Garg	044fe547d3	data_import: Add huddle import support for Rocket.Chat.	2021-07-28 15:45:54 -07:00
Priyansh Garg	24dd0ff96c	data_import: Add rocket chat import tool. This commit allows to import the following from rocketchat: * All users * All public/private channels * All teams and its public/private channels * All discussion rooms as topics in their parent channel * All the messages in all the channels * All private conversations * Reactions on messages (except for custom emojis) * Mentions in messages (except @all, @here mentions)	2021-07-28 15:28:56 -07:00
Priyansh Garg	a21a280054	data_import: Rename mattermost_user to user_handler. This logic can be readily reused for new data import tools.	2021-07-15 14:28:36 -07:00
Priyansh Garg	94a2be06f3	markdown: Use a shared variable for IMAGE_EXTENSION.	2021-07-02 11:22:55 -07:00
Priyansh Garg	5b2e21965c	data_import: Add import attachments support for Mattermost. Add support for importing message attachments from Mattermost. Fixes: #18959	2021-07-02 11:19:45 -07:00
Alex Vandiver	ff9126ac1e	data_import: Protect better against bad Slack tokens. An invalid token would be treated the same as a token with no scopes; differentiate these better.	2021-05-27 22:46:58 -07:00
Alex Vandiver	94e4f33b29	data_import: Support importing from Slack conversions in a directory. Sometimes the Slack import zip file we get isn't quite the canonical form that Slack produces -- often because the user has unzip'd it, looked at it, and re-zip'd it, resulting in extra nested directories and the like. For such cases, support passing in a path to an unpacked Slack export tree.	2021-05-27 22:46:58 -07:00
Alex Vandiver	8228ea2a17	import_data: Do some quick verification of Slack import formats.	2021-05-27 22:46:58 -07:00
Anders Kaseorg	544bbd5398	docs: Fix capitalization mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-10 09:57:26 -07:00
Sumanth V Rao	40228972b9	models/realm: Add a model for storing realm playground information. Tweaked exports.py to add the config object there so that our export tool can include the table when exporting. Also includes all the changes required to import the new table from the exported data. Helper function `get_realm_playgrounds` added to fetch all playgrounds in a realm. Tests amended.	2021-04-07 08:20:53 +05:30
Cyril Pletinckx	ba7da6d5c0	import/export: Fix deprecated authentication method for Slack. The query string parameter authentication method is now deprecated for newly created Slack applications since the 24th of February[1]. This causes Slack imports to fail, claiming that the token has none of the required scopes. Two methods can be used to solve this problem: either include the authentication token in the header of an HTTP GET request, or include it in the body of an HTTP POST request. The former is preferred, as the code was already written to use HTTP GET requests. Change the way the parameters are passed to the "requests.get" method calls, to pass the token via the `Authorization` header. [1] https://api.slack.com/changelog/2020-11-no-more-tokens-in-querystrings-for-newly-created-apps Fixes: #17408.	2021-03-08 12:56:37 -08:00
Anders Kaseorg	a1ba3ca066	import_util: Strengthen get_users type using a Protocol. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-15 17:05:28 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Aman Agrawal	c685d36821	hipchat_import: Remove tool from codebase. Remove functions and scripts used by HipChat import tool and those which will no longer be required in future.	2020-12-23 08:28:49 -08:00
Tim Abbott	ed498e2f8e	import: Import mattermost admins as Zulip owners. Otherwise, we violate the invariant that all organizations have an owner.	2020-12-17 18:45:45 -08:00
Alex Vandiver	7c849fa940	slack: Check token access scopes before importing. The Slack API always (even for failed requests) puts the access scopes of the token passed in, into "X-OAuth-Scopes"[1], which can be used to determine if any are missing -- and if so, which. [1] https://api.slack.com/legacy/oauth-scopes#working-with-scopes	2020-12-15 11:33:15 -08:00
Tim Abbott	067cd3a97a	docs: Remove incorrect references to chat.zulip.org. Most of these are Help Center links that should be pointing to the production Help Center.	2020-10-29 16:46:40 -07:00
Anders Kaseorg	4e9d587535	python: Pass query parameters as a dict when making GET requests. This provides automatic URL-encoding. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	9a2aad58d0	import_util: Migrate from run_parallel to multiprocessing. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-09-14 16:22:23 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
Anders Kaseorg	f91d287447	python: Pre-fix a few spots for better Black formatting. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 17:51:09 -07:00
Anders Kaseorg	a276eefcfe	python: Rewrite dict() as {}. Suggested by the flake8-comprehensions plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Anders Kaseorg	61d0417e75	python: Replace ujson with orjson. Fixes #6507. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:55:12 -07:00
Anders Kaseorg	768f9f93cd	docs: Capitalize Markdown consistently. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Anders Kaseorg	60a25b2721	docs: Fix spelling errors caught by codespell. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:23:06 -07:00
Alex Vandiver	2928bbc8bd	logging: Report stack_info on logging.exception calls. The exception trace only goes from where the exception was thrown up to where the `logging.exception` call is; any context as to where _that_ was called from is lost, unless `stack_info` is passed as well. Having the stack is particularly useful for Sentry exceptions, which gain the full stack trace. Add `stack_info=True` on all `logging.exception` calls with a non-trivial stack; we omit `wsgi.py`. Adjusts tests to match.	2020-08-11 10:16:54 -07:00
Steve Howell	c44500175d	database: Remove short_name from UserProfile. A few major themes here: - We remove short_name from UserProfile and add the appropriate migration. - We remove short_name from various cache-related lists of fields. - We allow import tools to continue to write short_name to their export files, and then we simply ignore the field at import time. - We change functions like do_create_user, create_user_profile, etc. - We keep short_name in the /json/bots API. (It actually gets turned into an email.) - We don't modify our LDAP code much here.	2020-07-17 11:15:15 -07:00
Steve Howell	0b65abcdf5	pointer: Remove pointer from UserProfile. Most of the changes here are just that we no longer need to provide a value for pointer when we create UserProfile objects.	2020-07-03 13:08:40 +00:00
Anders Kaseorg	3ffed617a2	mypy: Type simple generators as Iterator, not Iterable. A generator that yields values without receiving or returning them is an Iterator. Although every Iterator happens to be iterable, Iterable is a confusing annotation for generators because a generator is only iterable once. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-23 11:29:54 -07:00
Anders Kaseorg	74c17bf94a	python: Convert more percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Now including %d, %i, %u, and multi-line strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	1ed2d9b4a0	logging: Use logging.exception and exc_info for unexpected exceptions. logging.exception() and logging.debug(exc_info=True), etc. automatically include a traceback. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	0d6c771baf	python: Guard against default value mutation with read-only types. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:31:27 -07:00
Anders Kaseorg	91a86c24f5	python: Replace None defaults with empty collections where appropriate. Use read-only types (List ↦ Sequence, Dict ↦ Mapping, Set ↦ AbstractSet) to guard against accidental mutation of the default value. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:31:27 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
Anders Kaseorg	6480deaf27	python: Convert more "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, with more restrictions patched out. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 14:48:09 -07:00
Tim Abbott	71078adc50	docs: Update URLs to use https://zulip.com . We're migrating to using the cleaner zulip.com domain, which involves changing all of our links from ReadTheDocs and other places to point to the cleaner URL.	2020-06-08 18:10:45 -07:00
sahil839	2f7d684a84	slack_import: Map slack owners to zulip realm owners. Slack owners and primary owners will be mapped to zulip realm owners on import. Previously, we mapped the owner and primary owner roles of slack to realm admins in zulip. As we have added ROLE_REALM_OWNER in `8bbc074`, we now map slack owners and primary owners to owners in zulip. Tests are modified for checking all the 3 cases- - Slack workspace primary owner - Slack workspace owner - Slack workspace admin This commit also has docs changes in 'import-from-slack.md'.	2020-06-08 16:22:54 -07:00
Anders Kaseorg	8dd83228e7	python: Convert "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, but with the NamedTuple changes reverted (see commit `ba7906a3c6`, #15132). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 15:31:20 -07:00
Tim Abbott	496c08e26c	slack import: Fix DefaultStream import of deactivated #random. If the #random channel in Slack is deactivated, we should follow Zulip's data model of not allowing deactivated, default streams. This had apparently happened in zulipchat.com for a few organizations, resulting in weird exceptions trying to invite new users.	2020-05-12 17:18:57 -07:00
Rohitt Vashishtha	9506be0f4f	slack-import: Downgrade Slack legacy-token check failure to warning. Slack has disabled creation of legacy tokens, which means we have to use other tokens for importing the data. Thus, we shouldn't throw an error if the token doesn't match the legacy token format. Since we do not have any other validation for those tokens yet, we log a warning but still try to continue with the import assuming that the token has the right scopes. See https://api.slack.com/changelog/2020-02-legacy-test-token-creation-to-retire.	2020-05-11 13:41:50 -07:00
Cyril Cohen	0d6f80059b	gitter import: Subscribe every user to every stream.	2020-05-05 21:31:35 -07:00
Cyril Cohen	5598f8f6b0	gitter: Support importing data from multiple Gitter rooms. Features: Improving `./manage.py convert_gitter_data` - If messages have been post-processed to add a 'room' field, we create as many streams as existing rooms. - Messages with a 'room' field go to the corresponding stream. - This modification is backward compatible. I.e. + messages that have no 'room' field go to the default stream/topic + messages that do, go to a specific stream Implementation: - adding a map `stream_map` to map room names to stream ids - create as many streams as room field messages + 1 default streamFeatures: - If messages have been post-processed to add a 'room' field to messages, we create as many streams as existing rooms. - Up to renaming of the default stream/topic, this modification is backwards compatible. I.e. messages that have no 'room' field go to the default stream/topic messages that do, go to a specific stream Implementation: - adding a map stream_map to map room names to stream ids - create as many streams as room field messages + 1 default stream Takes advantage of https://github.com/minrk/archive-gitter/pull/5.	2020-05-02 10:30:18 -07:00

1 2 3 4 5 ...

306 Commits