zulip

Commit Graph

Author	SHA1	Message	Date
Matt Keller	4d87bf291c	slack: Skip files where file_access: file_not_found.	2022-10-25 12:18:20 -07:00
Matt Keller	c5f106ce1b	slack: Skip files where file_access: access_denied. These stubs are incomplete and should be treated akin to tombstones.	2022-10-11 10:53:16 -07:00
Mateusz Mandera	00b3546c9f	models: Add denormalized .realm column to Message. This commit adds the OPTIONAL .realm attribute to Message (and ArchivedMessage), with the server changes for making new Messages have this set. Old Messages still have to be migrated to backfill this, before it can be non-nullable. Appropriate test changes to correctly set .realm for Messages the tests manually create are included here as well.	2022-10-07 10:09:38 -07:00
Mateusz Mandera	2811a1228f	import_util: Make build_message only take kwargs. build_message has a lot of arguments, so it's hard to verify correctness of callers that just try to get the order right. It's much clearer to be explicit via kwargs. mattermost.py and rocketchat.py already do this, so let's bring slack.py and gitter.py up to par.	2022-09-27 15:04:48 -07:00
Matt Keller	fd996c286e	slack: Filter out non-.json files for processing.	2022-09-23 09:59:34 -07:00
rht	a7cff0f091	Slack import: Translate to emoji name to codepoint using iamcal data. Because Slack emoji naming is different from Zulip's. According to https://emojipedia.org/slack/, Slack's emoji shortcodes are derived from https://github.com/iamcal/emoji-data. There are probably some deviations from that dataset, but this PR should at least catch the ones that are identical to iamcal's.	2022-09-17 12:04:07 -07:00
Mateusz Mandera	eed8800573	long_term_idle_helper: Change all_user_ids arg to an Iterator.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	75f26bb8ff	long_term_idle_helper: Take list of user_ids as arg instead of dicts. Only ["id"] is accessed on the dicts (representing the external tool users). Given that for some tools the id may be under a different name etc. due to different user dicts format, it's best to just pass those ids to the function so that it can stay generalized and not reliant on a specific user dict format.	2022-08-29 11:03:27 -07:00
Mateusz Mandera	c4c270380a	slack: Use get_timestamp_from_message helper function where relevant. get_timestamp_from_message was extracted in the previous commit. We can deduplicate and the code a bit cleaner by using it where appropriate instead of message["ts"].	2022-08-29 11:03:27 -07:00
Mateusz Mandera	9e56e71afe	long_term_idle_helper: Take timestamp_from_message callable arg. message["ts"] is slack-specific. For this to be a general util function it needs to take a callable that will grab a timestamp from the message dict (which has varying formats depending on what we're importing from).	2022-08-29 11:03:27 -07:00
Alex Vandiver	1b1faa3907	import_util: Factor out long_term_idle_helper.	2022-08-29 11:03:27 -07:00
Anders Kaseorg	b945aa3443	python: Use a real parser for email addresses. Now that we can assume Python 3.6+, we can use the email.headerregistry module to replace hacky manual email address parsing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-07-29 15:47:33 -07:00
Alex Vandiver	2e50ead9d1	data_import: Fix bot email address de-duplication. `4815f6e28b` tried to de-duplicate bot email addresses, but instead caused duplicates to crash: ``` Traceback (most recent call last): File "./manage.py", line 157, in <module> execute_from_command_line(sys.argv) File "./manage.py", line 122, in execute_from_command_line utility.execute() File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/__init__.py", line 413, in execute self.fetch_command(subcommand).run_from_argv(self.argv) File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 354, in run_from_argv self.execute(args, cmd_options) File "/srv/zulip-venv-cache/56ac6adf406011a100282dd526d03537be84d23e/zulip-py3-venv/lib/python3.8/site-packages/django/core/management/base.py", line 398, in execute output = self.handle(args, **options) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/management/commands/convert_slack_data.py", line 59, in handle do_convert_data(path, output_dir, token, threads=num_threads) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 1320, in do_convert_data ) = slack_workspace_to_realm( File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 141, in slack_workspace_to_realm ) = users_to_zerver_userprofile(slack_data_dir, user_list, realm_id, int(NOW), domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 248, in users_to_zerver_userprofile email = get_user_email(user, domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 406, in get_user_email return SlackBotEmail.get_email(user["profile"], domain_name) File "/home/zulip/deployments/2022-03-16-22-25-42/zerver/data_import/slack.py", line 85, in get_email email_prefix += cls.duplicate_email_count[email] TypeError: can only concatenate str (not "int") to str ``` Fix the stringification, make it case-insensitive, append with a dash for readability, and add tests for all of the above.	2022-03-31 11:10:18 -07:00
Anders Kaseorg	b0ce4f1bce	docs: Fix many spelling mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-07 18:51:06 -08:00
rht	58b19761b8	slack import: Fix requests.get usage of get_slack_api_data. We also rewrite the tests using the `responses` module to avoid the problematic mocking that made this bug possible. Fixes #19833.	2021-10-07 11:46:23 -07:00
rht	d8e1409fe5	Slack import: Use Python ZipFile to unzip. This should handle the case when non-ASCII Unicode folder names are created on Windows. Fixes #19899.	2021-10-07 09:24:19 -07:00
Priyansh Garg	4815f6e28b	data_import: Make slack bot emails unique. Slack bot emails generated by us can be duplicate for two bots. If such a case occur, append a counter to the email to make it unique. For maintaining the counter of duplicate emails and the final email assigned to each bot, a class based approach is used with static variables and static (class) methods. This keeps all the data related to slack bot emails at the same place and easily accessible from anywhere inside the module (without defining any class object and passing it around). Fixes: #16793	2021-08-03 16:18:14 -07:00
Anders Kaseorg	5483ebae37	python: Convert "".format to Python 3.6 f-strings. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
Anders Kaseorg	3665deb93a	python: Remove unnecessary intermediate lists. Generated automatically by pyupgrade. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-02 15:53:52 -07:00
rht	1bbd36d181	slack_import: Remove obsolete SlackImportAttachment placeholder. This was introduced in `f4ad464d82`, and incompletely removed in e037c2f93e649c28a71c02559b5ae7a3333f42a8; here we finish removing it.	2021-08-02 13:13:28 -07:00
Alex Vandiver	ff9126ac1e	data_import: Protect better against bad Slack tokens. An invalid token would be treated the same as a token with no scopes; differentiate these better.	2021-05-27 22:46:58 -07:00
Alex Vandiver	94e4f33b29	data_import: Support importing from Slack conversions in a directory. Sometimes the Slack import zip file we get isn't quite the canonical form that Slack produces -- often because the user has unzip'd it, looked at it, and re-zip'd it, resulting in extra nested directories and the like. For such cases, support passing in a path to an unpacked Slack export tree.	2021-05-27 22:46:58 -07:00
Alex Vandiver	8228ea2a17	import_data: Do some quick verification of Slack import formats.	2021-05-27 22:46:58 -07:00
Anders Kaseorg	544bbd5398	docs: Fix capitalization mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-05-10 09:57:26 -07:00
Cyril Pletinckx	ba7da6d5c0	import/export: Fix deprecated authentication method for Slack. The query string parameter authentication method is now deprecated for newly created Slack applications since the 24th of February[1]. This causes Slack imports to fail, claiming that the token has none of the required scopes. Two methods can be used to solve this problem: either include the authentication token in the header of an HTTP GET request, or include it in the body of an HTTP POST request. The former is preferred, as the code was already written to use HTTP GET requests. Change the way the parameters are passed to the "requests.get" method calls, to pass the token via the `Authorization` header. [1] https://api.slack.com/changelog/2020-11-no-more-tokens-in-querystrings-for-newly-created-apps Fixes: #17408.	2021-03-08 12:56:37 -08:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Alex Vandiver	7c849fa940	slack: Check token access scopes before importing. The Slack API always (even for failed requests) puts the access scopes of the token passed in, into "X-OAuth-Scopes"[1], which can be used to determine if any are missing -- and if so, which. [1] https://api.slack.com/legacy/oauth-scopes#working-with-scopes	2020-12-15 11:33:15 -08:00
Tim Abbott	067cd3a97a	docs: Remove incorrect references to chat.zulip.org. Most of these are Help Center links that should be pointing to the production Help Center.	2020-10-29 16:46:40 -07:00
Anders Kaseorg	4e9d587535	python: Pass query parameters as a dict when making GET requests. This provides automatic URL-encoding. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-27 13:47:02 -07:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	b7b7475672	python: Use standard secrets module to generate random tokens. There are three functional side effects: • Correct an insignificant but mathematically offensive bias toward repeated characters in generate_api_key introduced in commit 47b4283c4b4c70ecde4d3c8de871c90ee2506d87; its entropy is increased from 190.52864 bits to 190.53428 bits. • Use the base32 alphabet in confirmation.models.generate_key; its entropy is reduced from 124.07820 bits to the documented 120 bits, but now it uses 1 syscall instead of 24. • Use the base32 alphabet in get_bigbluebutton_url; its entropy is reduced from 51.69925 bits to 50 bits, but now it uses 1 syscall instead of 10. (The base32 alphabet is A-Z 2-7. We could probably replace all of these with plain secrets.token_urlsafe, since I expect most callers can handle the full urlsafe_b64 alphabet A-Z a-z 0-9 - _ without problems.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-09 15:52:57 -07:00
Anders Kaseorg	61d0417e75	python: Replace ujson with orjson. Fixes #6507. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-08-11 10:55:12 -07:00
Steve Howell	c44500175d	database: Remove short_name from UserProfile. A few major themes here: - We remove short_name from UserProfile and add the appropriate migration. - We remove short_name from various cache-related lists of fields. - We allow import tools to continue to write short_name to their export files, and then we simply ignore the field at import time. - We change functions like do_create_user, create_user_profile, etc. - We keep short_name in the /json/bots API. (It actually gets turned into an email.) - We don't modify our LDAP code much here.	2020-07-17 11:15:15 -07:00
Steve Howell	0b65abcdf5	pointer: Remove pointer from UserProfile. Most of the changes here are just that we no longer need to provide a value for pointer when we create UserProfile objects.	2020-07-03 13:08:40 +00:00
Anders Kaseorg	74c17bf94a	python: Convert more percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Now including %d, %i, %u, and multi-line strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	69730a78cc	python: Use trailing commas consistently. Automatically generated by the following script, based on the output of lint with flake8-comma: import re import sys last_filename = None last_row = None lines = [] for msg in sys.stdin: m = re.match( r"\x1b\[35mflake8 \\|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg ) if m: filename, row_str, col_str, err = m.groups() row, col = int(row_str), int(col_str) if filename == last_filename: assert last_row != row else: if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) with open(filename) as f: lines = f.readlines() last_filename = filename last_row = row line = lines[row - 1] if err in ["C812", "C815"]: lines[row - 1] = line[: col - 1] + "," + line[col - 1 :] elif err in ["C819"]: assert line[col - 2] == "," lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ") if last_filename is not None: with open(last_filename, "w") as f: f.writelines(lines) Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-06-11 16:04:12 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
Anders Kaseorg	6480deaf27	python: Convert more "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, with more restrictions patched out. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 14:48:09 -07:00
Tim Abbott	71078adc50	docs: Update URLs to use https://zulip.com . We're migrating to using the cleaner zulip.com domain, which involves changing all of our links from ReadTheDocs and other places to point to the cleaner URL.	2020-06-08 18:10:45 -07:00
sahil839	2f7d684a84	slack_import: Map slack owners to zulip realm owners. Slack owners and primary owners will be mapped to zulip realm owners on import. Previously, we mapped the owner and primary owner roles of slack to realm admins in zulip. As we have added ROLE_REALM_OWNER in `8bbc074`, we now map slack owners and primary owners to owners in zulip. Tests are modified for checking all the 3 cases- - Slack workspace primary owner - Slack workspace owner - Slack workspace admin This commit also has docs changes in 'import-from-slack.md'.	2020-06-08 16:22:54 -07:00
Anders Kaseorg	8dd83228e7	python: Convert "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, but with the NamedTuple changes reverted (see commit `ba7906a3c6`, #15132). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 15:31:20 -07:00
Tim Abbott	496c08e26c	slack import: Fix DefaultStream import of deactivated #random. If the #random channel in Slack is deactivated, we should follow Zulip's data model of not allowing deactivated, default streams. This had apparently happened in zulipchat.com for a few organizations, resulting in weird exceptions trying to invite new users.	2020-05-12 17:18:57 -07:00
Rohitt Vashishtha	9506be0f4f	slack-import: Downgrade Slack legacy-token check failure to warning. Slack has disabled creation of legacy tokens, which means we have to use other tokens for importing the data. Thus, we shouldn't throw an error if the token doesn't match the legacy token format. Since we do not have any other validation for those tokens yet, we log a warning but still try to continue with the import assuming that the token has the right scopes. See https://api.slack.com/changelog/2020-02-legacy-test-token-creation-to-retire.	2020-05-11 13:41:50 -07:00
Anders Kaseorg	bdc365d0fe	logging: Pass format arguments to logging. https://docs.python.org/3/howto/logging.html#optimization Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-02 10:18:02 -07:00
Anders Kaseorg	fead14951c	python: Convert assignment type annotations to Python 3.6 style. This commit was split by tabbott; this piece covers the vast majority of files in Zulip, but excludes scripts/, tools/, and puppet/ to help ensure we at least show the right error messages for Xenial systems. We can likely further refine the remaining pieces with some testing. Generated by com2ann, with whitespace fixes and various manual fixes for runtime issues: - invoiced_through: Optional[LicenseLedger] = models.ForeignKey( + invoiced_through: Optional["LicenseLedger"] = models.ForeignKey( -_apns_client: Optional[APNsClient] = None +_apns_client: Optional["APNsClient"] = None - notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - signup_notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + signup_notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - author: Optional[UserProfile] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) + author: Optional["UserProfile"] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) - bot_owner: Optional[UserProfile] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) + bot_owner: Optional["UserProfile"] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) - default_sending_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) - default_events_register_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_sending_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_events_register_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) -descriptors_by_handler_id: Dict[int, ClientDescriptor] = {} +descriptors_by_handler_id: Dict[int, "ClientDescriptor"] = {} -worker_classes: Dict[str, Type[QueueProcessingWorker]] = {} -queues: Dict[str, Dict[str, Type[QueueProcessingWorker]]] = {} +worker_classes: Dict[str, Type["QueueProcessingWorker"]] = {} +queues: Dict[str, Dict[str, Type["QueueProcessingWorker"]]] = {} -AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional[LDAPSearch] = None +AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional["LDAPSearch"] = None Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-22 11:02:32 -07:00
Siddharth Varshney	e03176b272	help: Add doc for setting profile picture back to gravatar.	2020-04-16 20:27:52 -07:00
Anders Kaseorg	c734bbd95d	python: Modernize legacy Python 2 syntax with pyupgrade. Generated by `pyupgrade --py3-plus --keep-percent-format` on all our Python code except `zthumbor` and `zulip-ec2-configure-interfaces`, followed by manual indentation fixes. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-09 16:43:22 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
Anders Kaseorg	e257253e64	emoji_codes: Replace JS module with JSON module. webpack optimizes JSON modules using JSON.parse("{…}"), which is faster than the normal JavaScript parser. Update the backend to use emoji_codes.json too instead of the three separate JSON files. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-02-12 10:09:12 -08:00
Vishnu KS	df5345705c	import: Support importing team icon from slack.	2020-02-03 14:09:05 -08:00
Tim Abbott	122e11c678	slack import: Fix handling of messages sent by user U00.	2020-01-25 22:47:49 -08:00
Tim Abbott	e052ec58db	slack import: Improve error messages around invalid tokens. This updates our error handling of invalid Slack API tokens (and other networking error handling) to mostly make sense: * A token that doesn't start with `xoxp-` gives an extended error early. * An AssertionError for the codebase is correctly declared as such. * We check for token shape errors before querying the Slack API. We could still do useful work to raise custom exception classes here. Thanks to @stavrospat for raising this issue.	2020-01-22 14:48:32 -08:00
Tlazypanda	6945ced76f	slack import: Map Slack guest users to Zulip guests. Slack's Single-User Guest and Multi-User Guest users should be imported as Zulip guests during data import. Fixes #13255.	2019-11-12 12:12:59 -08:00
Rishi Gupta	e10361a832	models: Replace is_guest and is_realm_admin with UserProfile.role. This new data model will be more extensible for future work on features like a primary administrator.	2019-10-06 16:24:37 -07:00
Vishnu KS	d434c0ee88	slack: Remove unnecessary comments. Remove comments that tries to explain code that is already readable. Also remove some todo comments that has been already taken care of.	2019-08-26 14:10:19 -07:00
Vishnu KS	99d34fd11d	slack: Rename default_channels to slack_default_channels.	2019-08-26 14:10:19 -07:00
Vishnu KS	b919514f7f	slack: Rename customprofilefield_id to custom_profile_field_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	c31355f9c1	slack: Rename custom_field_id_count to custom_profile_field_value_id_count.	2019-08-26 14:10:19 -07:00
Vishnu KS	138c659c97	slack: Rename slack_custom_field_name_to_zulip_custom_field_id. Rename custom_field_map to slack_custom_field_name_to_zulip_custom_field_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	9560736d86	slack: Rename slack_user_id_to_custom_profile_fields. Renames slack_user_custom_field_map to slack_user_id_to_custom_profile_fields for readability.	2019-08-26 14:10:19 -07:00
Vishnu KS	01a51c8f4e	slack: Rename added_recipient to slack_recipient_name_to_zulip_recipient_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	9d51a1b527	slack: Rename added_users to slack_user_id_to_zulip_user_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	3650f19692	slack: Lookup dir_name key in dict instead of in dict_keys. No reason to do the lookup in O(n) when we can do it in average O(1) time complexity.	2019-08-26 14:10:19 -07:00
Vishnu Ks	1e5c49ad82	slack: Support importing shared channels.	2019-08-26 14:10:19 -07:00
Vishnu Ks	e09a29f4d3	slack: Refactor get_slack_api_data to accept multiple query params.	2019-08-26 14:10:19 -07:00
Tim Abbott	9827801569	slack import: Improve readability of user recipient object allocation. This loop management tweak makes it a bit more obvious what's happening in this block of code.	2019-07-30 14:46:14 -07:00
Vishnu KS	ff3871fc63	slack_import: Clean up return values of channels_to_zerver_stream. This commits reduces the number of values returned by channel_to_zerver_stream function by setting the values directly in realm dict and returning it instead.	2019-07-30 14:46:14 -07:00
Vishnu Ks	6110f495df	slack_import: Support importing pms.	2019-07-30 14:46:14 -07:00
Vishnu Ks	5e6d86c8c4	slack_import: Support importing multiparty IMs.	2019-07-09 15:03:28 -07:00
Vishnu Ks	443439d388	slack_import: Support importing private slack channels.	2019-06-28 11:03:32 -07:00
Vishnu Ks	196388cee3	slack_import: Extract processing channels into a seperate function.	2019-06-28 11:00:59 -07:00
Vishnu Ks	55bf44152a	import: Handle hidden_by_limit case for files in slack import. Fixes #12011	2019-05-30 12:01:09 -07:00
Anders Kaseorg	643bd18b9f	lint: Fix code that evaded our lint checks for string % non-tuple. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-23 15:21:37 -07:00
Ben Muschol	d526ff00f2	settings: Rename "user avatar" to "profile picture" This renames references to user avatars, bot avatars, or organization icons to profile pictures. The string in the UI are updated, in addition to the help files, comments, and documentation. Actual variable/function names, changelog entries, routes, and s3 buckets are left as-is in order to avoid introducing bugs. Fixes #11824.	2019-03-15 13:29:56 -07:00
Tim Abbott	412d35900f	slack import: Fix handling of tombstone files. Apparently, the mode attribute is not always present.	2019-03-13 14:39:20 -07:00
Tim Abbott	49680a4503	slack import: Skip processing tombstone files. The tombstone files undocumented feature of Slack's export format appears sometimes and has no real data, so we just need to skip these. Fixes #11619.	2019-03-13 12:43:11 -07:00
Rishi Gupta	e183c316dd	help: Rename help/change-your-avatar to help/set-your-avatar.	2019-02-13 17:50:39 -08:00
Anders Kaseorg	56a675d5ec	export: Remove unused imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:25:27 -08:00
Pragati Agrawal	e1772b3b8f	tools: Upgrade Pycodestyle and fix new linter errors. Here, we are upgrading pycodestyle version from 2.4.0 to 2.5.0. Fixes: #11396.	2019-01-31 12:21:41 -08:00
Matthew Wegner	370cf1a2cb	import: Normalize Slackbot String Comparison. In very old Slack workspaces, slackbot can appear as "Slackbot", and the import script only checks for "slackbot" (case sensitive). This breaks the import--it throws the assert that immediately follows the test. I don't know how common this is, but it definitely affected our import. The simple fix is to compare against a lowercased-version of the user's full name.	2019-01-28 14:59:41 -08:00
Tim Abbott	8a90441d2f	slack import: Import long-inactive users as long-term idle. This avoids creating UserMessage rows for long-inactive users in organizations with many thousands of users.	2018-12-16 18:52:20 -08:00
Tim Abbott	a6ca95dfc4	slack import: Fix all messages being imported to one channel. This was an ugly variable-escape-from-loop regression introduced in `e59ff6e6db`.	2018-12-12 17:54:37 -08:00
Tim Abbott	d6217eb862	slack import: Fix empty values for custom profile fields. The Slack import process would incorrectly issue CustomProfileFieldValue entries with a value of "" for users who didn't have a given CustomProfileField (especially common for the "skype" and "phone" fields). This had no user-visible effect, but certainly added some clutter in the database.	2018-12-12 12:58:27 -08:00
rht	e59ff6e6db	slack import: Eliminate need to load all messages into memory. This works by yielding messages sorted based on timestamp. Because the Slack exports are broken into files by date, it's convenient to do a 2-layer sorting process, where we open all the files for a given day, and then sort their messages by timestamp before yielding them. Fixes #10930.	2018-12-05 12:20:50 -08:00
Steve Howell	d86dd165da	gitter/slack/hipchat: Remove "subject" from conversions. We (lexically) remove "subject" from the conversion code. The `build_message` helper calls `set_topic_name` under the hood, so things still have "subject" in the JSON. There was good code coverage on `build_message`.	2018-11-12 15:47:11 -08:00
Tim Abbott	8b661f2f03	slack import: Correctly detect the commenting user. Fixes #10772.	2018-11-06 13:14:23 -08:00
Steve Howell	30c493ed24	slack import: Generate message_id/reaction_id with NEXT_ID. This avoids the need to pass tuples of ints around, which is pretty brittle.	2018-10-29 13:24:50 -07:00
Steve Howell	2f58eb1057	slack import: Extract process_message_files(). This is mostly an extraction, but it does change the way we calculate `content`. We append the markdown links from ALL files to any content that came in the message itself. Separating this out also allows us to add more test coverage for the extracted code.	2018-10-29 13:24:50 -07:00
Steve Howell	00f822a26a	conversion: Generate attachment_ids with helpers.	2018-10-29 13:24:50 -07:00
Steve Howell	5cb60f7bea	conversions: Use subscriber_map for Slack/Gitter. We now use subscriber_map for building UserMessage rows in Slack/Gitter conversions. This is mostly designed to simplify the code, rather than having to scan the entire subscribers for each message. I am guessing this will improve performance for most conversions. We sort small lists on every message, in order to be deterministic, but the sorting cost is probably more than offset by avoiding the O(N) scans across all subscriptions. Also, it's probably negligible in the grand scheme of things, compared to JSON parsing, file I/O, etc. This commits also fixes some typos with mentioned_users_id -> mentioned_user_ids and cleans up a test a bit as well.	2018-10-29 13:24:50 -07:00
Steve Howell	5194701787	conversions: Use NEXT_ID for usermessage_id. This is mostly complicated due to the way that the Slack import passes around tuples of ids to maintain four different parallel sequences.	2018-10-29 13:24:50 -07:00
Tim Abbott	f9b6eeb488	import: Migrate from json to ujson for better perf. We expect to get better memory performace from ujson than json. We also do a better job of closing file handles. This likely fixes #10377.	2018-10-17 12:11:08 -07:00
Tim Abbott	78a15dd715	slack import: Fix obscure email address for Slackbot. Since we know what slackbot is, we don't need to give it a crazy hash as its email address.	2018-10-16 16:33:41 -07:00
Steve Howell	8accc60ca7	import_util: Support multiple message ids for attachments.	2018-10-13 16:47:44 -07:00
Steve Howell	23d7b3d2cc	import: De-dup create_converted_data_files helper.	2018-10-13 16:47:41 -07:00
Rhea Parekh	f70b9a3eba	import: Move 'build_message' to import_util.	2018-08-19 22:27:13 -07:00
Rhea Parekh	53e9da8e1f	import: Build CustomProfileField, CustomProfileFieldValue and RealmEmoji with model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	d98a5925cb	import: Build Reaction with the model class.	2018-08-19 22:27:13 -07:00

1 2 3 4

167 Commits