zulip

Commit Graph

Author	SHA1	Message	Date
sahil839	2f7d684a84	slack_import: Map slack owners to zulip realm owners. Slack owners and primary owners will be mapped to zulip realm owners on import. Previously, we mapped the owner and primary owner roles of slack to realm admins in zulip. As we have added ROLE_REALM_OWNER in `8bbc074`, we now map slack owners and primary owners to owners in zulip. Tests are modified for checking all the 3 cases- - Slack workspace primary owner - Slack workspace owner - Slack workspace admin This commit also has docs changes in 'import-from-slack.md'.	2020-06-08 16:22:54 -07:00
Anders Kaseorg	8dd83228e7	python: Convert "".format to Python 3.6 f-strings. Generated by pyupgrade --py36-plus --keep-percent-format, but with the NamedTuple changes reverted (see commit `ba7906a3c6`, #15132). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-08 15:31:20 -07:00
Tim Abbott	496c08e26c	slack import: Fix DefaultStream import of deactivated #random. If the #random channel in Slack is deactivated, we should follow Zulip's data model of not allowing deactivated, default streams. This had apparently happened in zulipchat.com for a few organizations, resulting in weird exceptions trying to invite new users.	2020-05-12 17:18:57 -07:00
Rohitt Vashishtha	9506be0f4f	slack-import: Downgrade Slack legacy-token check failure to warning. Slack has disabled creation of legacy tokens, which means we have to use other tokens for importing the data. Thus, we shouldn't throw an error if the token doesn't match the legacy token format. Since we do not have any other validation for those tokens yet, we log a warning but still try to continue with the import assuming that the token has the right scopes. See https://api.slack.com/changelog/2020-02-legacy-test-token-creation-to-retire.	2020-05-11 13:41:50 -07:00
Cyril Cohen	0d6f80059b	gitter import: Subscribe every user to every stream.	2020-05-05 21:31:35 -07:00
Cyril Cohen	5598f8f6b0	gitter: Support importing data from multiple Gitter rooms. Features: Improving `./manage.py convert_gitter_data` - If messages have been post-processed to add a 'room' field, we create as many streams as existing rooms. - Messages with a 'room' field go to the corresponding stream. - Up to renaming of the default stream/topic, this modification is backward compatible. I.e. + messages that have no 'room' field go to the default stream/topic + messages that do, go to a specific stream Implementation: - adding a map `stream_map` to map room names to stream ids - create as many streams as room field messages + 1 default stream Takes advantage of https://github.com/minrk/archive-gitter/pull/5.	2020-05-02 10:30:18 -07:00
Anders Kaseorg	bdc365d0fe	logging: Pass format arguments to logging. https://docs.python.org/3/howto/logging.html#optimization Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-05-02 10:18:02 -07:00
Anders Kaseorg	fead14951c	python: Convert assignment type annotations to Python 3.6 style. This commit was split by tabbott; this piece covers the vast majority of files in Zulip, but excludes scripts/, tools/, and puppet/ to help ensure we at least show the right error messages for Xenial systems. We can likely further refine the remaining pieces with some testing. Generated by com2ann, with whitespace fixes and various manual fixes for runtime issues: - invoiced_through: Optional[LicenseLedger] = models.ForeignKey( + invoiced_through: Optional["LicenseLedger"] = models.ForeignKey( -_apns_client: Optional[APNsClient] = None +_apns_client: Optional["APNsClient"] = None - notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - signup_notifications_stream: Optional[Stream] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) + signup_notifications_stream: Optional["Stream"] = models.ForeignKey('Stream', related_name='+', null=True, blank=True, on_delete=CASCADE) - author: Optional[UserProfile] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) + author: Optional["UserProfile"] = models.ForeignKey('UserProfile', blank=True, null=True, on_delete=CASCADE) - bot_owner: Optional[UserProfile] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) + bot_owner: Optional["UserProfile"] = models.ForeignKey('self', null=True, on_delete=models.SET_NULL) - default_sending_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) - default_events_register_stream: Optional[Stream] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_sending_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) + default_events_register_stream: Optional["Stream"] = models.ForeignKey('zerver.Stream', null=True, related_name='+', on_delete=CASCADE) -descriptors_by_handler_id: Dict[int, ClientDescriptor] = {} +descriptors_by_handler_id: Dict[int, "ClientDescriptor"] = {} -worker_classes: Dict[str, Type[QueueProcessingWorker]] = {} -queues: Dict[str, Dict[str, Type[QueueProcessingWorker]]] = {} +worker_classes: Dict[str, Type["QueueProcessingWorker"]] = {} +queues: Dict[str, Dict[str, Type["QueueProcessingWorker"]]] = {} -AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional[LDAPSearch] = None +AUTH_LDAP_REVERSE_EMAIL_SEARCH: Optional["LDAPSearch"] = None Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-22 11:02:32 -07:00
Anders Kaseorg	1cf63eb5bf	python: Whitespace fixes from autopep8. Generated by autopep8, with the setup.cfg configuration from #14532. I’m not sure why pycodestyle didn’t already flag these. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-21 17:58:09 -07:00
Siddharth Varshney	e03176b272	help: Add doc for setting profile picture back to gravatar.	2020-04-16 20:27:52 -07:00
Anders Kaseorg	c734bbd95d	python: Modernize legacy Python 2 syntax with pyupgrade. Generated by `pyupgrade --py3-plus --keep-percent-format` on all our Python code except `zthumbor` and `zulip-ec2-configure-interfaces`, followed by manual indentation fixes. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-04-09 16:43:22 -07:00
Stefan Weil	d2fa058cc1	text: Fix some typos (most of them found and fixed by codespell). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2020-03-27 17:25:56 -07:00
Anders Kaseorg	e257253e64	emoji_codes: Replace JS module with JSON module. webpack optimizes JSON modules using JSON.parse("{…}"), which is faster than the normal JavaScript parser. Update the backend to use emoji_codes.json too instead of the three separate JSON files. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2020-02-12 10:09:12 -08:00
Vishnu KS	df5345705c	import: Support importing team icon from slack.	2020-02-03 14:09:05 -08:00
Tim Abbott	122e11c678	slack import: Fix handling of messages sent by user U00.	2020-01-25 22:47:49 -08:00
Tim Abbott	e052ec58db	slack import: Improve error messages around invalid tokens. This updates our error handling of invalid Slack API tokens (and other networking error handling) to mostly make sense: * A token that doesn't start with `xoxp-` gives an extended error early. * An AssertionError for the codebase is correctly declared as such. * We check for token shape errors before querying the Slack API. We could still do useful work to raise custom exception classes here. Thanks to @stavrospat for raising this issue.	2020-01-22 14:48:32 -08:00
Tlazypanda	6945ced76f	slack import: Map Slack guest users to Zulip guests. Slack's Single-User Guest and Multi-User Guest users should be imported as Zulip guests during data import. Fixes #13255.	2019-11-12 12:12:59 -08:00
Tim Abbott	aad99ce951	mattermost import: Fix handling of channels with no subscribers. Previously, we skipped setting the list of subscribers to the channel, which could result in problems if any messages had been posted there in the past (e.g. because the channel used to have members, but now doesn't). It could be correct to skip importing dead channels altogether, but probably simpler is to just set an empty subscriber list.	2019-11-04 18:10:37 -08:00
Tim Abbott	dc682da47a	mattermost: Handle replies to private messages. Previously, our logic to handle Mattermost's "replies" feature didn't copy the right fields for private messages, where `channel_members` is included on the message body rather than a `channel` name.	2019-11-04 18:10:37 -08:00
Vishnu KS	1585ad7bf4	mattermost: Add support for exporting DMs and huddles.	2019-10-10 16:37:03 -07:00
Rishi Gupta	e10361a832	models: Replace is_guest and is_realm_admin with UserProfile.role. This new data model will be more extensible for future work on features like a primary administrator.	2019-10-06 16:24:37 -07:00
Mateusz Mandera	dbe508bb91	models: Migration of Message.pub_date to date_sent, part 2. Fixes #1727. With the server down, apply migrations 0245 and 0246. 0246 will remove the pub_date column, so it's essential that the previous migrations ran correctly to copy data before running this.	2019-10-05 19:01:34 -07:00
Vishnu KS	a21856c569	mattermost: Rename user_id to sender_user_id in process_raw_message_batch.	2019-09-25 20:06:47 +05:30
Vishnu KS	23d70bb685	mattermost: Rename get_recipient_id to get_recipient_id_from_receiver_name.	2019-09-25 20:06:04 +05:30
Vishnu Ks	c4af0b7bc4	mattermost: Support importing messages without team name. Mattermost doesn't place private messages within a particular team, which is what this is needed for.	2019-09-18 11:57:37 -07:00
Vishnu Ks	bf5f531e90	import_util: Support huddles in SubscriberHandler.	2019-09-18 11:53:13 -07:00
Vishnu KS	d434c0ee88	slack: Remove unnecessary comments. Remove comments that try to explain code that is already readable. Also remove some todo comments that have already been taken care of.	2019-08-26 14:10:19 -07:00
Vishnu KS	99d34fd11d	slack: Rename default_channels to slack_default_channels.	2019-08-26 14:10:19 -07:00
Vishnu KS	b919514f7f	slack: Rename customprofilefield_id to custom_profile_field_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	c31355f9c1	slack: Rename custom_field_id_count to custom_profile_field_value_id_count.	2019-08-26 14:10:19 -07:00
Vishnu KS	138c659c97	slack: Rename slack_custom_field_name_to_zulip_custom_field_id. Rename custom_field_map to slack_custom_field_name_to_zulip_custom_field_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	9560736d86	slack: Rename slack_user_id_to_custom_profile_fields. Renames slack_user_custom_field_map to slack_user_id_to_custom_profile_fields for readability.	2019-08-26 14:10:19 -07:00
Vishnu KS	01a51c8f4e	slack: Rename added_recipient to slack_recipient_name_to_zulip_recipient_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	9d51a1b527	slack: Rename added_users to slack_user_id_to_zulip_user_id.	2019-08-26 14:10:19 -07:00
Vishnu KS	3650f19692	slack: Lookup dir_name key in dict instead of in dict_keys. No reason to do the lookup in O(n) when we can do it in average O(1) time complexity.	2019-08-26 14:10:19 -07:00
Vishnu Ks	1e5c49ad82	slack: Support importing shared channels.	2019-08-26 14:10:19 -07:00
Vishnu Ks	e09a29f4d3	slack: Refactor get_slack_api_data to accept multiple query params.	2019-08-26 14:10:19 -07:00
Tim Abbott	69505c30a4	mattermost: Handle users who aren't on any team correctly. It's not clear to me how this is intended to work in Mattermost's system in that they don't document this behavior, but some users have `null` as their list of teams, and presumably are not meant to be included in any team at all.	2019-08-19 16:06:39 -07:00
Anders Kaseorg	e417d3a040	slack_message_conversion: Clean up type ignores. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-08-09 16:39:16 -07:00
Tim Abbott	9827801569	slack import: Improve readability of user recipient object allocation. This loop management tweak makes it a bit more obvious what's happening in this block of code.	2019-07-30 14:46:14 -07:00
Vishnu KS	ff3871fc63	slack_import: Clean up return values of channels_to_zerver_stream. This commit reduces the number of values returned by the channels_to_zerver_stream function by setting the values directly in the realm dict and returning it instead.	2019-07-30 14:46:14 -07:00
Vishnu Ks	6110f495df	slack_import: Support importing pms.	2019-07-30 14:46:14 -07:00
Vishnu Ks	5e6d86c8c4	slack_import: Support importing multiparty IMs.	2019-07-09 15:03:28 -07:00
Vishnu Ks	443439d388	slack_import: Support importing private slack channels.	2019-06-28 11:03:32 -07:00
Vishnu Ks	196388cee3	slack_import: Extract processing channels into a separate function.	2019-06-28 11:00:59 -07:00
Vishnu Ks	55bf44152a	import: Handle hidden_by_limit case for files in slack import. Fixes #12011	2019-05-30 12:01:09 -07:00
Anders Kaseorg	643bd18b9f	lint: Fix code that evaded our lint checks for string % non-tuple. Signed-off-by: Anders Kaseorg <anders@zulipchat.com>	2019-04-23 15:21:37 -07:00
Vishnu Ks	02c92e55a2	import: Add tool for importing teams from mattermost.	2019-04-05 17:53:03 -07:00
Vishnu Ks	f517f72dd2	import: Make use of is_mirror_dummy in build_user_profile.	2019-04-04 13:51:52 -07:00
Vishnu Ks	bd4c3b3ebb	import: Move make_user_messages to import_util.py.	2019-04-04 13:51:52 -07:00
Vishnu Ks	d921fd25e4	import: Move SubscriberHandler to import_util.	2019-03-20 11:29:51 -07:00
Vishnu Ks	6e3720e0b7	gitter: Fix minor comment typo in build_userprofile.	2019-03-20 10:12:18 -07:00
Ben Muschol	d526ff00f2	settings: Rename "user avatar" to "profile picture". This renames references to user avatars, bot avatars, or organization icons to profile pictures. The strings in the UI are updated, in addition to the help files, comments, and documentation. Actual variable/function names, changelog entries, routes, and s3 buckets are left as-is in order to avoid introducing bugs. Fixes #11824.	2019-03-15 13:29:56 -07:00
Tim Abbott	412d35900f	slack import: Fix handling of tombstone files. Apparently, the mode attribute is not always present.	2019-03-13 14:39:20 -07:00
Tim Abbott	49680a4503	slack import: Skip processing tombstone files. The tombstone files undocumented feature of Slack's export format appears sometimes and has no real data, so we just need to skip these. Fixes #11619.	2019-03-13 12:43:11 -07:00
Tim Abbott	cbc62b8e07	streams: Prevent creation of multi-line stream descriptions. We do not anticipate our UI for showing stream descriptions looking reasonable for multi-line descriptions, so we should just ban creating them. Given the frontend changes, multi-line descriptions are only likely to show up from importing content from other tools, in which case replacing newlines with spaces is cleaner than the alternative.	2019-02-20 12:28:00 -08:00
Rishi Gupta	e183c316dd	help: Rename help/change-your-avatar to help/set-your-avatar.	2019-02-13 17:50:39 -08:00
Anders Kaseorg	56a675d5ec	export: Remove unused imports. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2019-02-02 17:25:27 -08:00
Hemanth V. Alluri	73d26c8b28	streams: Render and store the stream description from the backend. This commit does the following three things: 1. Update stream model to accommodate rendered description. 2. Render and save the stream rendered description on update. 3. Render and save stream descriptions on creation. Further, the stream's rendered description is also sent whenever the stream's description is being sent. This is preparatory work for eliminating the use of the non-authoritative marked.js markdown parser for stream descriptions.	2019-02-01 22:24:18 -08:00
Tim Abbott	9b25f8789f	hipchat: Fix handling of user IDs in Stride import. We've had this code oscillate a few times; the original comparison was added as part of Stride import but broke HipChat import. `c34a8f2e69` fixed HipChat import but regressed Stride. This change fixes this for both HipChat + Stride.	2019-01-31 12:40:05 -08:00
Pragati Agrawal	e1772b3b8f	tools: Upgrade Pycodestyle and fix new linter errors. Here, we are upgrading pycodestyle version from 2.4.0 to 2.5.0. Fixes: #11396.	2019-01-31 12:21:41 -08:00
Matthew Wegner	370cf1a2cb	import: Normalize Slackbot String Comparison. In very old Slack workspaces, slackbot can appear as "Slackbot", and the import script only checks for "slackbot" (case sensitive). This breaks the import--it throws the assert that immediately follows the test. I don't know how common this is, but it definitely affected our import. The simple fix is to compare against a lowercased-version of the user's full name.	2019-01-28 14:59:41 -08:00
Tim Abbott	4c603990d2	hipchat: Use HTML2Text for the content. While the result is by no means perfect, it's significantly cleaner than what we had before this.	2019-01-09 16:59:45 -08:00
Tim Abbott	53436766c1	hipchat: Improve import of public room subscribers. Now, if you pass an api_key, we'll initialize the public room subscribers to be whatever they were at the time the import happened. Also, document the situation on the caveats section.	2019-01-09 16:50:00 -08:00
Tim Abbott	035138dd98	hipchat: Refactor code for building subscriptions. This moves the filtering of invite-only into the caller, and also adjusts the indentation.	2019-01-09 16:50:00 -08:00
Tim Abbott	c34a8f2e69	hipchat: Fix importing of private messages. Apparently a stupid typing issue meant that we broke this a few weeks ago.	2019-01-09 16:50:00 -08:00
Tim Abbott	b7fb919dfa	hipchat: Don't enable slim_mode by default. For small organizations, generally one prefers the non-slim_mode default behavior.	2019-01-04 11:30:25 -08:00
Tim Abbott	492230f405	hipchat import: Always include deleted users in import. The slim_mode setting had been incorrectly configured to skip "deleted" users, resulting in bugs where private messages with deleted users would not be imported.	2019-01-04 11:23:02 -08:00
Tim Abbott	41b7d9a4c8	hipchat: Handle unusual emoticons.json format. Apparently, hc-migrate can generate emoticons.json files with a somewhat different format. Assuming that other files are in the normal format, we should be able to handle it like this. See report in #11135.	2018-12-29 18:59:31 -08:00
Tim Abbott	44f117ac72	hipchat: Handle case where emoticons.json is not in export. Apparently, some methods of exporting from HipChat do not include an emoticons.json file. We could test for this using the `include_emoticons` field in `metadata.json`, but we currently don't even bother to read that file. Rather than changing that, we just print a warning and proceed. This is arguably better anyway, in that often not having emoticons.json is the result of user error when exporting, and it's nice to flag that this is happening. Fixes #11135.	2018-12-29 18:54:50 -08:00
Tim Abbott	c995e8e2ae	import: Ensure presence of basic avatar images for HipChat. Our HipChat conversion tool didn't properly handle basic avatar images, resulting in only the medium-size avatar images being imported properly. This fixes that bug by asking the import tool to do the thumbnailing for the basic avatar image (from the .original file) as well as the medium avatar image.	2018-12-27 17:47:09 -08:00
Tim Abbott	8a90441d2f	slack import: Import long-inactive users as long-term idle. This avoids creating UserMessage rows for long-inactive users in organizations with many thousands of users.	2018-12-16 18:52:20 -08:00
Tim Abbott	a6ca95dfc4	slack import: Fix all messages being imported to one channel. This was an ugly variable-escape-from-loop regression introduced in `e59ff6e6db`.	2018-12-12 17:54:37 -08:00
Tim Abbott	d6217eb862	slack import: Fix empty values for custom profile fields. The Slack import process would incorrectly issue CustomProfileFieldValue entries with a value of "" for users who didn't have a given CustomProfileField (especially common for the "skype" and "phone" fields). This had no user-visible effect, but certainly added some clutter in the database.	2018-12-12 12:58:27 -08:00
Tim Abbott	e9900b2bdf	gitter: Do something reasonable with invalid fullnames.	2018-12-12 10:07:52 -08:00
rht	e59ff6e6db	slack import: Eliminate need to load all messages into memory. This works by yielding messages sorted based on timestamp. Because the Slack exports are broken into files by date, it's convenient to do a 2-layer sorting process, where we open all the files for a given day, and then sort their messages by timestamp before yielding them. Fixes #10930.	2018-12-05 12:20:50 -08:00
Tim Abbott	48a3975ec0	import: Avoid unnecessary forks when downloading attachments. The previous implementation used run_parallel incorrectly, passing it a set of very small jobs (each was to download a single file), which meant that we'd end up forking once for every file to download. This correct implementation sends each of N threads 1/N of the files to download, which is more consistent with the goal of distributing the download work between N threads.	2018-12-02 13:50:27 -08:00
Tim Abbott	00826486bd	hipchat: Fix typo in logging output.	2018-11-26 16:44:31 -08:00
Steve Howell	38f81d5d20	hipchat: Skip public stream subs in slim mode.	2018-11-26 16:37:30 -08:00
Steve Howell	c2e9f5eb0a	hipchat: Limit messages in slim mode. For messages with strange senders, we don't import messages. Basically, we only import a message if it has sender with an id that maps to a non-deleted user.	2018-11-26 16:37:30 -08:00
Steve Howell	3a7788217e	hipchat: Skip really long messages.	2018-11-26 16:37:30 -08:00
Steve Howell	e57a932692	hipchat: Fix avatars. This code was not reading any avatars because it was not referencing 'User' to get to the avatar, and it was not re-mapping user ids for some reason.	2018-11-26 16:37:30 -08:00
Steve Howell	ad35e371fe	hipchat: Support slim_mode flag. We now skip deleted users. There is a flag here that's hard coded to True--we may decide later to make this a command line option.	2018-11-26 16:37:30 -08:00
Steve Howell	bd1e96cf63	hipchat: Rework stream/subscriber logic. We now account for streams having users that may be deleted. We do a couple things: - use a loop instead of map - only pass in users to hipchat_subscriber - early-exit if there are no users - skip owner/members logic for public streams	2018-11-26 16:37:30 -08:00
Steve Howell	1335dfd295	hipchat: Handle messages with missing recipients. If a message is for a stream or user that we didn't load, then we just skip it.	2018-11-26 16:37:30 -08:00
Steve Howell	ff68757358	hipchat: Just skip over missing attachments. It seems like we get a lot of exports with bad attachment data, and some folks don't necessarily care, so we just skip for now.	2018-11-26 16:37:30 -08:00
Steve Howell	ea26372083	hipchat: Make conversion work with UUID ids from Stride. Normal hipchat exports use integer ids for their users and "rooms," which we just borrowed during conversion. Atlassian Stride uses stride UUIDs for these instead, but otherwise has the same export format. We now introduce IdMapper to handle external ids that aren't integer. The IdMapper will map UUID ids to ints and remember them. For ints it just leaves them alone. Fixes #10805.	2018-11-14 23:22:40 -08:00
Steve Howell	aff84cd1e9	hipchat: Skip attachments without paths. This is a short term workaround. Some variants of HipChat exports are missing `path`, and we just punt for now.	2018-11-14 23:14:13 -08:00
Steve Howell	d86dd165da	gitter/slack/hipchat: Remove "subject" from conversions. We (lexically) remove "subject" from the conversion code. The `build_message` helper calls `set_topic_name` under the hood, so things still have "subject" in the JSON. There was good code coverage on `build_message`.	2018-11-12 15:47:11 -08:00
Tim Abbott	e88998e6d4	import: Fix buggy handling of avatars in Slack conversion. This was a pretty nasty error, where we were accidentally accessing the parent list in this inner loop function. This appears to have been introduced as a refactoring bug in `7822ef38c2`.	2018-11-08 15:03:39 -08:00
Tim Abbott	8b661f2f03	slack import: Correctly detect the commenting user. Fixes #10772.	2018-11-06 13:14:23 -08:00
Tim Abbott	81a4c846f4	hipchat: Set s3_path for exported emoji. This fixes an issue where the import process would fail when importing to a server using the S3 backend.	2018-11-06 13:02:04 -08:00
Tim Abbott	539e84e9a1	hipchat import: Stop setting last_modified=None. The last_modified field is intended to support setting the orig-last-modified field in the S3 backend when importing, basically to keep track of this bit of pre-export data for debugging. In the event that it isn't available, the correct thing to do is not write out an invalid `last_modified` field; we should just not write it out at all.	2018-11-06 12:50:36 -08:00
Tim Abbott	d54af3cb5b	hipchat import: Handle deactivated users without an email address. We saw this in a recent HipChat import data set.	2018-11-01 10:09:19 -07:00
Steve Howell	30c493ed24	slack import: Generate message_id/reaction_id with NEXT_ID. This avoids the need to pass tuples of ints around, which is pretty brittle.	2018-10-29 13:24:50 -07:00
Steve Howell	2f58eb1057	slack import: Extract process_message_files(). This is mostly an extraction, but it does change the way we calculate `content`. We append the markdown links from ALL files to any content that came in the message itself. Separating this out also allows us to add more test coverage for the extracted code.	2018-10-29 13:24:50 -07:00
Steve Howell	00f822a26a	conversion: Generate attachment_ids with helpers.	2018-10-29 13:24:50 -07:00
Steve Howell	5cb60f7bea	conversions: Use subscriber_map for Slack/Gitter. We now use subscriber_map for building UserMessage rows in Slack/Gitter conversions. This is mostly designed to simplify the code, rather than having to scan the entire subscribers for each message. I am guessing this will improve performance for most conversions. We sort small lists on every message, in order to be deterministic, but the sorting cost is probably more than offset by avoiding the O(N) scans across all subscriptions. Also, it's probably negligible in the grand scheme of things, compared to JSON parsing, file I/O, etc. This commit also fixes some typos with mentioned_users_id -> mentioned_user_ids and cleans up a test a bit as well.	2018-10-29 13:24:50 -07:00
Steve Howell	adb458a5df	refactor: Use build_user_message for Slack/Gitter. We now have all three third party conversions (Gitter/Slack/Hipchat) go through build_user_message(). Hipchat was already using this helper. We also avoid callers having to pass in an id to build_user_message().	2018-10-29 13:24:50 -07:00
Steve Howell	5194701787	conversions: Use NEXT_ID for usermessage_id. This is mostly complicated due to the way that the Slack import passes around tuples of ids to maintain four different parallel sequences.	2018-10-29 13:24:50 -07:00
Steve Howell	9145cd16cf	minor: Change topic for imported hipchat messages.	2018-10-25 14:16:11 -05:00
Steve Howell	78f6e3ac7d	hipchat import: Fix data issues with PMs. We now set the is_private flag on UserMessage rows for PMs and set their subject to ''.	2018-10-25 09:11:36 -05:00
Steve Howell	272b954790	hipchat import: Add option to mask content. Masking content can be useful for testing out conversions where you're dealing with data from customers and want to avoid inadvertently reading their content (while still having semi-realistic messages).	2018-10-25 08:31:01 -05:00
Steve Howell	6e8ae2e3fd	hipchat import: Support private stream subscribers. We now create private stream subscriptions that are based off of `members` and `owner` from room data in `rooms.json`.	2018-10-25 08:31:01 -05:00
Steve Howell	25f532ca2f	refactor: Break up build_subscriptions. Having two smaller functions should make it easier to customize the behavior for each specific use case. The only reason they were ever coupled was to keep ids in sequence, but the recent NEXT_ID changes make that a non-issue now.	2018-10-25 08:31:01 -05:00
Steve Howell	2ed9fbd25b	conversions: Use NEXT_ID for recipient and subscription ids. The NEXT_ID scheme seems pretty robust, so I'm fixing a few easy places.	2018-10-25 08:31:01 -05:00
Steve Howell	50f76e58ce	conversions: Make NEXT_ID a true singleton. We now instantiate NEXT_ID in sequencer.py, which avoids having multiple modules make multiple copies of a sequencer and possibly causing id collisions.	2018-10-25 08:31:01 -05:00
Steve Howell	fe6df1c222	hipchat import: Fix bug w/rogue UserMessage records. This bug was introduced very recently and is an aliasing bug. It caused extra UserMessage rows to be created as we inadvertently updated the underlying subscriber_map sets for multiple messages. This probably mostly affected PMs. It's doubtful the bug ever got out into the field.	2018-10-24 18:44:18 -05:00
Steve Howell	409e2b4134	hipchat import: Support sender_id == 0 use case.	2018-10-23 17:27:37 -05:00
Steve Howell	876a72c467	hipchat import: Extract get_hipchat_sender_id().	2018-10-23 17:27:37 -05:00
Steve Howell	481488a35e	Extract make_subscriber_map(). We extract this function and put it in the shared library `import_util.py`. Also, we make it one time higher up in the call stack, rather than re-building it for every batch of messages. I doubt this was super expensive, but there's no reason to repeatedly execute this.	2018-10-23 17:27:37 -05:00
Steve Howell	737e02a2e6	hipchat import: Fix PM messages. Before this fix, we were creating two copies of every PM Message in zerver_message with only one corresponding UserMessage row. Now we only create one PM Message per message, which we accomplish by making sure we only use imported messages from the sender's history.json file. And then we write UserMessage rows for both participants by making sure to include sender_id in the set of user_ids that feeds into making UserMessage. For the case where you PM yourself, there's just one UserMessage row. It does not appear that we need to support huddles yet.	2018-10-23 17:27:37 -05:00
Steve Howell	bd9e4ef0c8	import: Use pub_date to sort message ids. When we create new ids for message rows, we now sort the new ids by their corresponding pub_date values in the rows. This takes a sizable chunk of memory. This feature only gets turned on if you set sort_by_date to True in realm.json.	2018-10-23 17:27:37 -05:00
Steve Howell	d1ff903534	refactor: Rename build_user -> build_user_profile. This makes greps less confusing.	2018-10-23 17:27:37 -05:00
Steve Howell	ff61c56f47	hipchat import: Add NotificationMessage support.	2018-10-17 12:11:08 -07:00
Tim Abbott	f9b6eeb488	import: Migrate from json to ujson for better perf. We expect to get better memory performance from ujson than json. We also do a better job of closing file handles. This likely fixes #10377.	2018-10-17 12:11:08 -07:00
Tim Abbott	78a15dd715	slack import: Fix obscure email address for Slackbot. Since we know what slackbot is, we don't need to give it a crazy hash as its email address.	2018-10-16 16:33:41 -07:00
Steve Howell	b1dd9a251b	hipchat import: Break messages into smaller batches. Even individual "room" files from hipchat can be large, so we process only 1000 messages at a time within each file, which produces smaller JSON files.	2018-10-15 10:54:23 -07:00
Steve Howell	6650bb2240	minor: Move fix_mentions() closer to caller.	2018-10-15 10:54:23 -07:00
Steve Howell	219ff0f749	hipchat import: Extract UserHandler class.	2018-10-15 10:54:23 -07:00
Steve Howell	2d523fd668	hipchat import: Extract make_user_messages().	2018-10-15 10:54:23 -07:00
Steve Howell	ca0495cbe6	hipchat import: Support attachments.	2018-10-15 10:54:23 -07:00
Steve Howell	d71f3eb1bf	hipchat import: Add some more logging.	2018-10-14 09:29:04 -07:00
Steve Howell	d933779477	hipchat import: Support PrivateUserMessage data. We now import PM data from HipChat.	2018-10-13 16:47:44 -07:00
Steve Howell	f0c3ee0a2e	hipchat import: Write smaller message files. We now write new message files for each new input file + message type we process. This helps the importer not run out of memory later.	2018-10-13 16:47:44 -07:00
Steve Howell	75fc5d41c9	hipchat import: Refactor write_message_data. The goal here is to make it easier to handle other message types by moving the key-specific stuff to the top of the file.	2018-10-13 16:47:44 -07:00
Steve Howell	cc55eb8154	hipchat import: Only process UserMessage rows for now.	2018-10-13 16:47:44 -07:00
Steve Howell	3baac7ddf3	hipchat import: Handle missing emails for guest users.	2018-10-13 16:47:44 -07:00
Steve Howell	8accc60ca7	import_util: Support multiple message ids for attachments.	2018-10-13 16:47:44 -07:00
Steve Howell	23d7b3d2cc	import: De-dup create_converted_data_files helper.	2018-10-13 16:47:41 -07:00
Steve Howell	91905bd66a	import: Add sequencer library. This avoids some tedious code related to making ids in conversion programs.	2018-10-13 16:47:39 -07:00
Steve Howell	9f2aad55b5	hipchat import: Handle users without avatars.	2018-10-12 07:03:25 -04:00
Steve Howell	4b82326376	hipchat import: Support guest users. We simplify the code for is_realm_admin and set is_guest as well. I verified that build_user() is not used by Slack/Gitter, so the extra argument there should be fine. Fixes #10639	2018-10-11 15:28:58 -07:00
Steve Howell	4da664817b	hipchat conversion: Add messages.	2018-10-02 16:55:16 -07:00
Steve Howell	f296d60dad	hipchat conversion: Add emoji support.	2018-10-02 16:55:16 -07:00
Steve Howell	9518b1344a	hipchat conversion: Process avatars. This processes the avatar payloads that we get in users.json.	2018-10-02 16:55:16 -07:00
Steve Howell	c0f15c3860	hipchat conversion: Include deactivated users/streams. We now include deleted/deactivated data from the old system.	2018-10-02 16:55:16 -07:00
Steve Howell	faea26783b	Create convert_hipchat_data. This is a very early version of a tool to convert Hipchat tar files into data files that can be used by the Zulip import process. We include the most fundamental entities--users and streams. Customers who don't care about past messages or customizations could start an instance off of this and start communicating. Of course, there are a lot of things missing in the initial version: * messages! * file assets -- avatars, emojis, attachments * probably lots of other minor things We currently ignore any incoming dates from Hipchat data and just use the current time. This is consistent with other imports. We also don't have any docs yet, although the process will be extremely similar to the "Slack" process: https://zulipchat.com/help/import-from-slack Also, there's a comment at the top of convert_hipchat_data.py that describes how to test this in dev mode. I tested this by following the steps in the comment above. The users just "show up" in /devlogin, so that's nice, and you can send messages to other users. To verify the stream data you have to go into the gear menu and click on "All Streams", then you can subscribe and send a message. Production users will need to get new passwords and re-subscribe to streams. We will probably auto-subscribe all users to public streams.	2018-10-02 16:55:16 -07:00
Rhea Parekh	7822ef38c2	import: Change absolute path of downloaded avatars in records.json to relative path.	2018-09-09 09:18:18 -04:00
Rhea Parekh	f70b9a3eba	import: Move 'build_message' to import_util.	2018-08-19 22:27:13 -07:00
Rhea Parekh	53e9da8e1f	import: Build CustomProfileField, CustomProfileFieldValue and RealmEmoji with model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	d98a5925cb	import: Build Reaction with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	a5bc701181	import: Move 'build_stream' to import_util.	2018-08-19 22:27:13 -07:00
Rhea Parekh	c4f8abbd30	import: Build Message with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	4ea7302e14	import: Add missing fields in UserProfile object. The missing fields are checked by the `full_clean()` method. The datetime field errors are ignored as they are fixed in the `import_realm` script. The fields that are allowed to be null are not included while building this object.	2018-08-19 22:27:13 -07:00
Rhea Parekh	66d34b23ef	import: Build Attachment with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	9617b1fbc5	import: Build Recipient and Subscription with model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	c77763bd8e	import: Move 'build_realm' to import_util.	2018-08-19 22:27:13 -07:00
Tim Abbott	8a22838acf	slack import: Fix computation of owner email for uploaded files. The previous code was just always returning the first user in the organization, due to an incorrect comparison.	2018-08-10 16:20:36 -07:00
Rhea Parekh	3ff339c294	slack import: Add support for uploads in messages through 'files' keyword. It appears that Slack just changed their export format, and now uses this `files` list for user-uploaded files.	2018-08-10 16:20:36 -07:00

262 Commits