zulip

Commit Graph

Author	SHA1	Message	Date
Steve Howell	876a72c467	hipchat import: Extract get_hipchat_sender_id().	2018-10-23 17:27:37 -05:00
Steve Howell	481488a35e	Extract make_subscriber_map(). We extract this function and put it in the shared library `import_util.py`. Also, we now build the map once, higher up in the call stack, rather than rebuilding it for every batch of messages. I doubt this was super expensive, but there's no reason to repeatedly execute this.	2018-10-23 17:27:37 -05:00
Steve Howell	737e02a2e6	hipchat import: Fix PM messages. Before this fix, we were creating two copies of every PM Message in zerver_message with only one corresponding UserMessage row. Now we only create one PM Message per message, which we accomplish by making sure we only use imported messages from the sender's history.json file. And then we write UserMessage rows for both participants by making sure to include sender_id in the set of user_ids that feeds into making UserMessage. For the case where you PM yourself, there's just one UserMessage row. It does not appear that we need to support huddles yet.	2018-10-23 17:27:37 -05:00
Steve Howell	bd9e4ef0c8	import: Use pub_date to sort message ids. When we create new ids for message rows, we now sort the new ids by their corresponding pub_date values in the rows. This takes a sizable chunk of memory. This feature only gets turned on if you set sort_by_date to True in realm.json.	2018-10-23 17:27:37 -05:00
Steve Howell	d1ff903534	refactor: Rename build_user -> build_user_profile. This makes greps less confusing.	2018-10-23 17:27:37 -05:00
Steve Howell	ff61c56f47	hipchat import: Add NotificationMessage support.	2018-10-17 12:11:08 -07:00
Tim Abbott	f9b6eeb488	import: Migrate from json to ujson for better perf. We expect to get better memory performance from ujson than json. We also do a better job of closing file handles. This likely fixes #10377.	2018-10-17 12:11:08 -07:00
Tim Abbott	78a15dd715	slack import: Fix obscure email address for Slackbot. Since we know what slackbot is, we don't need to give it a crazy hash as its email address.	2018-10-16 16:33:41 -07:00
Steve Howell	b1dd9a251b	hipchat import: Break messages into smaller batches. Even individual "room" files from hipchat can be large, so we process only 1000 messages at a time within each file, which produces smaller JSON files.	2018-10-15 10:54:23 -07:00
Steve Howell	6650bb2240	minor: Move fix_mentions() closer to caller.	2018-10-15 10:54:23 -07:00
Steve Howell	219ff0f749	hipchat import: Extract UserHandler class.	2018-10-15 10:54:23 -07:00
Steve Howell	2d523fd668	hipchat import: Extract make_user_messages().	2018-10-15 10:54:23 -07:00
Steve Howell	ca0495cbe6	hipchat import: Support attachments.	2018-10-15 10:54:23 -07:00
Steve Howell	d71f3eb1bf	hipchat import: Add some more logging.	2018-10-14 09:29:04 -07:00
Steve Howell	d933779477	hipchat import: Support PrivateUserMessage data. We now import PM data from HipChat.	2018-10-13 16:47:44 -07:00
Steve Howell	f0c3ee0a2e	hipchat import: Write smaller message files. We now write new message files for each new input file + message type we process. This helps the importer not run out of memory later.	2018-10-13 16:47:44 -07:00
Steve Howell	75fc5d41c9	hipchat import: Refactor write_message_data. The goal here is to make it easier to handle other message types by moving the key-specific stuff to the top of the file.	2018-10-13 16:47:44 -07:00
Steve Howell	cc55eb8154	hipchat import: Only process UserMessage rows for now.	2018-10-13 16:47:44 -07:00
Steve Howell	3baac7ddf3	hipchat import: Handle missing emails for guest users.	2018-10-13 16:47:44 -07:00
Steve Howell	8accc60ca7	import_util: Support multiple message ids for attachments.	2018-10-13 16:47:44 -07:00
Steve Howell	23d7b3d2cc	import: De-dup create_converted_data_files helper.	2018-10-13 16:47:41 -07:00
Steve Howell	91905bd66a	import: Add sequencer library. This avoids some tedious code related to making ids in conversion programs.	2018-10-13 16:47:39 -07:00
Steve Howell	9f2aad55b5	hipchat import: Handle users without avatars.	2018-10-12 07:03:25 -04:00
Steve Howell	4b82326376	hipchat import: Support guest users. We simplify the code for is_realm_admin and set is_guest as well. I verified that build_user() is not used by Slack/Gitter, so the extra argument there should be fine. Fixes #10639	2018-10-11 15:28:58 -07:00
Steve Howell	4da664817b	hipchat conversion: Add messages.	2018-10-02 16:55:16 -07:00
Steve Howell	f296d60dad	hipchat conversion: Add emoji support.	2018-10-02 16:55:16 -07:00
Steve Howell	9518b1344a	hipchat conversion: Process avatars. This processes the avatar payloads that we get in users.json.	2018-10-02 16:55:16 -07:00
Steve Howell	c0f15c3860	hipchat conversion: Include deactivated users/streams. We now include deleted/deactivated data from the old system.	2018-10-02 16:55:16 -07:00
Steve Howell	faea26783b	Create convert_hipchat_data. This is a very early version of a tool to convert Hipchat tar files into data files that can be used by the Zulip import process. We include the most fundamental entities--users and streams. Customers who don't care about past messages or customizations could start an instance off of this and start communicating. Of course, there are a lot of things missing in the initial version: * messages! * file assets -- avatars, emojis, attachments * probably lots of other minor things We currently ignore any incoming dates from Hipchat data and just use the current time. This is consistent with other imports. We also don't have any docs yet, although the process will be extremely similar to the "Slack" process: https://zulipchat.com/help/import-from-slack Also, there's a comment at the top of convert_hipchat_data.py that describes how to test this in dev mode. I tested this by following the steps in the comment above. The users just "show up" in /devlogin, so that's nice, and you can send messages to other users. To verify the stream data you have to go into the gear menu and click on "All Streams", then you can subscribe and send a message. Production users will need to get new passwords and re-subscribe to streams. We will probably auto-subscribe all users to public streams.	2018-10-02 16:55:16 -07:00
Rhea Parekh	7822ef38c2	import: Change absolute path of downloaded avatars in records.json to relative path.	2018-09-09 09:18:18 -04:00
Rhea Parekh	f70b9a3eba	import: Move 'build_message' to import_util.	2018-08-19 22:27:13 -07:00
Rhea Parekh	53e9da8e1f	import: Build CustomProfileField, CustomProfileFieldValue and RealmEmoji with model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	d98a5925cb	import: Build Reaction with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	a5bc701181	import: Move 'build_stream' to import_util.	2018-08-19 22:27:13 -07:00
Rhea Parekh	c4f8abbd30	import: Build Message with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	4ea7302e14	import: Add missing fields in UserProfile object. The missing fields are checked by the `full_clean()` method. The datetime field errors are ignored as they are fixed in the `import_realm` script. Fields that are allowed to be null are not included while building this object.	2018-08-19 22:27:13 -07:00
Rhea Parekh	66d34b23ef	import: Build Attachment with the model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	9617b1fbc5	import: Build Recipient and Subscription with model class.	2018-08-19 22:27:13 -07:00
Rhea Parekh	c77763bd8e	import: Move 'build_realm' to import_util.	2018-08-19 22:27:13 -07:00
Tim Abbott	8a22838acf	slack import: Fix computation of owner email for uploaded files. The previous code was just always returning the first user in the organization, due to an incorrect comparison.	2018-08-10 16:20:36 -07:00
Rhea Parekh	3ff339c294	slack import: Add support for uploads in messages through 'files' keyword. It appears that Slack just changed their export format, and now uses this `files` list for user-uploaded files.	2018-08-10 16:20:36 -07:00
Rhea Parekh	20bca1409f	import: Set emoji records 'last_modified' value in 'import_uploads_s3'. The 'last_modified' value in emoji records is needed for uploading the file to the S3 backend. We set the same in the function 'import_uploads_s3'. We also have to remove the keyword 'last_modified' while building the RealmEmoji dict, as it is not a field which exists in RealmEmoji objects.	2018-08-10 16:20:36 -07:00
Tim Abbott	cf8a0ae819	slack import: Set a last_modified timestamp for custom emoji.	2018-08-10 09:27:43 -07:00
Rhea Parekh	18a4904437	import: Move 'build_attachment' to import_util.	2018-08-07 16:45:42 -07:00
Rhea Parekh	b6ccc0bc52	import: Move 'build_defaultstream' to import_util.	2018-08-07 16:45:42 -07:00
Rhea Parekh	bee3964f14	import: Move 'build_usermessages' to import_util.	2018-08-07 16:45:42 -07:00
Rhea Parekh	eefe7cccd2	import: Move 'process_uploads' and 'process_emojis' to import_util.	2018-08-07 16:45:42 -07:00
Rhea Parekh	30cc7354eb	import: Move 'process_avatars' to import_util.	2018-08-07 16:45:40 -07:00
Rhea Parekh	87cc1a6280	import: Move 'build_subscription' and 'build_recipient' to import_util.	2018-08-07 16:35:56 -07:00
Rhea Parekh	a516f80646	import: Move 'build_avatar' to import_util.	2018-08-07 16:35:56 -07:00
Rhea Parekh	1117455a90	import: Move 'ZerverFieldsT' and 'build_zerver_realm' to import_util.	2018-08-07 16:35:56 -07:00
Rhea Parekh	ee37866687	import: Add gitter import file in zerver/data_import directory.	2018-08-01 11:52:14 -07:00
Rhea Parekh	b8e1e8b31d	import: Add slack import files in zerver/data_import directory.	2018-08-01 11:52:14 -07:00
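
Two of the commits listed above describe small mechanisms that may be easier to follow in code: b1dd9a251b processes only 1000 messages at a time, and bd9e4ef0c8 sorts newly assigned message ids by the rows' pub_date values. The following is a minimal sketch of both ideas, not the code from those commits. The function names `batch_messages` and `assign_ids_by_pub_date`, the row layout (dicts with `id` and `pub_date` keys), and the `first_new_id` parameter are all hypothetical and chosen only for illustration.

```python
from typing import Any, Dict, Iterator, List

Row = Dict[str, Any]


def batch_messages(messages: List[Row], chunk_size: int = 1000) -> Iterator[List[Row]]:
    """Yield consecutive slices of at most chunk_size messages.

    Hypothetical helper: the 1000-message figure comes from the commit
    message; the name and signature are assumptions.
    """
    for start in range(0, len(messages), chunk_size):
        yield messages[start:start + chunk_size]


def assign_ids_by_pub_date(rows: List[Row], first_new_id: int) -> Dict[int, int]:
    """Map each old row id to a new id so that new ids ascend with pub_date.

    Hypothetical helper: rows are assumed to be dicts with 'id' and
    'pub_date' keys. Ties on pub_date are broken by the old id so the
    result is deterministic. Sorting needs every row in memory at once,
    which is consistent with the commit's note about memory use.
    """
    ordered = sorted(rows, key=lambda row: (row["pub_date"], row["id"]))
    return {row["id"]: first_new_id + offset for offset, row in enumerate(ordered)}
```

For example, with three rows whose old ids are 10, 11 and 12 and whose pub_date values are 300, 100 and 200, calling `assign_ids_by_pub_date(rows, 500)` maps 11 to 500, 12 to 501 and 10 to 502, so the earliest message receives the lowest new id.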

353 Commits