zulip

Commit Graph

Author	SHA1	Message	Date
Alex Vandiver	e125ad823d	exports: Add a separate bucket for realm exports. This allows finer-grained access control and auditing. The links generated also expire after one week, and the suggested configuration is that the underlying data does as well. Co-authored-by: Prakhar Pratyush <prakhar@zulip.com>	2024-09-20 15:43:49 -07:00
Alex Vandiver	c1e8ecd08f	uploads: Cache boto client in the module and be writable. The `get_signed_upload_url` code is called for every S3 file serve request, and is thus in the hot path. The boto3 client caching optimization is thus potentially useful as a performance optimization.	2024-09-20 15:43:49 -07:00
Alex Vandiver	1a7b3ef7ed	upload: Use get_export_tarball_url in upload_export_tarball.	2024-09-20 15:43:49 -07:00
Alex Vandiver	4cf835d9dd	upload: Remove common cache from get_export_tarball_url. This is not called in the hot path like get_avatar_url is.	2024-09-20 15:43:49 -07:00
Alex Vandiver	a5bf452202	upload: Realm is not Optional in upload_export_tarball. `af4eb8c0d5` marked the base class and local backend as non-Optional, but left the S3 backend as Optional for some reason. Remove it.	2024-09-20 15:43:49 -07:00
Alex Vandiver	9a1f78db22	thumbnail: Support checking for images from streaming sources. We may not always have trivial access to all of the bytes of the uploaded file -- for instance, if the file was uploaded previously, or by some other process. Downloading the entire image in order to check its headers is an inefficient use of time and bandwidth. Adjust `maybe_thumbnail` and dependencies to potentially take a `pyvips.Source` which supports streaming data from S3 or disk. This allows making the ImageAttachment row, if deemed appropriate, based on only a few KB of data, and not the entire image.	2024-09-17 12:51:30 -07:00
Alex Vandiver	903bfb31e6	upload: Provide the frontend with the less-modified filename.	2024-09-09 12:40:17 -07:00
Alex Vandiver	b4764f49df	upload: Download files with their original names. Fixes: #29491.	2024-09-09 12:40:17 -07:00
Alex Vandiver	ca72e756eb	upload: Rename "upload_image_to_s3"; it is not only for images.	2024-09-09 12:40:17 -07:00
Anders Kaseorg	91ade25ba3	python: Simplify with str.removeprefix, str.removesuffix. These are available in Python ≥ 3.9. https://docs.python.org/3/library/stdtypes.html#str.removeprefix Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-09-03 12:30:16 -07:00
Alex Vandiver	4351cc5914	thumbnail: Move get_image_thumbnail_path and split_thumbnail_path.	2024-07-18 13:50:28 -07:00
Alex Vandiver	2e38f426f4	upload: Generate thumbnails when images are uploaded. A new table is created to track which path_id attachments are images, and for those their metadata, and which thumbnails have been created. Using path_id as the effective primary key lets us ignore if the attachment is archived or not, saving some foreign key messes. A new worker is added to observe events when rows are added to this table, and to generate and store thumbnails for those images in differing sizes and formats.	2024-07-16 13:22:15 -07:00
Alex Vandiver	229dcd0218	upload: Clean up empty directories in local storage.	2024-07-16 13:22:15 -07:00
Anders Kaseorg	0fa5e7f629	ruff: Fix UP035 Import from `collections.abc`, `typing` instead. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Anders Kaseorg	531b34cb4c	ruff: Fix UP007 Use `X | Y` for type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Anders Kaseorg	e08a24e47f	ruff: Fix UP006 Use `list` instead of `List` for type annotation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Alex Vandiver	0385e5bab9	emoji: Store in S3 with a long public cache-control.	2024-07-12 13:26:47 -07:00
Alex Vandiver	262689da76	thumbnail: Fix MAX_EMOJI_GIF_FILE_SIZE_BYTES check to be post-resize. This check was intended to check the post-resized image size, not the pre-resized image.	2024-07-12 13:26:47 -07:00
Alex Vandiver	54f2fabac0	thumbnail: Still emoji are always pngs.	2024-07-12 13:26:47 -07:00
Alex Vandiver	f6b99171ce	emoji: Derive the file extension from a limited set of content-types. We thumbnail and serve emoji with the same format as they were uploaded. However, we preserved the original extension, which might mismatch with the provided content-type. Limit the content-type to a subset which is both (a) an image format we can thumbnail, and (b) a media format which is widely-enough supported that we are willing to provide it to all browsers. This prevents uploading a `.tiff` emoji, for instance. Based on this limited content-type, we then reverse to find the reasonable extension to use when storing it. This is particularly important because the local file storage uses the file extension to choose what content-type to re-serve the emoji as. This does nothing for existing emoji, which may have odd or missing file extensions.	2024-07-12 13:26:47 -07:00
Alex Vandiver	62a0611ddb	emoji: Pass down content-type, rather than guessing from extension.	2024-07-12 13:26:47 -07:00
Alex Vandiver	4bc563128e	thumbnail: Use a consistent set of supported image types.	2024-07-11 07:31:39 -07:00
Alex Vandiver	ff90e5355f	upload: Pass down content-type of realm icon/logo to backend. This saves having to try to re-derive it from the file extension, which may be ".original" in some cases.	2024-07-11 07:31:39 -07:00
Alex Vandiver	79f858b4b8	upload: Pass bytes to create_attachment. This will be used to analyze the bytes for image metadata.	2024-07-07 14:40:07 -07:00
Alex Vandiver	f97a30f240	upload: Reorder arguments to parallel upload_message_attachment.	2024-07-07 14:40:07 -07:00
Alex Vandiver	f52a93bc14	upload: Stop requiring callers pass in the file size. This can be calculated because we have the contents.	2024-07-07 14:40:07 -07:00
Alex Vandiver	58a9fe9af1	upload: Drop unused parameters to upload_message_attachment.	2024-07-07 14:40:07 -07:00
Alex Vandiver	0a296b2a6e	upload: Start storing content-type for new uploads.	2024-07-07 14:40:07 -07:00
Alex Vandiver	e29a455b2d	avatars: Encode version into the filename. Hash the salt, user-id, and now avatar version into the filename. This allows the URL contents to be immutable, and thus to be marked as immutable and cacheable. Since avatars are served unauthenticated, hashing with a server-side salt makes the current and past avatars not enumerable. This requires plumbing the current (or future) avatar version through various parts of the upload process. Since this already requires a full migration of current avatars, also take the opportunity to fix the missing `.png` on S3 uploads (#12852). We switch from SHA-1 to SHA-256, but truncate it such that avatar URL data does not substantially increase in size. Fixes: #12852.	2024-07-07 14:40:07 -07:00
Alex Vandiver	feca9939bb	s3: Support setting a cache-control on uploads.	2024-07-07 14:40:07 -07:00
Alex Vandiver	6258817bfd	s3: Stop setting empty Content-Disposition header.	2024-07-07 14:40:07 -07:00
Sahil Batra	5ef14c3a8e	users: Fix uploading user avatars. Due to recent refactoring in `9fb03cb2c7`, a user could not upload an avatar if the server used the local upload backend and there was already an avatar file for that user. This commit fixes it to check whether a file exists only when importing, and not when the user is actually trying to change the avatar. Fixes #30676.	2024-07-02 13:26:21 -07:00
Alex Vandiver	2eaf098c5d	upload: Content-type is always defined.	2024-06-26 16:43:11 -07:00
Alex Vandiver	17fb23746f	upload: Move methods into zerver.lib.upload from .base.	2024-06-26 16:43:11 -07:00
Alex Vandiver	c826d80061	upload: Factor out common code into zerver.lib.upload.	2024-06-26 16:43:11 -07:00
Alex Vandiver	5cd10ce51d	s3: Allow setting a CloudFront URL prefix for avatar and emoji images.	2024-06-26 16:43:11 -07:00
Alex Vandiver	08b24484d1	upload: Remove redundant acting_user_profile argument. This argument, effectively added in `9eb47f108c`, was never actually used.	2024-06-26 16:43:11 -07:00
Alex Vandiver	fb929ca218	thumbnailing: Remove unnecessary third return value from resize_emoji.	2024-06-26 16:43:09 -07:00
Alex Vandiver	b14a33c659	thumbnailing: Switch to libvips, from PIL/pillow. This is done in as much of a drop-in fashion as possible. Note that libvips does not support animated PNGs[^1], and as such this conversion removes support for them as emoji; however, libvips includes support for webp images, which future commits will take advantage of. This removes the MAX_EMOJI_GIF_SIZE limit, since that existed to work around bugs in Pillow. MAX_EMOJI_GIF_FILE_SIZE_BYTES is fixed to actually be 128KiB (not 128MiB, as it actually was), and is counted _after_ resizing, since the point is to limit the amount of data transfer to clients. [^1]: https://github.com/libvips/libvips/discussions/2000	2024-06-26 16:42:57 -07:00
Alex Vandiver	9fb03cb2c7	upload: Factor out common avatar logic.	2024-06-26 16:38:01 -07:00
Alex Vandiver	d92993c972	upload: Factor out common emoji logic.	2024-06-26 16:38:01 -07:00
Alex Vandiver	0153d6dbcd	thumbnailing: Move resizing functions into zerver.lib.thumbnail.	2024-06-20 23:06:08 -04:00
Anders Kaseorg	5f053c4aa7	upload: Serve more cross-browser audio and image formats as inline. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-06-20 15:29:20 -07:00
Anders Kaseorg	fb4ad1422e	mime_types: Add audio and image types missing from Python library. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-06-20 15:29:20 -07:00
Prakhar Pratyush	508c5611d1	claim_attachment: Remove the stale 'user_profile' parameter. This commit removes the unused 'user_profile' parameter of the 'claim_attachment' function.	2024-05-21 09:24:43 -07:00
Vector73	8ab526a25a	models: Replace realm.uri with realm.url. In #23380, we are changing all occurrences of uri with url in order to follow the latest URL standard. Previous PRs #25038 and #25045 have replaced the occurrences of uri that have no direct relation with realm. This commit changes just the model property, which has no API compatibility concerns.	2024-05-08 11:12:43 -07:00
Alex Vandiver	043d3127eb	upload: Only load S3 backend (and thus boto3) if necessary. Because loading boto3 is so slow, this saves a significant amount of time (0.3s or so) in process startup on servers which are not using the S3 file storage backend.	2024-04-15 13:12:51 -07:00
Anders Kaseorg	93198a19ed	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-01-29 10:41:54 -08:00
Alex Vandiver	75d6f35069	s3: Add a setting for S3 addressing style. This controls if boto3 attempts to use `https://bucketname.endpointname/` or `https://endpointname/bucket/` as its prefix. See https://botocore.amazonaws.com/v1/documentation/api/latest/reference/config.html Fixes: #28424.	2024-01-05 11:12:18 -08:00
Alex Vandiver	3aea67a8ed	s3: Only use get_bucket to get to boto3 clients and resources. boto3 has two different modalities of making API calls -- through resources, and through clients. Resources are a higher-level abstraction, and thus more generally useful, but some APIs are only accessible through clients. It is possible to get to a client object from a resource, but not vice versa. Use `get_bucket(...).meta.client` when we need direct access to the client object for more complex API calls; this lets all of the configuration for how to access S3 sit within `get_bucket`. Client objects are not bound to only one bucket, but we get to them based on the bucket we will be interacting with, for clarity. We removed the cached session object, as it serves no real purpose.	2024-01-05 11:12:18 -08:00
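
Commit `e29a455b2d` in the log above describes hashing the salt, user-id, and avatar version into the avatar filename with a truncated SHA-256, so that each URL's contents are immutable and past avatars are not enumerable. A minimal sketch of that idea follows; it is not the code from that commit. The function name, the `:` separator, the 40-character truncation, and the parameter names are all assumptions made for illustration.

```python
import hashlib


def versioned_avatar_name(salt: str, user_id: int, version: int, length: int = 40) -> str:
    """Hypothetical helper: derive an avatar filename from salt, user id, and version.

    Assumptions (not taken from the commit): fields are joined with ":",
    and the hex digest is cut to `length` characters.
    """
    payload = f"{salt}:{user_id}:{version}".encode()
    # SHA-256 yields 64 hex characters; truncating keeps the filename short.
    digest = hashlib.sha256(payload).hexdigest()
    return digest[:length] + ".png"


# Bumping the version yields a different name, so the old URL never changes content.
old_name = versioned_avatar_name("server-side-salt", 42, 1)
new_name = versioned_avatar_name("server-side-salt", 42, 2)
```

Because the version is part of the hashed input, a changed avatar gets a new filename rather than overwriting the old one, which is what allows a long, immutable cache lifetime on each URL.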

84 Commits