zulip

Commit Graph

Author	SHA1	Message	Date
Alex Vandiver	6c17cca208	zilencer: Drop unwanted data that old servers might still send.	2024-06-03 12:35:35 -07:00
Alex Vandiver	09e9c75ec6	analytics: Remove `active_users` and `active_users_log` metrics. Both of these are inaccurate, not currently used anywhere, and have been superseded by the `active_users_audit` metric.	2024-06-03 12:35:35 -07:00
Alex Vandiver	0100440a86	analytics: Make active_users_audit into a RealmCount. With `realm_active_humans` no longer dependent on the per-user rows, there is no reason to preserve them -- any measure of "was a user active" should look directly at the much richer RealmAuditLog. This removes the bulk of the UserCount table, since the remaining rows all require user interaction of some sort to produce rows.	2024-06-03 12:35:35 -07:00
Alex Vandiver	195defb031	analytics: Rewrite realm_active_humans::day query. This makes it no longer dependent on active_users_audit:is_bot:day, which subsequent commits will make a RealmCount, not UserCount, query. This folds the same behaviour of `active_users_audit` directly into the query; however, only running over active users, using the index from the earlier commit, and using the new `DISTINCT ON` formulation make this a fast query compared to `active_users_audit:is_bot:day` + the old `realm_active_humans::day`.	2024-06-03 12:35:35 -07:00
Alex Vandiver	e638ae44a8	analytics: Use a DISTINCT ON rather than a self-join. This produces a query which is more comprehensible, is 2x faster when limited to a realm, and has equivalent speed when performing the full table scan.	2024-06-03 12:35:35 -07:00
Mateusz Mandera	9406bfbc0a	analytics: Store realm disk space used as a CountStat. Fixes #29632. The issue description explains this well: We currently recalculate `currently_used_upload_space_bytes` every file upload, by dint of calling `flush_used_upload_space_cache` on save/delete, and then immediately calling `user_profile.realm.currently_used_upload_space_bytes()` in `notify_attachment_update`. Since this walks the Attachments table, recalculating this can take seconds in large realms. Switch this to using a CountStat, so we don't need to walk significant chunks of the Attachment table when we upload an attachment. This will also give us a historical daily graph of usage.	2024-05-09 10:54:44 -07:00
Mahhheshh	1198785c62	analytics: Improve do_increment_logging_stat performance. The previous implementation using Django's `get_or_create` for `do_increment_logging_stat` involved two separate database queries, potentially leading to race conditions. Use an `ON CONFLICT ... DO UPDATE` (aka "upsert") query, which eliminates race conditions and improves performance. This is mildly complicated due to the different unique indexes across the various tables, and the need for bug-for-bug compatibility with the previous implementation. Fixes #28947. Co-authored-by: Alex Vandiver <alexmv@zulip.com>	2024-05-06 16:34:01 -07:00
Lauryn Menard	cf82d3316b	push-bouncer: Exclude LoggingCountStats with partial data. LoggingCountStats with a daily duration and that are directly stored on the RealmCount table (not via aggregation in process_count_stat), can be in a state, after the hourly cron job to update analytics counts, where the logged value will be live-updated later, because the end time for the stat is still in the future. As these logging counts are designed to be used on the self-hosted installation for either debugging or rate limiting, sending these partial/incomplete counts to the bouncer has low value.	2024-02-26 17:53:12 -08:00
Anders Kaseorg	93198a19ed	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-01-29 10:41:54 -08:00
Alex Vandiver	7233841171	analytics: Move logging config into LOGGING, use "zulip.analytics". This should not reuse (and reconfigure!) the "zulip.management" logger.	2023-11-21 10:49:57 -08:00
Alex Vandiver	efa9bf36eb	analytics: Factor out UserCount / StreamCount common checks.	2023-11-21 10:49:57 -08:00
Mateusz Mandera	48db4bf854	counts: Add new mobile_pushes RemoteRealmCount stats. This requires a bit of complexity to avoid a name collision in COUNT_STATS with the RemoteInstallationCount stats with the same name.	2023-11-10 16:09:11 -08:00
Mateusz Mandera	2512e66c06	counts: Don't allow syncing mobile_pushes_forwarded::day count. `6819ecee92` forgot to add this.	2023-11-10 16:09:11 -08:00
Mateusz Mandera	8a6d5b4997	counts: Add new Add new mobile_pushes_sent::day LoggingCountStat. This is a CountStat for tracking how many mobile notifications the server requested. 1. On a self-hosted server, that means requesting from the push bouncer. 2. On a server that's its own push bouncer, that's just the number directly sent. This number has room for inaccuracy due to incrementing by the number of user devices on a self-hosted server, as it doesn't account for errors that may occur in the GCM/APNs low-level sending codepaths on the bouncer. Also tests that a server that's its own push bouncer correctly increments its mobile_pushes_sent::day CountStat, by basing it on the values returned from the send_apple/android_push_notification functions which tell us the actual number of successfully sent notifications. Since the return values of send_..._push_notification are now used in those codepaths, we need to tweak our mocks in some unrelated tests to set up some return value to avoid errors.	2023-11-10 16:09:11 -08:00
Mateusz Mandera	6819ecee92	zilencer: Add new LoggingCountStat mobile_pushes_forwarded. This one counts actual successful deliveries.	2023-11-01 17:26:10 -07:00
Mateusz Mandera	b7117d51b2	zilencer: Don't allow syncing mobile_pushes_received::day count.	2023-11-01 17:26:10 -07:00
Mateusz Mandera	183c775603	zilencer: Add new mobile_pushes_received::day LoggingCountStat.	2023-11-01 17:26:10 -07:00
Mateusz Mandera	c4fbb6319b	do_increment_logging_stat: Rename zerver_object argument. We are about to add support for having RemoteZulipServer here, which is a zilencer, not zerver, object. So let's rename this argument to something more appropriately general.	2023-11-01 17:26:10 -07:00
Mateusz Mandera	21c94953c8	do_increment_logging_stat: Assert that .frequency is either DAY or HOUR. An assert is appropriate here to ensure that some future additions of other frequencies don't make this if/else logic wrong without explicitly failing.	2023-11-01 17:26:10 -07:00
Anders Kaseorg	a50eb2e809	mypy: Enable new error explicit-override. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-10-12 12:28:41 -07:00
Anders Kaseorg	2665a3ce2b	python: Elide unnecessary list wrappers. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-09-13 12:41:23 -07:00
Alex Vandiver	b94402152d	models: Always search Messages with a realm_id or id limit. Unless there is a limit on `id`, always provide a `realm_id` limit as well. We also notate which index is expected to be used in each query.	2023-09-11 15:00:37 -07:00
Anders Kaseorg	0ce6dcb905	mypy: Upgrade mypy from 1.4.1 to 1.5.1. _default_manager is the same as objects on most of our models. But when a model class is stored in a variable, the type system doesn’t know which model the variable is referring to, so it can’t know that objects even exists (Django doesn’t add it if the user added a custom manager of a different name). django-stubs used to incorrectly assume it exists unconditionally, but it no longer does. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-09-07 17:51:42 -07:00
Anders Kaseorg	e32366638a	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-08-17 17:05:34 -07:00
Anders Kaseorg	710d1f7f51	analytics: Do not reseed the global random generator. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-08-15 17:57:16 -07:00
Anders Kaseorg	ca40e60469	ruff: Enable PERF rules. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-08-07 17:23:55 -07:00
Anders Kaseorg	c2c96eb0cf	python: Annotate type aliases with TypeAlias. This is not strictly necessary but it’s clearer and improves mypy’s error messages. https://docs.python.org/3/library/typing.html#typing.TypeAlias https://mypy.readthedocs.io/en/stable/kinds_of_types.html#type-aliases Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-08-07 10:02:49 -07:00
Anders Kaseorg	2d9b2a2a05	models: Remove type prefixes from __str__ values. The Django convention is for __repr__ to include the type and __str__ to omit it. In fact its default __repr__ implementation for models automatically adds a type prefix to __str__, which has resulted in the type being duplicated: >>> UserProfile.objects.first() <UserProfile: <UserProfile: emailgateway@zulip.com <Realm: zulipinternal 1>>> Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-03-08 22:56:55 -08:00
Anders Kaseorg	df001db1a9	black: Reformat with Black 23. Black 23 enforces some slightly more specific rules about empty line counts and redundant parenthesis removal, but the result is still compatible with Black 22. (This does not actually upgrade our Python environment to Black 23 yet.) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-02-02 10:40:13 -08:00
Anders Kaseorg	73374996a5	analytics: Add Composable type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-07-30 06:46:34 -07:00
Anders Kaseorg	cc30ed8ec7	actions: Delete zerver.lib.actions. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-04-14 17:14:38 -07:00
Anders Kaseorg	21cd1c10b3	docs: Add missing space in “time zone”. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-24 14:05:12 -08:00
Anders Kaseorg	1629d6bfb3	python: Reformat with Black 22 (stable). Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-18 18:03:13 -08:00
PIG208	7386918539	typing: Use accurate type hints for dictionaries. This fixes the mypy errors related to dictionaries with django-stubs.	2021-08-20 06:02:28 -07:00
Anders Kaseorg	09564e95ac	mypy: Add types-psycopg2. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-08-09 20:32:19 -07:00
PIG208	caaa424ef5	typing: Use assertions for broader types. For types like `Union[Realm, UserProfile, Stream]` and `Union[AnonymousUser, AbstractBaseUser]`, we need assertions to tell mypy which type we would be expecting.	2021-07-27 11:44:54 -07:00
PIG208	df1bf9e352	analytics: Fix type annotation for sql_data_collector.	2021-07-26 14:46:45 -07:00
Anders Kaseorg	fb3ddf50d4	python: Fix mypy no_implicit_reexport errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-07-16 14:02:31 -07:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	5028c081cb	python: Merge concatenated string literals that Black would uglify. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Vishnu KS	9d5a1271d4	analytics: Make last_successful_fill handle FillState.STARTED case properly. Subtracting an hour from end_time is correct only for CountStats with hourly frequency. For daily frequency we should subtract a day instead.	2020-12-22 16:44:31 -08:00
Vishnu KS	235a347639	analytics: Move last_successful_fill to CountStat. This is a prep commit. Currenty we only pass CountStat.property to last_successful_fill function. But it needs access to CountStat.time_increment as well. We can pass the entire CountStat object to the function as a workaround. But making last_successful_fill a property of CountStat seems to be much more cleaner.	2020-12-22 16:44:31 -08:00
Vishnu KS	189e9a2759	analytics: Create time_increment property in CountStat.	2020-12-22 16:44:31 -08:00
Anders Kaseorg	72d6ff3c3b	docs: Fix more capitalization issues. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-23 11:46:55 -07:00
Anders Kaseorg	ab120a03bc	python: Replace unnecessary intermediate lists with generators. Mostly suggested by the flake8-comprehension plugin. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-02 11:15:41 -07:00
Vishnu KS	4dc83a139c	counts: Create 7day_actives::day counstat.	2020-08-10 17:22:19 -07:00
Anders Kaseorg	5dc9b55c43	python: Manually convert more percent-formatting to f-strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
arpit551	c4b5d09283	analytics: Add LoggingCount for messages read stats. Whenever we use API queries to mark messages as read we now increment two new LoggingCount stats, messages_read::hour and messages_read_interactions::hour. We add an early return in do_increment_logging_stat function if there are no changes (increment is 0), as an optimization to avoid unnecessary database queries. We also log messages_read_interactions::hour Logging stat as the number of API queries to mark messages as read. We don't include tests for the case where do_update_pointer is called because do_update_pointer will most likely be removed from the codebase in the near future.	2020-06-14 21:15:27 -07:00
Anders Kaseorg	0d6c771baf	python: Guard against default value mutation with read-only types. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-13 15:31:27 -07:00

1 2 3 4

169 Commits