zulip

Commit Graph

Author	SHA1	Message	Date
Mateusz Mandera	634015411a	update_analytics_count: Use a correct lock mechanism. Adds a re-usable lockfile_nonblocking helper to context_managers. Relying on naive `os.mkdir` is not enough especially now that the successful operation of this command is necessary for push notifications to work for many servers. We can't use `lockfile` context manager from `zerver.lib.context_managers`, because we want the custom behavior of failing if the lock can't be acquired, instead of waiting. That's because if an instance of this gets stuck, we don't want to start queueing up more processes waiting forever whenever the cronjob runs again and fail->exit is preferrable instead.	2024-03-05 10:21:14 -08:00
Prakhar Pratyush	c1daabd3c0	remote_server: Rename to 'send_server_data_to_push_bouncer'. This commit renames 'send_analytics_to_push_bouncer' to 'send_server_data_to_push_bouncer'.	2023-12-11 14:07:39 -08:00
Prakhar Pratyush	d763fae9d0	remote_server: Eliminate separate realms-only code path. Given that most of the use cases for realms-only code path would really like to upload audit logs too, and the others would likely produce a better user experience if they upoaded audit logs, we should just have a single main code path here i.e. 'send_analytics_to_push_bouncer'. We still only upload usage statistics according to documented option, and only from the analytics cron job. The error handling takes place in 'send_analytics_to_push_bouncer' itself.	2023-12-11 14:07:39 -08:00
Tim Abbott	7db15176f3	push bouncer: Submit basic metadata unconditionally. These metadata are essentially all publicily available anyway, and making uploading them unconditional will simplify some things. The documentation is not quite accurate in that it claims the server will upload some metadata that is not actually uploaded yet (but will by soon). This seems harmless.	2023-11-29 14:45:53 -08:00
Alex Vandiver	b363999d19	analytics: Slew record reporting by up to 10 minutes. This reduces the giant load spike at 5 minute past the hour, when all remote servers currently attempt to submit their records. We do not wish to slew over a full hour, because we want to ensure that we do not hold the lock when the next hour's analytics runs. It is also not necessary to have that much variation; 10 minutes is picked as an arbitrary "long enough" time to spread requests over.	2023-11-21 10:49:57 -08:00
Mateusz Mandera	48db4bf854	counts: Add new mobile_pushes RemoteRealmCount stats. This requires a bit of complexity to avoid a name collision in COUNT_STATS with the RemoteInstallationCount stats with the same name.	2023-11-10 16:09:11 -08:00
Mateusz Mandera	986f2fd962	send_analytics_to_remote_server: Rename to ..._to_push_bouncer. The former name is kind of misleading - this function is for the remote server to send analytics to the push bouncer. Under our usual terminology, a "remote server" is a self-hosted Zulip server. So data is sent FROM not TO a remote server.	2023-10-25 11:09:49 -07:00
Anders Kaseorg	a50eb2e809	mypy: Enable new error explicit-override. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2023-10-12 12:28:41 -07:00
Anders Kaseorg	21cd1c10b3	docs: Add missing space in “time zone”. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2022-02-24 14:05:12 -08:00
PIG208	66b1a4e7ca	backend: Add None-checks with assertions and if-elses. This fixes a batch of mypy errors of the following format: 'Item "None" of "Optional[Something]" has no attribute "abc"'	2021-07-24 15:00:21 -07:00
Anders Kaseorg	6e4c3e41dc	python: Normalize quotes with Black. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	11741543da	python: Reformat with Black, except quotes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 13:11:19 -08:00
Anders Kaseorg	9773c0f1a8	python: Fix string literal concatenation mistakes. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2021-02-12 08:02:51 -05:00
Anders Kaseorg	a50fae89e2	python: Elide type=str from argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Anders Kaseorg	b4597a8ca8	python: Elide default for store_{true,false} argparse arguments. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-09-03 16:17:14 -07:00
Tim Abbott	c05315ace2	update_analytics_counts: Fix warning output.	2020-08-28 14:24:48 -07:00
Anders Kaseorg	5dc9b55c43	python: Manually convert more percent-formatting to f-strings. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	4b6d2cf25f	logging: Pass more format arguments to logging. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-14 23:27:22 -07:00
Anders Kaseorg	365fe0b3d5	python: Sort imports with isort. Fixes #2665. Regenerated by tabbott with `lint --fix` after a rebase and change in parameters. Note from tabbott: In a few cases, this converts technical debt in the form of unsorted imports into different technical debt in the form of our largest files having very long, ugly import sequences at the start. I expect this change will increase pressure for us to split those files, which isn't a bad thing. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-11 16:45:32 -07:00
Anders Kaseorg	67e7a3631d	python: Convert percent formatting to Python 3.6 f-strings. Generated by pyupgrade --py36-plus. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-10 15:02:09 -07:00
Anders Kaseorg	1f565a9f41	timezone: Use standard library datetime.timezone.utc consistently. datetime.timezone is available in Python ≥ 3.2. This also lets us remove a pytz dependency from the PostgreSQL scripts. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-06-05 09:34:17 -07:00
Tim Abbott	8e7ce7cc79	python: Sort migrations/management command imports with isort. This is a preparatory commit for using isort for sorting all of our imports, merging changes to files where we can easily review the changes as something we're happy with. These are also files with relatively little active development, which means we don't expect much merge conflict risk from these changes.	2020-01-14 13:07:47 -08:00
Tim Abbott	216d2ec1bf	production: Add optional support for submitting usage statistics. See documentation for details.	2019-02-26 17:35:10 -08:00
rht	d1689b5884	analytics: Use python 3 syntax for typing.	2017-11-17 13:16:49 -08:00
Tim Abbott	2b43a0302a	python: Sort imports in smaller apps.	2017-11-15 15:55:49 -08:00
rht	51c1a6dfc9	analytics: Text-wrap long lines exceeding 110. License: Apache-2.0 Signed-off-by: rht <rhtbot@protonmail.com>	2017-11-10 16:22:00 -08:00
rht	fa09076ec9	analytics/management: Remove unused imports (F401).	2017-11-07 16:37:09 -08:00
Rishi Gupta	e31758c257	analytics: Do not run update_analytics_counts if there are no realms. Having no realms was not possible before, but will be once system bots are no longer on a special system realm.	2017-10-05 11:22:06 -07:00
rht	4494112862	analytics: Remove absolute_import.	2017-09-27 20:20:07 -07:00
rht	74f8a527e4	analytics: Remove print_function.	2017-09-27 18:05:45 -07:00
Rishi Gupta	92978d6fb2	analytics: Fix --utc argument in update_analytics_counts.py.	2017-04-28 16:15:07 -07:00
Rishi Gupta	73ae2abd4e	analytics: Add --verbose option to update_analytics_counts.py.	2017-04-28 16:15:07 -07:00
Rishi Gupta	dfbeab73b5	analytics: Change update_analytics_counts to only use hour boundaries. Fixes a recent regression where analytics were not being run on hour boundaries. Includes a migration that dumps all the analytics data.	2017-04-28 16:15:07 -07:00
hackerkid	b2504084ab	Replace timezone.now with timezone_now.	2017-04-16 12:28:56 -07:00
hackerkid	55c3d12078	Replace timezone.utc with timezone_utc.	2017-04-16 12:28:56 -07:00
Raghav Jajodia	a3a03bd6a5	mypy: Added Dict, List and Set imports. Fixed mypy errors associated with the upgrade.	2017-03-04 14:33:44 -08:00
Rishi Gupta	4dc791f393	Clean up timestamps.py and add a test.	2017-03-01 23:03:56 -08:00
Tim Abbott	e8b0880320	analytics: Log updates to analytics counts.	2017-02-01 17:02:46 -08:00
umkay	76f3d02590	analytics: Add cron job to run analytics jobs. This adds a cron job to update the Zulip analytics counts, complete with locking etc. Substantially tweaked by tabbott.	2017-02-01 17:02:46 -08:00
Rishi Gupta	552d626ef2	analytics: Fix FillState.last_modified not being updated. We were updating FillState with FillState.objects.filter(..).update(..), which does not update the last_modified field (which has auto_now=True). The correct incantation is the save() method of the actual FillState object.	2017-01-08 23:36:34 -08:00
Rishi Gupta	655ee51e35	analytics: Add table to keep track of fill state. Adds two simplifying assumptions to how we process analytics stats: * Sets the atomic unit of work to: a stat processed at an hour boundary. * For any given stat, only allows these atomic units of work to be processed in chronological order. Adds a table FillState that, for each stat, keeps track of the last unit of work that was processed.	2016-10-14 10:18:37 -07:00
Tim Abbott	273c17a072	update_analytics_counts: Add missing future imports.	2016-10-05 17:13:46 -07:00
Tim Abbott	3973ae5dbb	update_analytics_counts: Fix buggy argument parsing.	2016-10-04 20:43:19 -07:00
umkay	d260a22637	Add a new statistics/analytics framework. This is a first pass at building a framework for collecting various stats about realms, users, streams, etc. Includes: * New analytics tables for storing counts data * Raw SQL queries for pulling data from zerver/models.py tables * Aggregation functions for aggregating hourly stats into daily stats, and aggregating user/stream level stats into realm level stats * A management command for pulling the data Note that counts.py was added to the linter exclude list due to errors around %%s.	2016-10-04 17:18:54 -07:00

44 Commits