datetime objects are not ordinarily JSON serializable. While both
ujson and orjson have special cases to serialize datetime objects,
they do it in different ways. So we want to do this explicitly.
Signed-off-by: Anders Kaseorg <anders@zulip.com>
This commit extends the 0015_clear_duplicate_counts migration to remove
duplicate counts in StreamCount, UserCount, and InstallationCount as well.
Fixes https://github.com/zulip/docker-zulip/issues/266
Unlike stream_stats, I'm not aware of any of these having been used in
the last few years, and they're basically just poor subsets of the data
in /activity, which also doesn't require shell access to use.
These haven't had real work or usage, AFAIK, since 2013.
A few major themes here:
- We remove short_name from UserProfile
and add the appropriate migration.
- We remove short_name from various
cache-related lists of fields.
- We allow import tools to continue to
write short_name to their export files,
and then we simply ignore the field
at import time.
- We change functions like do_create_user,
create_user_profile, etc.
- We keep short_name in the /json/bots
API. (It actually gets turned into
an email.)
- We don't modify our LDAP code much
here.
The forms to change plan_type, add a discount, scrub a realm, etc. all
post to the same endpoint.
Our frontend code is written so that only one form posts at a time, but
there should be no harm in enforcing the same in the backend as well.
This commit changes the PreregistrationUser.invite_as dict to have the
same set of values as we have for UserProfile.role.
This also adds a data migration to update the already existing
PreregistrationUser and MultiuseInvite objects.
Whenever we use API queries to mark messages as read, we now increment
two new LoggingCount stats, messages_read::hour and
messages_read_interactions::hour.
We add an early return in the do_increment_logging_stat function if there
are no changes (the increment is 0), as an optimization to avoid
unnecessary database queries.
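A rough sketch of that early return (the real function takes more arguments
and updates the relevant *Count table):
    def do_increment_logging_stat(model_object, stat, subgroup, event_time, increment=1):
        if increment == 0:
            return  # nothing to record; skip the row lookup/update entirely
        ...  # existing logic: find or create the row and add increment to its value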
We also log the messages_read_interactions::hour stat as the number of
API queries used to mark messages as read.
We don't include tests for the case where do_update_pointer is called
because do_update_pointer will most likely be removed from the
codebase in the near future.
There seems to have been confusion between two different uses of the
word “optional”:
• An optional parameter may be omitted and replaced with a default
value.
• An Optional type has None as a possible value.
Sometimes an optional parameter has a default value of None, or None
is otherwise a meaningful value to provide, in which case it makes
sense for the optional parameter to have an Optional type. But in
other cases, optional parameters should not have Optional type. Fix
them.
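To illustrate the distinction with a made-up example (names are hypothetical):
    from typing import Optional

    def fetch(limit: int = 10) -> None: ...               # optional parameter, non-Optional type
    def lookup(realm: Optional[str] = None) -> None: ...  # None is meaningful, so Optional fits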
Signed-off-by: Anders Kaseorg <anders@zulip.com>
Fixes #2665.
Regenerated by tabbott with `lint --fix` after a rebase and change in
parameters.
Note from tabbott: In a few cases, this converts technical debt in the
form of unsorted imports into different technical debt in the form of
our largest files having very long, ugly import sequences at the
start. I expect this change will increase pressure for us to split
those files, which isn't a bad thing.
Signed-off-by: Anders Kaseorg <anders@zulip.com>
Automatically generated by the following script, based on the output
of lint with flake8-comma:
import re
import sys

last_filename = None
last_row = None
lines = []

for msg in sys.stdin:
    m = re.match(
        r"\x1b\[35mflake8 \|\x1b\[0m \x1b\[1;31m(.+):(\d+):(\d+): (\w+)", msg
    )
    if m:
        filename, row_str, col_str, err = m.groups()
        row, col = int(row_str), int(col_str)

        if filename == last_filename:
            assert last_row != row
        else:
            if last_filename is not None:
                with open(last_filename, "w") as f:
                    f.writelines(lines)
            with open(filename) as f:
                lines = f.readlines()
            last_filename = filename
        last_row = row

        line = lines[row - 1]
        if err in ["C812", "C815"]:
            lines[row - 1] = line[: col - 1] + "," + line[col - 1 :]
        elif err in ["C819"]:
            assert line[col - 2] == ","
            lines[row - 1] = line[: col - 2] + line[col - 1 :].lstrip(" ")

if last_filename is not None:
    with open(last_filename, "w") as f:
        f.writelines(lines)
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
Generated by pyupgrade --py36-plus --keep-percent-format, but with the
NamedTuple changes reverted (see commit
ba7906a3c6, #15132).
Signed-off-by: Anders Kaseorg <anders@zulip.com>
When creating streams in test_counts.py, we previously did not save the
recipient in the stream object.
stream.recipient is used in many functions, so they would throw an error.
The right long-term fix here is probably to just use the standard
stream creation functions rather than having a hacky duplicate
here.
datetime.timezone is available in Python ≥ 3.2. This also lets us
remove a pytz dependency from the PostgreSQL scripts.
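For example, the stdlib spelling we can now use instead of pytz:
    from datetime import datetime, timezone

    now = datetime.now(tz=timezone.utc)  # rather than datetime.now(tz=pytz.utc)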
Signed-off-by: Anders Kaseorg <anders@zulip.com>
We change do_create_user and create_user to accept
role as a parameter instead of 'is_realm_admin' and 'is_guest'.
These changes are done to minimize data conversions between
role and boolean fields.
mock is just a backport of the standard library’s unittest.mock now.
The SAMLAuthBackendTest change is needed because
MagicMock.call_args.args wasn’t introduced until Python
3.8 (https://bugs.python.org/issue21269).
The PROVISION_VERSION bump is skipped because mock is still an
indirect dev requirement via moto.
Signed-off-by: Anders Kaseorg <anders@zulip.com>
This commit merges do_change_is_admin and do_change_is_guest into a
single function, do_change_user_role, which will be used for changing the
role of users.
do_change_is_api_super_user is added as a separate function for
changing the is_api_super_user field of UserProfile.
Since production testing of `message_retention_days` is finished, we can
enable this feature on the organization settings page. We already had this
setting in the frontend, but it had bit-rotted and was not rendered in
templates.
Here we replace our past text-input-based setting with a
dropdown-with-text-input approach, which is more consistent with our
existing UI.
Along with the frontend changes, we also incorporate a backend change to
handle making the retention period forever. This change introduces a new
converter, `to_positive_or_allowed_int`, which only allows positive integers
plus one allowed value, for settings like `message_retention_days` that can
be either a positive integer or `Realm.RETAIN_MESSAGE_FOREVER` when we
change the setting to retain messages forever.
This change makes `to_not_negative_int_or_none` redundant, so we remove it
as well.
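A rough sketch of the converter's behavior (the actual signature and error
handling may differ):
    def to_positive_or_allowed_int(allowed_integer):
        def converter(s):
            x = int(s)
            if x == allowed_integer:
                return x
            if x <= 0:
                raise ValueError("argument is not positive")
            return x
        return converter

    # used roughly as: to_positive_or_allowed_int(Realm.RETAIN_MESSAGE_FOREVER)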
Fixes: #14854
Generated by autopep8, with the setup.cfg configuration from #14532.
I’m not sure why pycodestyle didn’t already flag these.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
Generated by `pyupgrade --py3-plus --keep-percent-format` on all our
Python code except `zthumbor` and `zulip-ec2-configure-interfaces`,
followed by manual indentation fixes.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
The value of start would be set to the realm creation date or the
installation date unless a value is explicitly provided by the user. We
don't want to show the "analytics data is missing" error in either of these
cases if start is less than 24.5 hours from timezone_now(), since some of
the stats are populated by a daily cron job that may take a couple of
minutes to run.
The same applies if the value of start is passed in the request.
We try to use the correct variation of `email`
or `delivery_email`, even though in some
databases they are the same.
(To find the differences, I temporarily hacked
populate_db to use different values for email
and delivery_email, and reduced email visibility
in the zulip realm to admins only.)
In places where we want the "normal" realm
behavior of showing emails (and having `email`
be the same as `delivery_email`), we use
the new `reset_emails_in_zulip_realm` helper.
A couple of random things:
- I fixed any error messages that were leaking
the wrong email
- a test that claimed to rely on the order
of emails no longer does (we sort user_ids
instead)
- we now use user_ids in some places where we used
to use emails
- for IRC mirrors I just punted and used
`reset_emails_in_zulip_realm` in most places
- for MIT-related tests, I didn't fix email
vs. delivery_email unless it was obvious
I also explicitly reset the realm to a "normal"
realm for a couple tests that I frankly just didn't
have the energy to debug. (Also, we do want some
coverage on the normal case, even though it is
"easier" for tests to pass if you mix up `email`
and `delivery_email`.)
In particular, I just reset data for the analytics
and corporate tests.
We now have this API...
If you really just need to log in
and not do anything with the actual
user:
    self.login('hamlet')
If you're going to use the user in the
rest of the test:
    hamlet = self.example_user('hamlet')
    self.login_user(hamlet)
If you are specifically testing
email/password logins (used only in 4 places):
    self.login_by_email(email, password)
And for failures, use this (used twice):
    self.assert_login_failure(email)
Replaced unique_together with UniqueConstraint in the models where the
constraint covered nullable fields, since unique_together database indexes
don't enforce uniqueness when subgroup=None. So we added conditional unique
indexes to handle the otherwise-invalid duplicate Count data.
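Roughly, inside each *Count model the Meta now carries constraints along
these lines (constraint names and field lists are illustrative):
    from django.db.models import Q, UniqueConstraint

    class Meta:
        constraints = [
            UniqueConstraint(
                fields=["realm", "property", "subgroup", "end_time"],
                name="unique_realm_count",
            ),
            UniqueConstraint(
                fields=["realm", "property", "end_time"],
                condition=Q(subgroup=None),
                name="unique_realm_count_null_subgroup",
            ),
        ]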
Added 0015_clear_duplicate_counts migration to handle existing
data that violates the constraints.
Also corrected a test case in test_counts.py which didn't clear its
state properly and thus was accidentally taking advantage of this
database schema bug.
A few comments are added to explain more clearly the changes made in
b23a5431cd, namely about not using realm
arguments in LoggingCountStats and the need to pass the realm argument to
the pull function.
The comments were tweaked by tabbott for readability.
This field wasn't used for anything, and I think it has very limited
use for debugging, since fundamentally, it'll almost always have a
value within the hour of the actual timestamp in FillState, and any
more fine-grained logging we might want would be available in the
analytics job's own logs.
The proximal reason to remove it is that apparently Django's
model_to_dict doesn't support auto_now fields, and that caused some
trouble when working on adding more complete import/export support for
analytics data.
This changeset is preparatory work for doing something reasonable with
analytics data during the Zulip -> Zulip data import process (and
potentially e.g. Slack -> Zulip as well).
To support that, we need to make it possible to do our analytics
calculations for a single realm.
We do this while maintaining backwards compatibility and avoiding
massive duplicated code by adding an optional `realm` argument to the
entrypoints to the analytics system, especially process_count_stat.
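A sketch of the shape of the change (simplified signature):
    def process_count_stat(stat, fill_to_time, realm=None):
        # realm=None keeps the old installation-wide behavior; passing a realm
        # restricts the fill to that realm's data (and, per the note below,
        # skips InstallationCount updates).
        ...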
More work involving restructuring FillState will be required for this
to be actually usable for its intended purpose, but this commit is a
nice checkpoint along the way.
Tweaked by tabbott to adjust comments and disable InstallationCount
updates when a realm argument is specified.
This is a preparatory commit for using isort for sorting all of our
imports, merging changes to files where we can easily review the
changes as something we're happy with.
These are also files with relatively little active development, which
means we don't expect much merge conflict risk from these changes.
This adds foreign keys to the corresponding Recipient object in the
UserProfile and Stream tables, a denormalization intended to improve
performance, as this is a common query.
In the migration for setting the field correctly for existing users,
we do a direct SQL query (because Django 1.11 doesn't provide any good
method for doing it properly in bulk using the ORM).
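The bulk backfill is along these lines (a sketch, not the exact migration;
the column name and the Recipient.PERSONAL constant are assumptions):
    def backfill_personal_recipients(apps, schema_editor):
        with schema_editor.connection.cursor() as cursor:
            cursor.execute(
                """
                UPDATE zerver_userprofile
                SET personal_recipient_id = zerver_recipient.id
                FROM zerver_recipient
                WHERE zerver_recipient.type_id = zerver_userprofile.id
                  AND zerver_recipient.type = 1  -- Recipient.PERSONAL
                """
            )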
A consequence of this change to the model is that a bit of code needs
to be added to the functions responsible for creating new users (to
set the field after the Recipient object gets created). Fortunately,
there's only a few code paths for doing that.
An adjustment is also needed in the import system, since this introduces a
circular relation between Recipient and UserProfile. The field cannot be
set until the Recipient objects have been created, but UserProfiles need
to be created before their corresponding Recipients. We deal with this
by first importing UserProfiles the same way as before, but we leave the
personal_recipient field uninitialized. After creating the Recipient
objects, we call a function to set the field for all the imported users
in bulk.
A similar change is made for managing Stream objects.
We currently have code to calculate the value of realm_icon_url,
admin_emails, and default_discount in two different places. With
the addition of showing confirmation links, it would become three.
The easiest way to deduplicate the code and make the view cleaner
is to do the calculations in the template. Alternatively, one could
write a function that takes users, realms, and confirmations as
arguments and sets the value of realm_icon_url, admin_emails, and
default_discount appropriately in the realm object according to the
type of the confirmation. But that seems messier than the approach
of passing the functions directly to the template.
MigrationsTestCase is intentionally omitted from this, since migrations
tests are different in their nature and so whatever setUp()
ZulipTestCase may do in the future, MigrationsTestCase may not
necessarily want to replicate.
Fixes #1727.
With the server down, apply migrations 0245 and 0246. 0246 will remove
the pub_date column, so it's essential that the previous migrations
ran correctly to copy data before running this.
This sidesteps tricky escaping issues, and will make it easier to
build a strict Content-Security-Policy.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
Previous cleanups (mostly the removals of Python __future__ imports)
were done in a way that introduced leading newlines. Delete leading
newlines from all files, except static/assets/zulip-emoji/NOTICE,
which is a verbatim copy of the Apache 2.0 license.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
We simply state that certain options are `Optional`.
The following files are affected:
add_users_to_mailing_list
send_to_email_mirror
fill_memcached_caches
client_activity
This function is an alternative to get_admin_users that we use in all
places where we explicitly want only human administrative users (not
administrative bots). The following commits will rename
get_admin_users for better clarity.
This renames Subscription.in_home_view field to is_muted, for greater
clarity as to what it does just from seeing the setting name, without
having to look it up.
Also disabled an obsolete test_migrations test.
Fixes #10042.
This makes the implementation of `get_realm` consistent with its
declared return type of `Realm` rather than `Optional[Realm]`.
Fixes #12263.
Signed-off-by: Anders Kaseorg <anders@zulipchat.com>
Using sys.exit in a management command makes it impossible
to unit test the code in question. The correct approach to do the same
thing in Django management commands is to raise CommandError.
Follow-up to b570c0dafa.
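An illustration of the pattern (not the code from this commit):
    from django.core.management.base import BaseCommand, CommandError

    class Command(BaseCommand):
        def handle(self, *args, **options):
            realm = options.get("realm")
            if realm is None:
                # Raise CommandError rather than calling sys.exit(1), so the
                # code path can be exercised from a unit test.
                raise CommandError("You must specify a realm.")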
AssertionErrors were raised when attempting to run manual comparison
tests to ensure correctness when exporting the analytics realm using
export_from_config. This was caused by this populate_analytics_db
stream being created without any subscribers, which violates an
invariant.
We fix this by simply subscribing the 'shylock' user to that stream.
In ebdd55814c, we added zilencer imports
without using the proper mocking procedure for when zilencer is not enabled.
This whole setup is a mess and probably we should enable zilencer
unconditionally in a future version.
This is a major rewrite of the billing system. It moves subscription
information off of stripe Subscriptions and into a local CustomerPlan
table.
To keep this manageable, it leaves several things unimplemented
(downgrading, etc), and a variety of other TODOs in the code. There are also
some known regressions, e.g. error-handling on /upgrade is broken.
A key part of this is the new helper, get_user_by_delivery_email. Its
verbose name is important for clarity; it should help avoid blind
copy-pasting of get_user (which we'll also want to rename).
Unfortunately, it requires detailed understanding of the context to
figure out which one to use; each is used in about half of call sites.
Another important note is that this PR doesn't migrate get_user calls
in the tests except where not doing so would cause the tests to fail.
This probably deserves a follow-up refactor to avoid bugs here.
This deletes the unused Subscription.notifications field and removes
it from some testing and analytics code (which should not have been
using it in the first place).
Fixes #10042.
This fixes a subtle bug where if you reran populate_analytics_db
directly, we'd end up in a weird state where memcached fetched the
"old" pre-flush UserProfile object for shylock when loading /stats,
which ultimately would result in /stats appearing totally broken.
This code is going to end up pretty complex -- each stat has multiple levels
of aggregation (UserCount, RealmCount, InstallationCount), and refinement
(subgroups), and soon we'll have charts that take data from multiple stats
as input.
Not sure what the best way to present it is, but hopefully this simplifies
it a bit.
We use "Everyone" for the button labels already.
Soon we'll support "Everyone" meaning either the installation or the realm,
depending on the URL route used to access the stats.
In this commit:
Two new URLs are added to make all realms' stats accessible to server
admins. One is for the stats page itself and the other for fetching
chart data, i.e. chart data API requests.
Two corresponding view functions are added for the above new URLs.
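A sketch of the kind of routes added (regexes and view names are
approximate):
    from django.conf.urls import url
    import analytics.views

    urlpatterns = [
        url(r"^stats/realm/(?P<realm_str>[\S]+)/$", analytics.views.stats_for_realm),
        url(r"^json/analytics/chart_data/realm/(?P<realm_str>[\S]+)$",
            analytics.views.get_chart_data_for_realm),
    ]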
The name `create_logger` suggests something much bigger than what this
function actually does -- the logger doesn't any more or less exist
after the function is called than before. Its one real function is to
send logs to a specific file.
So, pull out that logic to an appropriately-named function just for
it. We already use `logging.getLogger` in a number of places to
simply get a logger by name, and the old `create_logger` callsites can
do the same.
This is already the loglevel we set on the root logger, so this has no
effect -- except in tests, where `test_settings.py` attempts to set
some of these same loggers to higher loglevels. Because the
`create_logger` call generally runs after we've configured settings,
it clobbers that effect.
The code in `test_settings.py` that tries to suppress logs only works
because it also sets `propagate=False`, which has nothing to do with
loglevels but does cause logs at this logger (and descendants) to be
dropped completely unless we've configured handlers for this logger
(or one of its relevant descendants).
I've wanted this when looking at a tab from the day before.
Also provides the date and time in UTC, which is handy for
interpreting some of the data.
Pretty sure this is not the world's cleanest way to do this in the
front-end code. It'll do for now.
Substantively, this makes the table more readable by grouping users
into expanding sets by level of activity: active in last day, active
in last week, have an account at all. The class "active in last week",
as opposed to "active in last week but not in last day", makes more
natural comparisons both between realms and for one realm through time,
and it's less sensitive to the details of our definitions.
This also makes the terminology more standard. We already made that
change in the display, in the previous commit; as we go through the
logic here, we adjust the terminology in the code too.
Except in:
- docs/writing-bots-guide.md, because bots are supposed to be Python 2
compatible
- puppet/zulip_ops/files/zulip-ec2-configure-interfaces, because this
script is still on python2.7
- tools/lint
- tools/linter_lib
- tools/lister.py
For the latter two, because they might be yanked away to a separate repo
for general use with other FLOSS projects.
Sort of a hacky hammer, but
* The original design of the analytics system mistakenly attempted to play
nicely with non-UTC datetimes.
* Timezone errors are really hard to find and debug, and don't jump out that
easily when reading code.
I don't know of any outstanding errors, but putting a few "assert this
timezone is in UTC" around will hopefully reduce the chance that there are
any current or future timezone errors.
Note that none of these functions are called outside of the analytics code
(and tests). This commit also doesn't change any current behavior, assuming
a database where all datetimes have been being stored in UTC.
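The "assert this timezone is in UTC" checks mentioned above are essentially
of this shape (a sketch; the actual helper name and exception may differ):
    from datetime import datetime, timezone

    def verify_UTC(dt: datetime) -> None:
        if dt.tzinfo is None or dt.tzinfo.utcoffset(dt) != timezone.utc.utcoffset(None):
            raise ValueError("datetime is not in UTC")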
Previously, entering a non-UTC end time for a daily stat would give you
incorrect results. This is because:
* All daily stats are collected at and have end_times in the database in
midnight UTC.
* For daily stats, time_range returns a list of datetimes at midnight in the
timezone of its end argument. These datetimes are the only ones we look
for when looking for rows corresponding to the stat in the database.
* Previously, we passed on the end argument from the API to time_range,
without modification.
This consists of the `zulip_ops::stats` Puppet class, which has apparently
not been used since 2014, and a number of files that I believe were
only used for that. Also a couple of tiny loose ends in other files.
These are no longer useful, with our spiffy new analytics framework,
and we haven't in fact been using them for some time, while the
`active-user-stats` cron job does cause regular mail from cron.
Just delete them.
sort_client_labels sorts first by total, and then to ensure deterministic
outcomes, sorts (reverse) alphabetically by label.
Fixes regression introduced in 0c0e539.
Previously we showed the total number of users with an active account. This
changes it to show only the number of users that have logged in in the past
two weeks.
Rename the 'zulip_internal' decorator to 'require_server_admin', and add
documentation for 'server_admin', explaining how to grant permission
for the ./activity page.
Fixes: #1463.
Previously we would update FillState for daily stats on hourly boundaries as
well. This would create two extra queries on the FillState table every hour
(for each CountStat), which adds roughly 50ms of extra processing for each
CountStat each day, as well as two extra lines each hour in the analytics
log. This can be a minor annoyance when backfilling stats.
I think this is more pythonic?
We could also get rid of LoggingCountStats altogether, since it's now just a
special case of CountStat (is_logging is equivalent to
data_collector.pull_function being None). But I think it's nice to keep the
distinction since they behave so differently.
Removes the circular dependency of CountStat containing a DataCollector, and
DataCollector containing a function that takes a CountStat as an argument.
This will allow us to appropriately generalize CountStat to include
LoggingCountStat and CustomPullCountStat. It'll also make life easier when
we introduce DependentCountStat.
The previous zerver_* names were unwieldy and not very readable. This also
puts more of the useful information in one place; in particular, makes it
easier to skim a CountStat declaration and see if we're collecting it at a
user/stream granularity or a realm granularity.
It turned out to not be that useful once we added subgroup. The previous
design of the CountStat object also assumed more reusability of the *_query
strings than what ended up happening.
The filter_args also had some carrying costs:
* It's hard to be confident that filter_args other than the ones explicitly
in our tests would have had expected behavior.
* The filter_args/join_args system is the most complex part of the CountStat
object, and makes understanding the *_query strings unnecessarily
difficult for a new contributor.
Groundwork for allowing stats like "Monthly Active Users".
CountStat.interval is no longer as clean a value as before, so removed it
from views.get_chart_data. It wasn't being used by the frontend anyway.
Removing interval from logger calls in counts.py is not a big loss since we
now include the frequency (which is typically also the interval) in
CountStat.property.
The code in fixtures.py is only called from populate_analytics_db, and is
only used for generating pretty fixture data for manual testing. This commit
adds tests for a few things that were easy to add tests for, and provides
some minimal coverage of the file, but is not meant to be comprehensive.
Originally, all the client names in populate_analytics_db started with
underscores to make it easy to selectively delete and regenerate them when
re-running populate_analytics_db.
We eventually want to merge populate_analytics_db into populate_db though,
in which case it makes more sense for them to share client names, and not
worry about the case where we run (or re-run) populate_analytics_db
independently of populate_db.
It will simplify the logic needed to process the "Sent by Me" view in
Messages Sent Over Time in stats.js.
Also, we gzip the data sent from our server, so there is little additional
network usage by doing this.
Django 1.10 has changed the implementation of this function to
match our custom implementation; in addition to this, we prefer
render().
Fixes #1914 via #4093.
count_message_type_by_user_query is in a different format (no WHERE clause)
from the rest since I'm having a hard time reasoning about how that would
interact with the LEFT JOIN, especially given that there are %(join_args)s.
Analytics database tables are getting big, and so we're likely moving to a
model where ~all stats are day stats, and we keep hourly stats only for the
last N days.
Also changed the name because:
* messages_sent_* suggests the counts (summed over subgroup) should be the
same as the other messages_sent stats, but they are different (these don't
include PMs).
* messages_sent_by_stream:is_bot:day is longer than 32 characters, the max
allowable length for a BaseCount.property.
Includes a database migration to remove the old stat from the analytics
tables.
When you pass a naive datetime to the Django ORM, it uses settings.TIME_ZONE
for the time zone. In the development environment, both settings.TIME_ZONE
and datetime.now() use 'America/New_York', so there is no change in behavior
there. (fromtimestamp with no tz argument uses the same timezone as
datetime.now)
We are soon going to change settings.TIME_ZONE to UTC, so need to remove
naive datetimes from queries to the ORM.
This actually fixes previously broken behavior, since 'date' here gets
turned into the 'day' argument of seconds_active_during_day(day), where
tzinfo is set to UTC.
API: Adds a "display_order" to the response, which is a suggested order of
importance for the clients or recipient types respectively.
frontend: Changes messages_sent_by_{client,recipient_type} to use a fixed
order for any given user.
Also includes a number of changes to messages_sent_by_recipient_type that
were convenient to do at the same time, since the two charts share a lot of
code.
This adds a frontend for the analytics system we've had for a few
months, showing several graphs of the data in Zulip.
There's a ton more that we can do with this tooling, but this initial
version is enough to provide users with a pretty good experience.
Fixes #2052.
Makes a number of simplifications to the analytics views code. The main one is
that we now return the entire data series, even if the data is eventually
going to go into a pie chart. This was prompted by us wanting several
different pie charts for each stat (one for last 30 days, one for all time,
etc), but I think it is also a more natural API. The total amount of data
being sent for the pie charts now is maybe half of what is being sent for
our single 'hourly' stat, or maybe up to 10,000 ints per year the
organization has been around.
The other big change is that the data being sent back is now always explicit
about whether it is data about the realm (stored in data['realm']) or data
about the user (stored in data['user']).
Having both messages_sent:hour and messages_sent:is_bot:day is confusing,
since a single messages_sent:is_bot:hour would have a superset of the
information and take less total space. This commit and its parent together
replace the two stats with a single messages_sent:is_bot:hour.
Includes a database migration. The interval field was originally there to
facilitate time aggregation (e.g. aggregate_hour_to_day), but we now do such
aggregations in views code or in the frontend.
A few reasons:
* Our two other subgroup'd message stats in UserCount are at CountStat.DAY
frequency (messages_sent:is_bot and messages_sent:message_type).
* Keeping this stat at hourly frequency would likely double the size of our
analytics table, given the current stats. (Counterpoint: if there are
roughly as many active streams as active users, and we keep
messages_sent_to_stream:is_bot at hourly frequency, then maybe this stat
is only a 30% or 50% increase).
* We're currently only showing this on the frontend as a pie chart anyway.
Previously, this function seemed ambivalent about whether it was generating
a series of abstract data points or a series of data points that would
correspond to times. Switch firmly to the latter, so e.g. if the frequency
changes, so will the length of the output sequence.
Not sure if this would actually be a performance problem in practice, but
this was originally making a database query for each subgroup (instead of
just a single query getting data for all the subgroups).
Also removed the filter against the interval column, which will soon not be
needed (interval will be uniquely determined by the property).
Adds two things to TestCountStats.setUp():
* A realm with no messages, that generally should not show up in *Count
tables,
* Users/streams/messages created at 0, 1, 61, and 1441 (just over a day)
minutes ago (previously was 0, 60), to better test the start_time/end_time
in the queries, and the frequency/interval setting in the CountStats.
Fixes an error in the definition of
COUNT_STATS['messages_sent_to_stream:is_bot']. The CountStat needs a
group_by argument since it is supposed to group by UserProfile.is_bot.
This query counts the number of messages each user has sent, subgroup'd by
whether the message was a private_message (PM or sent to a huddle), sent to
a 'private_stream', or sent to a 'public_stream'.
We need to join on zerver_stream to find out whether stream messages were
sent to public streams or private streams, but it needs to be a LEFT JOIN
rather than a JOIN so that we preserve the messages sent to non-streams.
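Schematically, the query has this shape (illustrative, not the literal SQL
from the commit):
    count_message_type_by_user_query = """
        SELECT zerver_userprofile.id,
               CASE
                   WHEN zerver_recipient.type != 2 THEN 'private_message'
                   WHEN zerver_stream.invite_only THEN 'private_stream'
                   ELSE 'public_stream'
               END AS subgroup,
               COUNT(*)
        FROM zerver_message
        JOIN zerver_userprofile ON zerver_userprofile.id = zerver_message.sender_id
        JOIN zerver_recipient ON zerver_recipient.id = zerver_message.recipient_id
        LEFT JOIN zerver_stream ON zerver_recipient.type = 2
                               AND zerver_recipient.type_id = zerver_stream.id
        GROUP BY zerver_userprofile.id, subgroup
    """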
We were updating FillState with FillState.objects.filter(..).update(..),
which does not update the last_modified field (which has auto_now=True).
The correct incantation is the save() method of the actual FillState
object.
interval refers to a time interval, and frequency refers to something that
semantically means something closer to 'hourly' or 'daily'.
Currently, interval can have values 'hour', 'day', or 'gauge', and frequency
can only have values 'hour' and 'day'.
Finishes the refactoring started in c1bbd8d. The goal of the refactoring is
to change the argument to get_realm from a Realm.domain to a
Realm.string_id. The steps were
* Add a new function, get_realm_by_string_id.
* Change all calls to get_realm to use get_realm_by_string_id instead.
* Remove get_realm.
* (This commit) Rename get_realm_by_string_id to get_realm.
Part of a larger migration to remove the Realm.domain field entirely.
Was enabled by commit 41e8ee3 where we moved TIME_ZERO to before the realms
created by populate_db.py.
Also removes the stub for TestAggregates, since the remaining thing to be
tested was the aggregation from RealmCount to InstallationCount, and the end
to end checks provided by the TestCountStat tests should be sufficient.
In a previous design, there was no FillState table, and one could run any
CountStat at any time. This is no longer supported.
This test was making sure that if one ran a CountStat at a certain hour, and
then ran it at a previous hour, the old rows would still be there.
It seems unlikely we will need count_message_by_stream without the
UserProfile table in the future, so write count_message_by_stream_and_is_bot
in the usual query form and replace count_message_by_stream with it.
This also has the benefit of shortening our list of "special case" queries
from two to one.
The pathways of the removed test will be covered more thoroughly in the new
TestCountStats tests.
The filter args dictionary applies to the X table in a count X by Y query,
which in this case is the zerver_message table. This stat had an incorrect set
of arguments meant for the zerver_userprofile table.
We alter the behavior of our queries to no longer write rows with 0 counts
to the db, and pad with 0s in the related views code. As a result we are
also able to combine the where and join clause conditions in the sql
queries. This new behavior is also updated in our tests.
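The padding in the views code then amounts to something like this (a sketch;
names are illustrative):
    # rows from the *Count table no longer include zero buckets, so pad here
    counts_by_time = {row.end_time: row.value for row in rows}
    series = [counts_by_time.get(end_time, 0) for end_time in all_end_times]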
Adds a database migration, adds a new string_id argument to the management
realm creation command, and adds a short name field to the web realm
creation form when REALMS_HAVE_SUBDOMAINS is False.
Adds a count_X_by_Y_query to counts.py, similar in spirit to a
count_recipient_by_user query, where we would join on the Message,
Recipient, and UserProfile table. Here, we also join on the Stream table in
order to distinguish private and public streams, and we merge the counts for
PM and Huddle type messages into a single subgroup.
This is a major change to the analytics schema, and is the first step in a
number of refactorings and performance improvements. For instance, it allows
* Grouping sets of similar CountStats in the *Count tables. For instance,
active{_humans,_bots} will now have the same property, but have different
subgroup values.
* Combining queries that differ only in their value on 1 filter clause, so
that we make fewer passes through the zerver tables. For instance, instead
of running a query for each of messages_sent_to_public_streams and
messages_sent_to_private_streams, we can now run a single query with a
group by on Stream.invite_only, and store the group by value in the
subgroup column.
For each database query made by an analytics function, log time spent and
the number of rows changed to var/logs/analytics.log.
In the spirit of write ahead logging, for each (stat, end_time)
update, log the start and end of the "transaction", as well as time
spent.
Change the CountStat object to take an is_gauge variable instead of a
smallest_interval variable. Previously, (smallest_interval, frequency)
could be any of (hour, hour), (hour, day), (hour, gauge), (day, hour),
(day, day), or (day, gauge).
The current change is equivalent to excluding (hour, day) and (day, hour)
from the list above.
This change, along with other recent changes, allows us to simplify how we
handle time intervals. This commit also removes the TimeInterval object.
Adding FillState, removing do_aggregate_hour_to_day, and disallowing unused
(interval, frequency) pairs removes the need for the nested for loops in
do_fill_count_stat_at_hour. This commit replaces that control flow with a
simpler equivalent.
The functionality provided is more naturally done in the views code. It also
allows us to aggregate using day boundaries from the local timezone, rather
than UTC.
Adds two simplifying assumptions to how we process analytics stats:
* Sets the atomic unit of work to: a stat processed at an hour boundary.
* For any given stat, only allows these atomic units of work to be processed
in chronological order.
Adds a table FillState that, for each stat, keeps track of the last unit of
work that was processed.
Previously, if a Realm had no users (or no streams),
do_aggregate_to_summary_table would fail to add a row with value 0. This
commit fixes the issue and also simplifies the do_aggregate_to_summary_table
logic.
Previously we showed both the value and the id of the BaseCount record,
which is confusing in a typical case where you only care about the value,
and both the value and id are smallish ints.
Refactor the current analytics tests into the following classes:
* TestUpdateAnalyticsCounts, which will eventually test the management
command, backfilling, what happens when new tests are added, etc.
* TestProcessCountStat, which tests the ins and outs of propagating the
value of a single stat up through the various *Count tables.
* TestAggregates, which tests the do_aggregate_* methods.
* TestXByYQueries, which tests the count_X_by_Y_query SQL snippets.
* TestCountStats, which has tests for individual CountStats.
This commit does not change the name or contents of any individual test.