Commit Graph

793 Commits

Author SHA1 Message Date
Sunah Suh c8d0136694
Bug 1671517: Add event counts to clients daily (#1455) 2020-10-20 13:29:21 -05:00
Jeff Klukas a9c531e3c6
Exempt a few files from dry run due to new table-level ACLs (#1462)
* Exempt a few files from dry run due to new table-level ACLs

The dry run service can no longer perform queries with wildcard table
specifications or access raw AET data. See https://github.com/mozilla-services/cloudops-infra/pull/2599

* Verbose referenced_tables for AET logging clients daily
2020-10-20 10:34:02 -05:00
Jeff Klukas eae0d6d3d2
Revert field order in smoot nondesktop (#1454)
Fixes https://github.com/mozilla/bigquery-etl/issues/1453
2020-10-19 14:23:19 -07:00
Anthony Miyaguchi b66f948bdd
Add CREATE OR REPLACE VIEW clause to geckoview_versions and move it to org_mozilla_fenix (#1452)
* Add CREATE OR REPLACE VIEW clause to geckoview_versions

* Move geckoview version to org_mozilla_fenix
2020-10-19 12:36:45 -07:00
Jeff Klukas 4538e7c749
Formalize product names in nondesktop_clients_last_seen (#1380)
The `nondesktop_clients_last_seen_v1` view was developed mostly as an
internal implementation detail for downstream tables, but it has become
useful in its own right. This PR formalizes the view by providing an alias
without a version modifier and it adds a `product` field with application
names that are short but more meaningful than the `app_name` field.

See discussion in https://jira.mozilla.com/browse/DO-330 about confusion that
has resulted from the name "Fennec iOS" used in dashboards, etc. This is a
step toward reducing that kind of confusion.

This PR also adds `contributes_to_2019_kpi` and `contributes_to_2020_kpi` fields
as the source of truth for how we count KPI metrics. That logic is currently
copied and pasted in several places, which could lead to errors.

This will need a fair amount of review from data users before moving forward.
It will also require backfilling several downstream tables and communicating
the change.
2020-10-19 15:16:21 -04:00
Anthony Miyaguchi 49c9bbf340
Add view for geckoview versions with one row per build hour (#1450) 2020-10-19 09:26:28 -07:00
Daniel Thorn 9d7a566e1c
Refactor stripe tables to reduce confusion (#1444) 2020-10-16 15:08:44 -07:00
Anthony Miyaguchi 2521f926e7
Add daily schedule for geckoview versions (#1447)
* Add daily schedule for geckoview versions

* Remove unnecessary parameters
2020-10-16 14:37:30 -07:00
Anthony Miyaguchi 349dff3ca2
Add table to determine Fenix nightly mapping of builds to geckoview versions (#1419)
* Add initial incremental query for geckoview build dates

* Add initial tests for incremental query (WIP)

* Add files for initial tests

* Rework query so it doesn't fail during tests

* Fix schema so queries run

* Add passing test for init

* Add test for query aggregation

* Add metadata file for scheduling the query

* Move scripts from fenix_nightly to fenix

* Remove scheduling

* Add document strings.

* Change dataset reference and indent comments correctly

* Remove init and address feedback

* remove init file
* make query idempotent by appending window to each submission_date
* rename n_builds to n_pings
* reduce window size from 30 days to 14 days
* avoid use of subqueries

* Update tests for query

* Fix tests

* Add failing test for 100

* Fix query so it works across fx100 boundary

* Add linting fixes
2020-10-16 11:57:23 -07:00
Jeff Klukas b9ed5f1242
Bug 1654078 Limit geo to country level in regrets-reporter view (#1440)
* Bug 1654078 Limit geo to country level in regrets-reporter view

See https://bugzilla.mozilla.org/show_bug.cgi?id=1654078#c45
2020-10-15 15:18:26 -04:00
Anthony Miyaguchi a7271f0189
Replace udf.fenix_build_to_datetime with mozfun reference (#1441) 2020-10-15 11:54:22 -07:00
Jeff Klukas 1ec0c7ba9f
Reduce complexity of mozfun.norm.product_info (#1439)
While testing https://github.com/mozilla/bigquery-etl/pull/1380
I was getting "query too complex" errors. It looks like that stems from the
many calls to `mozfun.norm.os` within the `product_info` function.
2020-10-15 12:52:13 -04:00
Jeff Klukas aad8ef72dd
Add mozfun.norm.product_info UDF (#1436)
* Add mozfun.norm.product_info UDF

Factoring this out from https://github.com/mozilla/bigquery-etl/pull/1380
since this UDF needs to be published before I can properly test the rest of the
changes there.

* Use assert.equals
2020-10-14 15:55:08 -04:00
Anthony Miyaguchi f416723675
Add udfs for creating and parsing custom build ids (#1418)
* Add udfs for creating and parsing custom build ids

* Move fenix_build_to_date into mozfun

* Reference mozfun.norm.fenix_build_to_datetime

* Reference norm instead of udf

* Update sql/mozfun/glam/parse_build_hour/metadata.yaml

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

* Rename format_build_hour to fenix_build_to_build_hour

* Update sql/mozfun/glam/parse_build_hour/udf.sql

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

* Rename parse_build_hour to build_hour_to_datetime

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>
2020-10-13 14:18:16 -07:00
Jeff Klukas d809e6fa2d
Fix field name error causing Shredder to miss FxA account deletions (#1428)
While looking at Shredder logs, I noticed that all entries for FxA-related
derived tables show 0 deletions, like:

> 712215424909 bytes and deleted 0 rows from moz-fx-data-shared-prod.firefox_accounts_derived.fxa_users_daily_v1

It appears that `account.deleted` events populate a field named `uid` rather
than `user_id`. I was able to verify this by choosing a recent `uid` value
from a deletion event and counting events where that same value appears as
`user_id` in FxA logs. There were matching messages.
2020-10-13 14:44:14 -04:00
Daniel Thorn 6abddb0f0d
Fix backticks in stripe derived views (#1425) 2020-10-12 12:43:37 -07:00
Anna Scholtz cec9412abd
Fix backticks in stripe views (#1423)
* Fix backticks in stripe views

* Fix backticks in mozilla vpn view

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
2020-10-12 11:39:27 -07:00
Anna Scholtz 70d98ca336 Add branches to experimenter experiments import data 2020-10-09 16:43:04 -07:00
Anna Scholtz 89067af73b Query for importing experimenter experiments 2020-10-09 16:43:04 -07:00
Frank Bertsch cfe5973f80 Allow for cross-project references in routines
- Scrape all projects for routine defns when generating tests
- Create UDFs as non-temp for stored procedure tests
- Make assert functions default non-temp (to support above)
2020-10-09 15:17:56 -04:00
Daniel Thorn 228fc8e041
Fully qualify stripe and mozilla_vpn views (#1414) 2020-10-09 11:06:28 -07:00
Daniel Thorn 6ca03e713e
Export mozilla vpn waitlist from cloud SQL (#1397) 2020-10-09 10:18:30 -07:00
Anna Scholtz 0d51459bd1 Move dependencies to udf_js_lib 2020-10-08 10:30:22 -07:00
Anna Scholtz 5a8311e2af Update UDF parsing 2020-10-08 10:30:22 -07:00
Jeff Klukas 3e0f2e4511
Bug 1669516 Use `app_display_version` for Fenix AMO stats (#1394)
* Bug 1669516 Use `app_display_version` for Fenix AMO stats

* Use geckoview_version for fenix nightly

* Remove deprecated installs_v1

* Remove dev installs_v1
2020-10-08 08:34:32 -04:00
Frank Bertsch 56defb1061 Fix failing test 2020-10-07 16:57:47 -04:00
Frank Bertsch f6165ef272 Remove unnecessary test
Co-authored-by: Jeff Klukas <jeff@klukas.net>
2020-10-07 16:57:47 -04:00
Frank Bertsch 949b64c4c5 Add extract_event_counts_with_properties 2020-10-07 16:57:47 -04:00
Frank Bertsch 3066cae069 Reformat extract_event_counts doc 2020-10-07 16:57:47 -04:00
Daniel Thorn 203db55d16
Remove incorrect destination partition from stripe tasks (#1398) 2020-10-06 15:33:55 -07:00
Jeff Klukas 1bab64d7ff
Bug 1635918 Enable deletion of data from AET tables on user request (#1396)
* Bug 1635918 Enable deletion of data from AET tables on user request

* Skip dryrun

* Add to SOURCES

* Add timestamp to deletions
2020-10-06 16:30:01 -04:00
Frank Bertsch 15418ab29b Remove dataset from friendly_name 2020-10-06 10:54:25 -04:00
Frank Bertsch 840b770004 Move UDF to sql/ dir 2020-10-06 10:54:25 -04:00
Frank Bertsch e917c3b1b4 Change event_analysis header 2020-10-06 10:54:25 -04:00
Anna Scholtz 518718d140
Move event_analysis back to mozfun (#1386) 2020-10-06 08:24:23 -04:00
Daniel Thorn 88ed89bd2c
Add stripe ETL to support Mozilla VPN dashboard (#1349) 2020-10-05 15:10:01 -07:00
Anna Scholtz 93bc51ba5e Move queries to right directories 2020-10-05 12:59:58 -07:00
Anna Scholtz bf33837ef7 Resolve rebase conflicts 2020-10-05 12:59:58 -07:00
Anna Scholtz d1c67dab53 Move projects into high-level sql/ folder 2020-10-05 12:59:58 -07:00
Anna Scholtz e010b0ec3c Move files to moz-fx-data-shared-prod 2020-10-05 12:59:58 -07:00
Arkadiusz Komarzewski bbd15f8ab5 Update ping selection criteria for fission monitoring tables
This adds filtering by experiment slug and filters out sessions where fission hasn't yet been enabled.
2020-10-05 18:59:19 +02:00
Arkadiusz Komarzewski 157209f2f5
Use submission_timestamp for partitioning in fission monitoring table (#1372) 2020-10-05 12:52:30 +02:00
Anna Scholtz c6888b57e5 Update destination table for bigquery_etl_scheduled_query_usage 2020-10-02 12:25:38 -07:00
Jeff Klukas 60bf1347fb
Added visited_* columns to aet_clients_daily (#1375)
Quick follow-up to #1371
2020-10-02 14:39:21 -04:00
Anna Scholtz 11a24e6b5a Refactor query usage monitoring query 2020-10-02 10:38:59 -07:00
Anna Scholtz 6e456168b8 Add query usages monitoring 2020-10-02 10:38:59 -07:00
Anna Scholtz ad896eed82 Remove anomdtct 2020-10-02 10:31:42 -07:00
Anna Scholtz 2d37c4f33f Reformat search_clients_last_seen 2020-10-02 10:15:53 -07:00
XuanL 06d78415ed Remove `total_searches`
`total_searches` is defined as a sum over several other search-related metrics, which is not a proper definition (though it is fine for determining whether a user searched, as used in this case). Misuse of the `total_searches` value can be misleading, so we prefer to remove it from the view.
2020-10-02 10:15:53 -07:00
Jeff Klukas abb2ae5d90
Bug fixes for AET daily ETL (#1371)
This fixes a bug about missing uids in the daily table as reported in
https://bugzilla.mozilla.org/show_bug.cgi?id=1637926#c5

It also adds some boolean fields to allow the daily table to specifically
address questions about whether a given client was 5-uri active per day.
2020-10-02 12:50:32 -04:00