Commit Graph

1393 Commits

Author SHA1 Message Date
Frank Bertsch 4a9d44a802
Use non-ISO 8601 years for FxA Log retrieval (#1639)
ISO 8601 years are week-numbering years. Specifically, an ISO
year begins on the Monday of Week 01; so if the Gregorian
calendar year starts on a Friday, that Friday, Saturday, and
Sunday all fall in the previous ISO year.

We run into this problem here. %g uses the ISO year, so for
2021-01-01 and 2021-01-02, that is year '20'. This has no match
in the underlying data (no 20-01-01 in FxA logs). Switching to
%y gives us year '21' for this data, and a match in the FxA logs.

https://en.wikipedia.org/wiki/ISO_8601#Week_dates
2021-01-04 16:03:54 -05:00
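
The week-year mismatch described in commit 4a9d44a802 can be reproduced with a minimal Python sketch. It uses `date.isocalendar()` for the ISO year and `strftime("%y")` for the calendar year; the helper names `iso_year_2digit` and `calendar_year_2digit` are illustrative and are not taken from the commit.

```python
from datetime import date


def iso_year_2digit(d: date) -> str:
    """Two-digit ISO 8601 week-numbering year (what %g reports)."""
    iso_year = d.isocalendar()[0]
    return f"{iso_year % 100:02d}"


def calendar_year_2digit(d: date) -> str:
    """Two-digit Gregorian calendar year (what %y reports)."""
    return d.strftime("%y")


# Compare both year notions around the 2020/2021 boundary.
for d in (date(2021, 1, 1), date(2021, 1, 2), date(2021, 1, 4)):
    print(d.isoformat(), iso_year_2digit(d), calendar_year_2digit(d))
```

January 1 and 2, 2021 belong to ISO year 2020, while January 4, 2021 (a Monday) opens ISO year 2021, so only the calendar-year form lines up with date strings such as 21-01-01.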
Anna Scholtz 93a7efb8bf Remove deviations 2021-01-04 11:54:29 -08:00
Anthony Miyaguchi 860d433ca6
Bug 1683765 - Import VPN surveys from alchemer/surveygizmo (#1634)
* Add initial code from telemetry_derived.surveygizmo_daily_attitudes

* Update module to successfully insert documents to bigquery

* Add a schema instead of inferring the schema

* Add recommend survey

* Add the rest of the surveys in the bug

* Add generated dag for survey imports

* Fix linting issues
2020-12-22 13:16:25 -08:00
Sunah Suh b726bf739e
Coalesce search metrics in desktop_search_aggregates_by_userstate_v1 before filtering (#1632) 2020-12-18 16:04:38 -06:00
XuanL 1f63db0f39
[Bug 1673979] Add filter to remove outliers in search count (#1631)
* Add filter to remove outliers

* Add additional filters to other search metrics

Co-authored-by: Sunah Suh <ssuh@mozilla.com>
2020-12-18 13:12:16 -06:00
Anna Scholtz 20a393094d Add total_byte_size and row_count to average ping size table 2020-12-16 13:38:37 -08:00
Anna Scholtz 636c0ad92f average_byte_size as FLOAT 2020-12-16 13:38:37 -08:00
Anna Scholtz 7bafdc9b64 Python script for determining average ping sizes 2020-12-16 13:38:37 -08:00
Anna Scholtz b6abb264ab Simplify script 2020-12-16 13:38:37 -08:00
Anna Scholtz 24cb0314e8 Ignore tables for average_ping_size 2020-12-16 13:38:37 -08:00
Anna Scholtz e45f665f8a Add script for monitoring average ping sizes 2020-12-16 13:38:37 -08:00
Anna Scholtz d860b684c1 Schedule bqetl_deviations 2020-12-16 13:06:58 -08:00
Jeff Klukas 9403e5d384
Deschedule fenix event_types query temporarily (#1624)
This query is failing as of 2020-11-18 due to unexpected input.
We are descheduling it until the logic is updated to handle this situation,
since new DAG runs are staying in the running state, waiting on past
runs that will never complete.
2020-12-15 15:46:01 -05:00
Yura Zenevich f78a8770f9
Updated accessibility clients query and start date. (#1623) 2020-12-15 12:41:30 -05:00
XuanL 7f14e799b4
Bug 1673979 - Add Search dashboard queries (#1619)
* Add search dashboard desktop search aggregates by userstate table and schedule daily

* Add desktop_search_aggregates_for_searchreport

* Add mobile_search_aggregates_for_searchreport

Co-authored-by: Sunah Suh <ssuh@mozilla.com>
2020-12-14 17:27:14 -06:00
Yura Zenevich 210f869441
Adding accessibility clients query for platform accessibility clients (#1612)
Co-authored-by: Jeff Klukas <jklukas@mozilla.com>
2020-12-11 15:05:50 -05:00
Jeff Klukas 10ec3741a2
Move field cleaning to table in main_1pct (#1598)
To be truly useful for quick investigations, the standard approach to views
doesn't work here.

The following query runs in 4 seconds:

```
SELECT
  DATE(submission_timestamp) AS dt,
  COUNT(*)
FROM
  `moz-fx-data-shared-prod.telemetry_derived.main_1pct_v1`
WHERE
  DATE(submission_timestamp) >= '2020-11-24'
  AND subsample_id = 0
GROUP BY
  1
ORDER BY
  1 DESC
```

But the equivalent on top of the existing view takes more than 30 seconds,
bottlenecked on query planning.
2020-12-04 12:31:51 -05:00
Jeff Klukas 5fbd6bdc63
Add main_1pct view (#1571)
* Add main_1pct view

Discussed with BQ engineering in [support case 25679061](https://console.cloud.google.com/support/cases/detail/25679061?project=moz-fx-data-shared-prod)

* Scheduling

* Shredder

* Add subsample_id

* Allow field addition

* Field addition

* yamllint

* Document expiration
2020-12-04 09:34:47 -05:00
Anna Scholtz 883f6f8979 Rewrite extracted_histograms query in main_summary 2020-12-03 12:45:21 -08:00
Anna Scholtz 564c71f58c Reduce complexity in main_summary 2020-12-03 12:45:21 -08:00
Anna Scholtz d8fb63c1d0 Support compressed histograms in clients_daily_histogram_aggregates 2020-12-03 12:45:21 -08:00
Anna Scholtz 3fd22a86a9 Support compressed histograms in italy covid query 2020-12-03 12:45:21 -08:00
Anna Scholtz 8b11a874c0 Support compressed histograms in global_outages query 2020-12-03 12:45:21 -08:00
Anna Scholtz 643db10355 main_summary compact histogram support 2020-12-03 12:45:21 -08:00
Ben Wu b50a95944d
Separate queries on clients_scalar_aggregates by app_version (#1594) 2020-12-03 14:26:35 -05:00
Felix Lawrence 4746042a84
Categorize VPN user attribution (#1579) 2020-12-02 12:42:43 -08:00
Anthony Miyaguchi 4234c40040
Add minimal set of tests for GLAM Fenix queries (#1488)
* Add script to determine query dependencies

* Add schemas and folders for minimal test

* Add schema for geckoview_versions

* Add query params to each query

* Update schema for new queries

* Remove main from bootstrap file

* Add dataset prefix to schemas

* Add failing test for clients_histogram_aggregates

It turns out that the dependency resolution I'm using to autogenerate
the schemas is ignoring the views. I actually want to keep the views
around. The tables also all need to be prefixed with the dataset name or
they won't be inserted into the sql query correctly.

* Add successful test for clients histogram aggregates

* Add minimal tests for clients_scalar_aggregates

* Remove skeleton files for views (no test support for views)

* Add tests for latest versions

* Add tests for scalar bucket counts that passes

* Add scalar bucket counts

* Add test for scalar percentiles

* Add test for histogram bucket counts

* Add passing test for probe counts

* Add test for histogram percentiles

* Add tests for extract counts

* Update readme

* Add data for scalar percentiles test

* Fix linting errors

* Fix mypy issues with tests module

* Name it data instead of tests.*.data

* Ignore mypy on tests directory

* Remove mypy section

* Remove extra line in pytest

* Try pytest invocation of mypy-scripts-are-modules

* Run mypy outside of pytest

* Use exec on pytest instead of mypy

* Update tests/sql/glam-fenix-dev/glam_etl/bootstrap.py

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

* Update tests/sql/glam-fenix-dev/glam_etl/README.md

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

* Document bootstrap in documentation

* Use artificial range for histogram_percentiles

* Simplify parameters for scalar probe counts

* Simplify tests for histogram probe counts

* Add test for incremental histogram aggregates

* Update scalar percentile counts to count distinct client ids

* Update readme for creating a new test

* Use unordered list for sublist

* Use --ignore-glob for pytest to avoid data files

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>
2020-12-01 17:11:45 -08:00
Ben Wu e31d7e031a
Remove filter for search metrics in glam (#1573) 2020-12-01 13:30:44 -05:00
Jeff Klukas 73a45d845d
Reduce complexity of addons_daily (#1581)
Resolves #1580
2020-11-30 15:50:24 -05:00
Jeff Klukas e397ae9d58
Bug 1677609 Add first_seen and core active to clients_last_seen view (#1578)
* Bug 1677609 Add first_seen and core active to clients_last_seen view

Follow-up to https://github.com/mozilla/bigquery-etl/pull/1561 now that the
backfill is complete.

See https://bugzilla.mozilla.org/show_bug.cgi?id=1677609

* Use hex literals

* Consistently use uppercase for hex literals

* Remove timeframe comments from backfilled fields

* Use days_since_first_seen
2020-11-30 14:35:10 -05:00
Jeff Klukas 603fec3850
Bug 1677609 Join clients_first_seen into clients_last_seen (#1561)
* Bug 1677609 Join clients_first_seen into clients_last_seen

Several folks on DS report that they have been getting great value from
clients_first_seen, as the first_seen_date there is a much more stable way
to define new profiles compared to using profile_created_date from pings.

Currently, using first_seen_date requires doing a join between these two tables.
This PR adds that join to the clients_last_seen query itself to make this
workflow more efficient. I'd like to get this merged before we proceed with
the backfill discussed in https://bugzilla.mozilla.org/show_bug.cgi?id=1677609

This change has a few operational implications. Most importantly, it makes
clients_last_seen dependent on clients_first_seen, so those queries can no
longer proceed in parallel. `clients_first_seen` takes on average 10 minutes
to run, so we'll be delaying all ETL downstream of `clients_last_seen` by
about 10 minutes, which seems acceptable. It also adds some mental complexity
to the model.

The extra join does not appear to significantly slow down the
`clients_last_seen` query itself; it scans about 15% more data and consumes
about 15% more slot time.
I expect the performance is dominated by the existing join between
clients_daily and the previous day of clients_last_seen.
2020-11-30 09:28:53 -05:00
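
The scheduling consequence described in commit 603fec3850 can be sketched as a small dependency graph. The example uses Python's standard `graphlib` module (Python 3.9 or later) and only the dependencies named in the message; it is an illustration under those assumptions, not how the repository's DAGs are actually generated.

```python
from graphlib import TopologicalSorter

# Each key depends on the tables in its value set.
# After the change, clients_last_seen waits on clients_first_seen as well.
dependencies = {
    "clients_last_seen": {"clients_daily", "clients_first_seen"},
}

# static_order() yields every table after all of its dependencies.
order = list(TopologicalSorter(dependencies).static_order())
print(order)
```

Whatever order the two upstream tables come out in, `clients_last_seen` is always last, which is why its downstream ETL inherits the run time of `clients_first_seen`.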
Ben Wu df0508841f
Move apple_app_store to marketing project dir (#1570) 2020-11-20 13:17:33 -05:00
Nan Jiang 7b4dc9b906
Bug 1678344 - Add attribution source and ua to onboarding_events_amplitude (#1564)
Co-authored-by: Frank Bertsch <fbertsch@mozilla.com>
2020-11-20 11:50:26 -05:00
Daniel Thorn 243f44f1d5
Fix mozilla vpn waitlist update logic (#1566) 2020-11-19 10:59:35 -08:00
Jeff Klukas e517bd8e07
Bug fix for days_interacted_bits in clients_last_seen (#1563)
As uncovered in https://bugzilla.mozilla.org/show_bug.cgi?id=1677609#c9
this field only ever had a single bit set because it was using the wrong UDF
for combining days.
2020-11-19 09:39:56 -05:00
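
Commit e517bd8e07 concerns bit-pattern fields, in which each bit records activity on one day. As a rough illustration of why the combining step matters, the Python sketch below contrasts a shift-and-OR update with a hypothetical faulty update that discards history. The 28-bit width and both function names are assumptions made for this example; they are not the UDFs used in the repository.

```python
WINDOW_BITS = 28  # assumed window length, for illustration only
MASK = (1 << WINDOW_BITS) - 1


def combine_adjacent_days(previous_bits: int, today_bits: int) -> int:
    """Shift the previous pattern by one day and OR in today's activity."""
    return ((previous_bits << 1) & MASK) | today_bits


def overwrite_with_today(previous_bits: int, today_bits: int) -> int:
    """Hypothetical faulty update: history is dropped, so at most one bit is set."""
    return today_bits


activity = [1, 0, 1, 1]  # 1 = active on that day, oldest first
good = bad = 0
for active in activity:
    good = combine_adjacent_days(good, active)
    bad = overwrite_with_today(bad, active)

print(bin(good), bin(bad))
```

The shift-and-OR version keeps one bit per active day in the window, while the faulty version can never hold more than a single set bit, which matches the symptom reported in the commit message.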
Anthony Miyaguchi b42d1c6934
Remove ordering clause in schema error counts (#1562) 2020-11-17 12:25:55 -08:00
Jeff Klukas 24207636dd
Bug 1677609 Add core active fields to clients_last_seen (#1560)
* Bug 1677609 Add core active fields to clients_last_seen

See https://bugzilla.mozilla.org/show_bug.cgi?id=1677609

This adds just the new underlying bit pattern fields that will need to be
backfilled, and these will be hidden from users initially.
After the backfill is complete, we will update the view to include these
fields along with the various fields derived from them.

We include days_visited_10_uri_bits which was not explicitly requested in
the context of this bug, but was proposed as part of the prototype feature_usage
table (https://github.com/mozilla/bigquery-etl/pull/1193); it may be useful
for future comparisons.

* Update tests to match new logic
2020-11-17 14:15:37 -05:00
Anthony Miyaguchi 3fcaf5becd
Add query for structured missing columns (#1556)
* Add query for structured missing columns

* Update metadata and ignore permission errors while locally testing

* Add scheduling for missing columns job

* Remove exception for activity stream and update comment

* Remove extra comments
2020-11-16 13:31:23 -08:00
Anthony Miyaguchi 11ec629760
Materialize dataset for schema error counts (#1555)
* Materialize dataset for schema error counts

* Update sql/moz-fx-data-shared-prod/monitoring/schema_error_counts_v2/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-11-16 11:56:04 -08:00
Sunah Suh 813a485d2d
Bug 1673404: Add searchmode scalars to clients_daily and search_clients_daily (#1549)
Add searchmode scalars to clients_daily and search_clients_daily
2020-11-13 15:59:20 -06:00
Anthony Miyaguchi a4b92730c0
Add init for telemetry_missing_columns_v3 (#1554) 2020-11-13 11:53:55 -08:00
Anthony Miyaguchi 22bf5caaf6
Bug 1676229 - Add materialized table for missing columns in telemetry dataset (#1551)
* Add materialized table for missing columns in telemetry dataset

* Ignore dryrun failures when fetching references

* Add generated dag

* Add manual reference to main ping copy deduplicate

* Update bigquery_etl/dryrun.py

Co-authored-by: Frank Bertsch <fbertsch@mozilla.com>

* Remove email from all monitoring queries

* Change order of logic

* Remove copy_deduplicate reference due to bug

Co-authored-by: Frank Bertsch <fbertsch@mozilla.com>
2020-11-13 11:15:29 -08:00
Anna Scholtz bd445ada1f Use referenced_tables for monitoring query dependencies 2020-11-13 10:39:34 -08:00
Anna Scholtz a6aa8d8db5 Schedule monitoring tasks in bqetl_monitoring 2020-11-13 10:39:34 -08:00
Daniel Thorn 0789c18553
Expose vpn attribution (#1547) 2020-11-12 13:25:41 -08:00
Yura Zenevich bbd659a264
Added accessibility_tabbing_order_activated telemetry probe to the a11y panel usage query. (#1538)
Co-authored-by: Jeff Klukas <jklukas@mozilla.com>
2020-11-11 10:47:33 -05:00
Ben Wu b15c6c7686
Rename GA standardized country names table (#1540) 2020-11-10 18:12:47 -05:00
Anna Scholtz 3b2a672f78 Support python script query scheduling 2020-11-10 14:36:07 -08:00
Anthony Miyaguchi 21fabe9496
Remove udf_js.glean_percentile (#1532) 2020-11-06 10:55:55 -08:00
Anthony Miyaguchi 30c30880d5
Fix #1527 - Add missing semicolon between udf and test (#1531) 2020-11-06 10:04:42 -08:00
Anthony Miyaguchi b77b542743
Replace GLAM temp functions with persistent functions (#1523)
* Replace GLAM temp functions with persistent functions

* Add generated sql

* Fix typo in udf name

* Add missing files and fully qualify udfs

* Add missing namespace

* Namespace even more things

* format sql
2020-11-05 13:42:09 -08:00
Jeff Klukas 2f738eb4f0
Add bits_from_offsets UDF (#1513)
* Add bits_from_offsets UDF

This is relevant to the emerging "clients_all_time" work drafted in
https://github.com/mozilla/bigquery-etl/pull/1480

I'm proposing we add this under `udf` rather than in `mozfun` because I'm not
yet certain about the naming. If we want to have additional functionality to
support all-time bit patterns, I would like to have those organized under a
single `mozfun` namespace, and it's not clear yet what the interface should
look like.

This function on its own should be enough to empower a new DS workflow for
experimenting with new usage definitions before committing them to clients_daily
and clients_last_seen (to be documented).
2020-11-04 11:31:25 -05:00
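
To make the idea behind commit 2f738eb4f0 concrete, here is a minimal Python sketch of turning a list of day offsets into a bit pattern. It assumes that offset 0 maps to the least significant bit and that the result is a plain integer; the actual UDF's signature, return type, and bit ordering are not described in the commit message and may differ.

```python
from typing import Iterable


def bits_from_offsets(offsets: Iterable[int]) -> int:
    """Set one bit per offset; offset 0 is assumed to be the lowest bit."""
    bits = 0
    for offset in offsets:
        if offset < 0:
            raise ValueError("offsets must be non-negative")
        bits |= 1 << offset
    return bits


# Offsets 0, 2, and 5 set three bits in the resulting pattern.
pattern = bits_from_offsets([0, 2, 5])
print(bin(pattern))
```

A helper like this lets a usage definition be expressed as a list of days first and converted to a bit pattern afterwards, which is the kind of experimentation the commit message has in mind.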
Daniel Thorn 3e7fd0c1ff
Use union instead of join to make query more readable (#1507) 2020-11-02 13:02:24 -08:00
Sunah Suh bf6069a4c2
Bug 1673404: Add new search scalars to main summary en route to clients daily, etc (#1504)
* Add new search scalars to main summary en route to clients daily, etc

* fixup! Merge branch 'master' into search-probes-main-summary
2020-11-02 12:41:02 -06:00
Daniel Thorn 498d84cb57
Fix join logic for mozilla_vpn_external.users_v1 (#1503) 2020-10-30 11:58:11 -07:00
Rhys 1ace0fe2b7
Ran YAMLlint on all yaml files and resolved linting issues (fixes #1297) (#1481)
* "Ran YAMLlint on all yaml files"

* "Moved product info metadata table to README file"

* "Reformatted yaml lists"

* "Updated line breaks so script runs"

* "Updated line breaks so script runs"

* "Undid line breaks"

* "Created custom config file"

* "Removed base document id"

* "Undid line breaks"

* "Reformatted code"

* "Trimmed whitespace"

* "Undid line break"

* "Introduced newline"

* "Trimmed whitespace"

* "Added yamillint to config file"

* "Added yamllint to config file"

* "Moved up yamllint test"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Removing hyphen to fix CI error"

* "Indentation to remove CI error"

* "Included yamllint install in build run"

* "Added yamllint in requirements.txt and .in file"

* "Moved install yamllint step to its own stage"

* "Updated yamllint test"

* "Updated circleci step"

* "Reformatted code"

* "Added yamllint to circleci steps"

* "Added checkout block to yamllint step"

* "Trimmed whitespace"

* "Undid yamllint step"

* "Specified directory name for yamllint test"

* "Fixed yamlint errors"

* "Fixed yamllint errors"

* "Fixed yamllint errors"

* "Fixed yamllint errors"

* "Ignore pathway in linting"

* "Added ignore venv pathway during linting"

* "Updated ignore block"

* "Updated ignore block"

* "Removed ignore block"

* "Updated ignore block"

* "Indented base as a list"

* "Indented base item"

* Update tests/sql/moz-fx-data-shared-prod/search_derived/mobile_search_clients_last_seen_v1/test_day_bit_shifting/expect.yaml

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>

* "Resolved linting errors"

* "Referenced tables put back on same line"

* "Fixed linting error"

* Update sql/moz-fx-data-shared-prod/account_ecosystem_derived/fxa_logging_users_daily_v1/metadata.yaml

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>

* "Fixed linting error"

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>
2020-10-29 17:24:55 -07:00
Daniel Thorn 415ee2fb62
Add ETL for Mozilla VPN acquisition funnel (#1465) 2020-10-29 15:01:17 -07:00
Ben Wu 1529510c6e
Add derived tables for blog.m.o google analytics (#1492) 2020-10-28 17:40:45 -04:00
William Lachance 1ca71429ea
Bug 1673903 - Add urlbar-searchmode to allowed SAP sources (#1494) 2020-10-28 14:40:22 -04:00
jailang 88b0e2066e Listify review_bugs in tests and metadata files 2020-10-28 09:01:43 -07:00
Daniel Thorn d6b2551854
Use resource id, not event id, when updating stripe resource tables (#1491) 2020-10-27 13:03:25 -07:00
Anna Scholtz 71ac2bc686 Exception handling for column sizes 2020-10-27 12:11:11 -07:00
Anna Scholtz 752cd03531 Script for determining column storage sizes 2020-10-27 12:11:11 -07:00
Anthony Miyaguchi b7695049c6
Fix #1457 - Generate and run Fenix ETL for GLAM in glam-fenix-dev (#1458)
* Resolve generated sql to glam-fenix-dev and change output in sql/ dir

* Add new script for testing glam-fenix queries

* Add generated sql for version control

* Use variables correctly in bash

* Remove latest versions from UDF

* Update test to generate minimum set of tables for nightly

* Commit generated queries for testing

* Cast only if not glob

* Ignore dryrun and publish view for glam-fenix-dev

* Fix linting error

* Update comments

* Use DST_PROJECT consistently in scripts

* Update comments

* Update script/glam/test/test_glean_org_mozilla_fenix_glam_nightly

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

* Update script/glam/generate_and_run_desktop_sql

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>

Co-authored-by: Ben Wu <benjaminwu124@gmail.com>
2020-10-22 11:40:52 -07:00
Daniel Thorn 1b7ad0cfef
Setup more imports from Mozilla VPN CloudSQL (#1469) 2020-10-22 07:12:41 -07:00
Jeff Klukas 78ea72c5a5
Allow field addition for one day of clients_daily and friends (#1468)
* Allow field addition for one day of clients_daily and friends

Allows Airflow to apply https://github.com/mozilla/bigquery-etl/pull/1455

* Commit DAG changes
2020-10-21 13:30:54 -04:00
Sunah Suh c8d0136694
Bug 1671517: Add event counts to clients daily (#1455) 2020-10-20 13:29:21 -05:00
Jeff Klukas a9c531e3c6
Exempt a few files from dry run due to new table-level ACLs (#1462)
* Exempt a few files from dry run due to new table-level ACLs

The dry run service can no longer perform queries with wildcard table
specifications or access raw AET data. See https://github.com/mozilla-services/cloudops-infra/pull/2599

* Verbose referenced_tables for AET logging clients daily
2020-10-20 10:34:02 -05:00
Jeff Klukas eae0d6d3d2
Revert field order in smoot nondesktop (#1454)
Fixes https://github.com/mozilla/bigquery-etl/issues/1453
2020-10-19 14:23:19 -07:00
Anthony Miyaguchi b66f948bdd
Add CREATE OR REPLACE VIEW clause to geckoview_versions and move it to org_mozilla_fenix (#1452)
* Add CREATE OR REPLACE VIEW clause to geckoview_versions

* Move geckoview version to org_mozilla_fenix
2020-10-19 12:36:45 -07:00
Jeff Klukas 4538e7c749
Formalize product names in nondesktop_clients_last_seen (#1380)
The `nondesktop_clients_last_seen_v1` view was developed mostly as an
internal implementation detail for downstream tables, but it has become
useful in its own right. This PR formalizes the view by providing an alias
without a version modifier and it adds a `product` field with application
names that are short but more meaningful than the `app_name` field.

See discussion in https://jira.mozilla.com/browse/DO-330 about confusion that
has resulted from the name "Fennec iOS" used in dashboards, etc. This is a
step toward reducing that kind of confusion.

This PR also adds `contributes_to_2019_kpi` and `contributes_to_2020_kpi` fields
as source of truth for how we count KPI metrics. That logic is currently
copied and pasted in several places, which could lead to errors.

This will need a fair amount of review from data users before moving forward.
It will also require backfilling several downstream tables and communicating
the change.
2020-10-19 15:16:21 -04:00
Anthony Miyaguchi 49c9bbf340
Add view for geckoview versions with one row per build hour (#1450) 2020-10-19 09:26:28 -07:00
Daniel Thorn 9d7a566e1c
Refactor stripe tables to reduce confusion (#1444) 2020-10-16 15:08:44 -07:00
Anthony Miyaguchi 2521f926e7
Add daily schedule for geckoview versions (#1447)
* Add daily schedule for geckoview versions

* Remove unnecessary parameters
2020-10-16 14:37:30 -07:00
Anthony Miyaguchi 349dff3ca2
Add table to determine Fenix nightly mapping of builds to geckoview versions (#1419)
* Add initial incremental query for geckoview build dates

* Add initial tests for incremental query (WIP)

* Add files for initial tests

* Rework query so it doesn't fail during tests

* Fix schema so queries run

* Add passing test for init

* Add test for query aggregation

* Add metadata file for scheduling the query

* Move scripts from fenix_nightly to fenix

* Remove scheduling

* Add document strings.

* Change dataset reference and indent comments correctly

* Remove init and address feedback

* remove init file
* make query idempotent by appending window to each submission_date
* rename n_builds to n_pings
* reduce window size from 30 days to 14 days
* avoid use of subqueries

* Update tests for query

* Fix tests

* Add failing test for 100

* Fix query so it works across fx100 boundary

* Add linting fixes
2020-10-16 11:57:23 -07:00
Jeff Klukas b9ed5f1242
Bug 1654078 Limit geo to country level in regrets-reporter view (#1440)
* Bug 1654078 Limit geo to country level in regrets-reporter view

See https://bugzilla.mozilla.org/show_bug.cgi?id=1654078#c45
2020-10-15 15:18:26 -04:00
Anthony Miyaguchi a7271f0189
Replace udf.fenix_build_to_datetime with mozfun reference (#1441) 2020-10-15 11:54:22 -07:00
Jeff Klukas d809e6fa2d
Fix field name error causing Shredder to miss FxA account deletions (#1428)
While looking at Shredder logs, I noticed that all entries for FxA-related
derived tables show 0 deletions, like:

> 712215424909 bytes and deleted 0 rows from moz-fx-data-shared-prod.firefox_accounts_derived.fxa_users_daily_v1

It appears that `account.deleted` events populate a field named `uid` rather
than `user_id`. I was able to verify this by choosing a recent `uid` value
from a deletion event and counting events where that same value appears as
`user_id` in FxA logs. There were matching messages.
2020-10-13 14:44:14 -04:00
Daniel Thorn 6abddb0f0d
Fix backticks in stripe derived views (#1425) 2020-10-12 12:43:37 -07:00
Anna Scholtz cec9412abd
Fix backticks in stripe views (#1423)
* Fix backticks in stripe views

* Fix backticks in mozilla vpn view

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
2020-10-12 11:39:27 -07:00
Frank Bertsch cfe5973f80 Allow for cross-project references in routines
- Scrape all projects for routine defns when generating tests
- Create UDFs as non-temp for stored procedure tests
- Make assert functions default non-temp (to support above)
2020-10-09 15:17:56 -04:00
Daniel Thorn 228fc8e041
Fully qualify stripe and mozilla_vpn views (#1414) 2020-10-09 11:06:28 -07:00
Daniel Thorn 6ca03e713e
Export mozilla vpn waitlist from cloud SQL (#1397) 2020-10-09 10:18:30 -07:00
Anna Scholtz 0d51459bd1 Move dependencies to udf_js_lib 2020-10-08 10:30:22 -07:00
Anna Scholtz 5a8311e2af Update UDF parsing 2020-10-08 10:30:22 -07:00
Jeff Klukas 3e0f2e4511
Bug 1669516 Use `app_display_version` for Fenix AMO stats (#1394)
* Bug 1669516 Use `app_display_version` for Fenix AMO stats

* Use geckoview_version for fenix nightly

* Remove deprecated installs_v1

* Remove dev installs_v1
2020-10-08 08:34:32 -04:00
Daniel Thorn 203db55d16
Remove incorrect destination partition from stripe tasks (#1398) 2020-10-06 15:33:55 -07:00
Jeff Klukas 1bab64d7ff
Bug 1635918 Enable deletion of data from AET tables on user request (#1396)
* Bug 1635918 Enable deletion of data from AET tables on user request

* Skip dryrun

* Add to SOURCES

* Add timestamp to deletions
2020-10-06 16:30:01 -04:00
Anna Scholtz 518718d140
Move event_analysis back to mozfun (#1386) 2020-10-06 08:24:23 -04:00
Daniel Thorn 88ed89bd2c
Add stripe ETL to support Mozilla VPN dashboard (#1349) 2020-10-05 15:10:01 -07:00
Anna Scholtz 93bc51ba5e Move queries to right directories 2020-10-05 12:59:58 -07:00
Anna Scholtz bf33837ef7 Resolve rebase conflicts 2020-10-05 12:59:58 -07:00
Anna Scholtz d1c67dab53 Move projects into high-level sql/ folder 2020-10-05 12:59:58 -07:00