Commit graph

43 commits

Author SHA1 Message Date
Anna Scholtz a6aa8d8db5 Schedule monitoring tasks in bqetl_monitoring 2020-11-13 10:39:34 -08:00
Ben Wu 9923823ddd
Change GA dag to run later (#1515) 2020-11-04 14:47:32 -05:00
Rhys 1ace0fe2b7
Ran YAMLlint on all yaml files and resolved linting issues (fixes #1297) (#1481)
* "Ran YAMLlint on all yaml files"

* "Moved product info metadata table to README file"

* "Reformatted yaml lists"

* "Updated line breaks so script runs"

* "Updated line breaks so script runs"

* "Undid line breaks"

* "Created custom config file"

* "Removed base document id"

* "Undid line breaks"

* "Reformatted code"

* "Trimmed whitespace"

* "Undid line break"

* "Introduced newline"

* "Trimmed whitespace"

* "Added yamillint to config file"

* "Added yamllint to config file"

* "Moved up yamllint test"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Trimmed whitespace"

* "Removing hyphen to fix CI error"

* "Indentation to remove CI error"

* "Included yamllint install in build run"

* "Added yamllint in requirements.txt and .in file"

* "Moved install yamllint step to its own stage"

* "Updated yamllint test"

* "Updated circleci step"

* "Reformatted code"

* "Added yamllint to circleci steps"

* "Added checkout block to yamllint step"

* "Trimmed whitespace"

* "Undid yamllint step"

* "Specified directory name for yamllint test"

* "Fixed yamlint errors"

* "Fixed yamllint errors"

* "Fixed yamllint errors"

* "Fixed yamllint errors"

* "Ignore pathway in linting"

* "Added ignore venv pathway during linting"

* "Updated ignore block"

* "Updated ignore block"

* "Removed ignore block"

* "Updated ignore block"

* "Indented base as a list"

* "Indented base item"

* Update tests/sql/moz-fx-data-shared-prod/search_derived/mobile_search_clients_last_seen_v1/test_day_bit_shifting/expect.yaml

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>

* "Resolved linting errors"

* "Referenced tables put back on same line"

* "Fixed linting error"

* Update sql/moz-fx-data-shared-prod/account_ecosystem_derived/fxa_logging_users_daily_v1/metadata.yaml

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>

* "Fixed linting error"

Co-authored-by: Anthony Miyaguchi <acmiyaguchi@gmail.com>
2020-10-29 17:24:55 -07:00
Ben Wu 1529510c6e
Add derived tables for blog.m.o google analytics (#1492) 2020-10-28 17:40:45 -04:00
Anthony Miyaguchi 2521f926e7
Add daily schedule for geckoview versions (#1447)
* Add daily schedule for geckoview versions

* Remove unnecessary parameters
2020-10-16 14:37:30 -07:00
Daniel Thorn 6ca03e713e
Export mozilla vpn waitlist from cloud SQL (#1397) 2020-10-09 10:18:30 -07:00
Daniel Thorn 88ed89bd2c
Add stripe ETL to support Mozilla VPN dashboard (#1349) 2020-10-05 15:10:01 -07:00
Anna Scholtz ad896eed82 Remove anomdtct 2020-10-02 10:31:42 -07:00
Jeff Klukas d539fafb59
Bug 1635906 Add bqetl support for scripts and script for AET lookup (#1323)
* Bug 1635906 Add bqetl support for scripts and script for AET lookup

There are some code changes here for DAG generation and for testing.

* Apply suggestions from code review

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* DAG fixups

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-09-23 15:28:25 -04:00
Frank Bertsch 818f680052
Create DAG for events rollup (#1301)
* Create DAG for events rollup

* Update sql/org_mozilla_firefox_derived/events_daily_v1/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-09-10 18:50:45 -04:00
Jeff Klukas b0efb898c5
Add bqetl_fxa_events DAG (#1163)
* Add bqetl_fxa_events DAG

Replaces the fxa_events DAG in telemetry-airflow

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-07-14 07:58:11 -04:00
Anna Scholtz 2fa01c6ddb bqetl_asn_aggregates 2020-07-13 11:15:30 -07:00
Daniel Thorn 2d8e08294b
Add query for monitoring deletion request volume (#1106) 2020-07-13 11:11:06 -07:00
Anna Scholtz 41b05f58d3 Rescheduling 2020-07-10 13:30:24 -07:00
Anna Scholtz ee4a3ee0ce More task re-scheduling 2020-07-10 13:30:24 -07:00
Anna Scholtz c4054ccdbf Reschedule and set main_summary priority 2020-07-09 13:08:17 -07:00
Anna Scholtz c2d1430ede Reschedule DAGs 2020-07-08 13:46:03 -07:00
Anna Scholtz 75ad93c9cd Calculate execution delta for tasks 2020-07-08 13:46:03 -07:00
Anna Scholtz 4d85266015 Refactoring 2020-07-06 11:27:26 -07:00
Anna Scholtz c5d727c336 DAG for daily experiments clients 2020-07-06 11:27:26 -07:00
Anna Scholtz e444d450b6 Support multipart queries and main_summary 2020-07-06 11:27:26 -07:00
Alessio Placitelli 76bac7a98e
Create an ETL job for the Internet Outages (#1058)
* Add aggregation by country

* Copy the initial Italy focus query

This initial commit provides a baseline for the
next commits to ease review, since this initial
code was already reviewed.

* Cleanup the country list and replace FULL OUTER with LEFT joins

* Aggregate by city for cities with more than 15k inhabitants

The actual 15k limit is enforced at ingestion time.
This further limits the resulting cities to ones with at
least 1000 active daily users.

* Produce hourly aggregates

* Move the query to the `internet_outage` dataset

* Provide automatic daily scheduling through AirFlow

* Tweak the SQL addressing review comments

This additionally changes the `CAST` to
`SAFE_CAST` to account for weirdnesses in
the data.

* Add ssl_error_prop

* Add missing_dns_success

* Add missing_dns_failure

* Lower the minimum reported bucket size to 50

This allows us to match the EDA by Saptarshi and
to have a better comparable baseline.

* Document the oddities around `submission_timestamp_min`
2020-07-01 06:44:40 +02:00
Anna Scholtz 49ee9b34b1 Rename bqetl_clients to bqetl_clients_daily 2020-06-24 12:18:04 -07:00
Anna Scholtz 38e6acbee6 DAGs for client queries 2020-06-24 12:18:04 -07:00
Anna Scholtz 48430fcfe1 Move addons queries to bqetl_addons DAG 2020-06-24 08:54:22 -07:00
Anna Scholtz 45fb7d41e5 Add bqetl_search DAG 2020-06-12 09:47:51 -07:00
Anna Scholtz d72918d6e7 Create bqetl_activity_stream DAG 2020-06-12 08:32:42 -07:00
Anna Scholtz 038b4b18e7 DAG for messaging system queries 2020-06-10 13:45:22 -07:00
Anna Scholtz 43e806c9e7 Move smoot queries to bqetl_gud DAG 2020-06-09 09:38:11 -07:00
Anna Scholtz 69d68c55c3 bqetl_mobile_search DAG 2020-06-08 13:33:19 -07:00
Anna Scholtz 823542b235 bqetl_nondesktop DAG 2020-06-08 12:36:06 -07:00
Anna Scholtz 494f8c1a5f bqetl_core DAG 2020-06-08 12:05:26 -07:00
Anna Scholtz b370c4710d DAG for vrbrowser queries 2020-06-05 10:52:12 -07:00
Anna Scholtz 5b906f859c Add version to error_aggregates query 2020-06-03 14:39:51 -07:00
Jeff Klukas 5cbdfa0dcc Add bqetl_amo_stats DAG 2020-06-03 14:28:59 -04:00
Anna Scholtz 914e74c9ce Add support for defining external task dependencies 2020-06-03 09:32:50 -04:00
Anna Scholtz 3a57c323ce Use consistent naming for DAG publishing JSON 2020-06-02 11:04:32 -07:00
Anna Scholtz 45961c331b Handle public data export in DAGs 2020-06-02 11:04:32 -07:00
Anna Scholtz ca287bdf3a Keep error_aggregates query without version 2020-05-28 14:12:24 -07:00
Anna Scholtz e7b2b56c01 KPI dashboard generated Airflow DAG 2020-05-28 14:12:24 -07:00
Anna Scholtz d5822b952d Generate error aggregates DAG 2020-05-28 14:12:24 -07:00
Anna Scholtz 22f1c39b5b Error aggregates as scheduled query 2020-05-28 14:12:24 -07:00
Anna Scholtz 1116be16b6 Pull in telemetry-airflow 2020-05-28 14:12:24 -07:00