Commit Graph

4169 Commits

Author SHA1 Message Date
Anna Scholtz d66bb2a8d9
Convert non_user_facing_dataset_suffixes to tuple when loading from bqetl_project.yaml (#4066) 2023-07-12 11:12:01 -07:00
dependabot[bot] 40baf741c0
Bump sqlglot from 17.3.0 to 17.4.1 (#4061) 2023-07-12 17:59:26 +00:00
dependabot[bot] ccc28d9ec9
Bump gitpython from 3.1.31 to 3.1.32 (#4058) 2023-07-12 17:37:50 +00:00
Anna Scholtz 98006dcce9
Fix paths for skipped views (#4063) 2023-07-12 09:53:59 -07:00
Marlene Hirose 96c80ced58
take out app_token from log and out of view (#4056) 2023-07-12 09:32:05 -07:00
akkomar 4ed032cceb
Set ConfigLoader's project directory on module initialization (#4062) 2023-07-12 17:45:08 +02:00
Lucia 49b6fba6e9
Add os_grouped to the view active_users_aggregates (#4044)
* Add os_grouped to the view.

* bqetl reformatting.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
Co-authored-by: Marlene Hirose <92952117+Marlene-M-Hirose@users.noreply.github.com>
2023-07-12 13:17:04 +02:00
Anna Scholtz 5c0748cf79
Add missing / for generating docs (#4055) 2023-07-11 15:55:01 -07:00
Anna Scholtz 3f9181c6e1
Fix getting skipped routines from config (#4054) 2023-07-11 14:23:59 -07:00
Anna Scholtz 03357769cc
Move view, schema and remaining configs to bqetl_project.yaml (#4051)
* Move view configs to bqetl_project.yaml

* Move schema config to bqetl_project.yaml

* Move docs config to bqetl_project.yaml

* Replace remaining configs
2023-07-11 13:10:57 -07:00
dependabot[bot] 08babb3d7c
Bump mkdocs-material from 9.1.17 to 9.1.18 (#4013) 2023-07-11 19:29:49 +00:00
Sean Rose a5c2370bc7
Add init logic to new incremental Stripe ETLs (DENG-974). (#4052) 2023-07-11 12:08:04 -07:00
dependabot[bot] 737a23b891
Bump pip-tools from 6.13.0 to 6.14.0 (#4011) 2023-07-11 18:52:28 +00:00
kik-kik 7d3f5d930e
added dataset_metadata.yaml for adjust namespace (#4047) 2023-07-11 20:29:12 +02:00
Sean Rose af6012969c
Move `stripe_subscriptions_changelog_v1` ETL to `stripe_external` dataset (DENG-974) (#4036)
* Move `stripe_subscriptions_changelog_v1` ETL to `stripe_external` dataset.

* Include all subscription metadata in `stripe_external.subscriptions_changelog_v1`.
2023-07-11 11:09:53 -07:00
Anna Scholtz 8d72cfa9fe
Move routine config to bqetl_project.yaml (#4038) 2023-07-11 10:52:48 -07:00
Leli 8ccd05433a
Bug 1842216 exclude main_remainder_v4 from telemetry_missing_columns_v3 (#4031)
* Bug 1842216 exclude main_remainder_v4 from telemetry_missing_columns_v3

* exclude view from dryrun

* add remainder for saved_session and first_shutdown

* Update sql/moz-fx-data-shared-prod/monitoring_derived/telemetry_missing_columns_v3/query.sql

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>

---------

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
2023-07-11 19:08:28 +02:00
dependabot[bot] 405c74a58d
Bump pandas from 2.0.2 to 2.0.3 (#4001) 2023-07-11 16:54:47 +00:00
dependabot[bot] 7a118fd3e7
Bump exceptiongroup from 1.1.1 to 1.1.2 (#4012) 2023-07-11 16:32:48 +00:00
dependabot[bot] 15295f77bc
Bump sqlglot from 16.7.4 to 17.3.0 (#4027) 2023-07-11 16:09:52 +00:00
Glenda Leonard b71e25bc77
Removed checks.sql from dryrun. (#4050) 2023-07-11 11:45:49 -04:00
Glenda Leonard acdacb5095
Reduced threshold from 250,000 to 50,000 (#4037) 2023-07-11 10:37:57 -04:00
Alexander 19d5fe9c41
Updated main_remainder schema.yaml (#4049) 2023-07-11 09:47:10 -04:00
Leli 6d54c049db
add adjust view to bqetl_project to skip (#4046) 2023-07-11 15:01:51 +02:00
Daniel Thorn 6137048eeb
DS-2642 - Import stripe itemized tax report (#3999) 2023-07-10 17:22:18 -07:00
Marlene Hirose e53afd05e5
take out ending underscore from table name (#4042) 2023-07-10 16:49:56 -07:00
Sean Rose 4c3b3ee188
Override `--replace=true` from `.bigqueryrc` for `stripe_external.customers_changelog_v1` (DENG-974). (#4043) 2023-07-10 16:35:38 -07:00
Winnie Chan 91882dd150
DENG-806 Added backfill process cli command (#3936)
* Added backfill process command
2023-07-10 16:13:42 -07:00
Marlene Hirose 5819319c8e
change api_token variable to adjust_api_token (#4041) 2023-07-10 15:28:52 -07:00
Marlene Hirose ab829349b1
add arguments to metadata and query.py, remove os_environ variables (#4039)
* add arguments to metadata and query.py, remove os_environ variables

* add arguments to dag

* whitespace fixing

* add carriage return
2023-07-10 14:54:17 -07:00
Sean Rose 7ee239d3e6
Add `stripe_external.customers_changelog_v1` ETL (DENG-974) (#4035)
* Add `stripe_external.customer_discount_v1` ETL.

* Add `stripe_external.customers_changelog_v1` ETL.
2023-07-10 14:17:50 -07:00
Anna Scholtz 3a61fd34bb
Move format skip files to bqetl_project.yaml (#4033) 2023-07-10 10:10:47 -07:00
Anna Scholtz d5a6dc97f4
Docs for new `bqetl_project.yaml` (#4018)
* Add ConfigLoader and move dry run skip to bqetl_project.yaml

* Update bqetl configuration docs
2023-07-10 09:50:22 -07:00
kik-kik 9b5c04a7bb
bug(1741487): Rename url2 and related fields in stable views (#4029)
* Bug 1741487 - Rename url2 and related fields in stable views

This removes the following unpopulated fields from Glean views: `metrics.url`, `metrics.text`, `metrics.jwe`, and `metrics.labeled_rate`. If any of these metrics exists in the source table under a `2`-suffixed name, it is also aliased to its original name (`url2` to `url` and so on).
Suffixed fields are still preserved until view consumers migrate.

* Remove redundant comma from generated sql

* Ignore missing fields in views if any of them were removed

* added a todo comment

* Added additional context around why we are excluding some of the non-suffixed fields and why we are aliasing to remove suffix 2 from some fields

---------

Co-authored-by: Arkadiusz Komarzewski <akomarzewski@mozilla.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-07-10 09:31:15 -07:00
Winnie Chan ae2edf2234
DENG-1052 Added bigquery_usage_v2 table (#3978)
* Added bigquery_usage_v2 table
2023-07-10 09:09:20 -07:00
Marlene Hirose 1592e337cf
take rbaffourawuah@mozilla.com off of email list for DAG (#4034)
* take rbaffourawuah@mozilla.com off of email list for DAG

* remove rbaffourawuah@mozilla.com from dags.yaml

* fix formatting and remove another instance of rbaffourawuah@mozilla.com from DAG

* fix order of emails

* modify name to be more explicit

* rename view and folder

* update view name

* update view location

* update view name
2023-07-07 22:25:44 -07:00
Marlene Hirose d8ce0307a5
Ds 2944 create external adjust table redux (#3907)
* add metadata, schema yamls and query.py

* created adjust_derived namespace

* add query.py, metadata, schema, dataset for testing

* delete extraneous file, update DAG name

* modify bqetl_adjust DAG redux

* update DAG name, take out '_derived'

* update table name in view

* standardize table names across files

* regenerate DAG

* update schema in both locations

* add query.py, metadata, schema yaml files

* take out extraneous print statements, update datasets to be 'adjust' or 'adjust_derived'

* add submission date to date_partition_parameter

* update table name to be just one table

* add DAG for adjust_derived

* add bq_etl adjust_derived DAG to yaml file

* add note about API token

* revert changes to bqetl.adjust.py

* use proper task_id

* fix start dates

* add python command and docker image

* add python command and docker image

* delete extraneous code

* comment out docker part in old adjust dag

* add whitespace, delete extraneous code

* Update sql/moz-fx-data-shared-prod/adjust/adjust_derived/view.sql

Co-authored-by: Lucia <30448600+lucia-vargas-a@users.noreply.github.com>

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/query.py

Co-authored-by: Lucia <30448600+lucia-vargas-a@users.noreply.github.com>

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/query.py

Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>

* updated logic to check if response dictionary is not empty, moved view out of nested folder, added token ownership statement to metadata file, turned off email retry in dags.yaml, separated out clean up of json to its own function

* take out extraneous if statement and move else statement

* reorder where comment is to make more sense

* more description as to why we're using mhirose's API token

* take out periods

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* combine adjust DAGs

* change logic for query_export check loop continuance, adapt metadata.yamls

* add blank parameters test

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* add arguments to metadata.yaml

* remove external table reference

* refactor to add date parameter

* refactor based on Circle CI's advice

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/query.py

Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>

* Update sql/moz-fx-data-shared-prod/adjust_derived/adjust_derived_v1/query.py

Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>

* take out TODO comment

---------

Co-authored-by: kik-kik <kignasiak@mozilla.com>
Co-authored-by: Lucia <30448600+lucia-vargas-a@users.noreply.github.com>
Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-07-07 15:32:03 -07:00
Anna Scholtz b3efbf3c88
Pass date partition parameters to check tasks (#4032) 2023-07-07 12:36:38 -07:00
Frank Bertsch 8d08cec820
Copy App Store tables from Fivetran (#4030)
* Copy App Store tables from Fivetran

* Move dryrun to config

* Generate DAG

* Dont dryrun views

* Add schemas
2023-07-07 14:24:03 -04:00
Anna Scholtz dc482ad8d5
[DENG-948] Macro support for data checks (#3993)
* Add support for check macros

* Add min_rows() check macro

* Add is_unique() check macro

* Add in_range() check macro

* Update ssl_ratios DAG

* Add test for macro checks

* Fix rendering
2023-07-06 14:36:59 -07:00
Anna Scholtz e719266146
Remove stray commas from skip list (#4026) 2023-07-06 12:12:52 -07:00
Anna Scholtz d9bda0df7e
Add ConfigLoader and move dry run skip to bqetl_project.yaml (#4000)
* Add ConfigLoader and move dry run skip to bqetl_project.yaml

* format tests
2023-07-06 10:42:29 -07:00
Sean Rose 6f1066e0bc
Update Stripe subscriptions ETL to use customer shipping addresses (DENG-1083). (#4024) 2023-07-05 21:46:37 -07:00
Anna Scholtz 9b1b745702
Fix DAG formatting (#4023) 2023-07-05 13:39:48 -07:00
Anna Scholtz 0f3e54346c
Split main_1pct (#4022) 2023-07-05 13:00:29 -07:00
kik-kik 10680a7e81
updated firefox_ios_derived attributable_clients_v1 and firefox_ios_clients_v1 execution_delta to 3h instead of 1h to fix sensor not working correctly (#4020) 2023-07-05 12:03:26 +02:00
Anna Scholtz 3286508bc5
Update view metadata in a single update_table() operation (#4017) 2023-07-04 10:55:55 -07:00
kik-kik ee8b159a40
updated the nonprod_fxa_server_events_v1 schema to match that of the source table (#4016) 2023-07-04 17:53:22 +02:00
kik-kik 39631e6df4
updated the nonprod_fxa_server_events_v1 query to explicitly cast jsonPayload.fields.t as STRING (#4015) 2023-07-04 16:51:51 +02:00
Sean Rose 352cffedb8
Add `stripe_subscriptions_history_v2` ETL (DENG-974) (#4009)
* Add `synced_at` column to `stripe_subscriptions_changelog_v1`.

* Tweak `stripe_subscriptions_changelog_v1` tax rate and discount joins to only include those that existed when the change happened.

* Parse subscription metadata in `stripe_subscriptions_changelog_v1`.

* Add `stripe_external.invoice_line_item_v1` ETL.

* Add `stripe_subscriptions_revised_changelog_v1` ETL.

* Add `stripe_subscriptions_history_v2` ETL.
2023-06-30 14:18:31 -07:00