akkomar
66729aa702
FXA-6721 Setup import of accounts table from FxA production CloudSQL ( #4423 )
2023-10-25 09:50:25 +02:00
m-d-bowerman
ae7d0f4766
Change pocket_available for new Pocket markets ( #4472 )
2023-10-24 12:50:45 -07:00
Frank Bertsch
55c5d412c1
Allow running multiple checks ( #4471 )
...
* Allow running multiple checks
* Don't yield anything on no matches
2023-10-24 14:39:01 -04:00
Sergio E. Betancourt
2b2697e8f3
[RS-826] New job to calculate newtab visits -> activity stream sessions ( #4387 )
...
* New job to calculate newtab visits -> activity stream sessions
* Removing newline chars at end of file
* Removing newline chars at end of file
* Removing newline chars at end of file
* Addressing comment suggestions
* Format
* Add bqetl_ads DAG
* Add ACL to nt_visits_to_sessions_conversion_factors_daily_v1
* Add metadata files
* Add view to dry_run skip list
* Oops, fix the view
---------
Co-authored-by: Curtis Morales <cmorales@mozilla.com>
2023-10-24 12:51:14 -04:00
dependabot[bot]
a4d2a0a2d1
Bump mkdocs from 1.5.2 to 1.5.3 ( #4321 )
...
Bumps [mkdocs](https://github.com/mkdocs/mkdocs ) from 1.5.2 to 1.5.3.
- [Release notes](https://github.com/mkdocs/mkdocs/releases )
- [Commits](https://github.com/mkdocs/mkdocs/compare/1.5.2...1.5.3 )
---
updated-dependencies:
- dependency-name: mkdocs
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-24 11:49:36 -04:00
dependabot[bot]
c3e98c341b
Bump urllib3 from 1.26.14 to 1.26.18 ( #4450 )
...
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.14 to 1.26.18.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.14...1.26.18 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-24 10:19:49 -04:00
dependabot[bot]
fb2574cc33
Bump gitpython from 3.1.36 to 3.1.40 ( #4458 )
...
Bumps [gitpython](https://github.com/gitpython-developers/GitPython ) from 3.1.36 to 3.1.40.
- [Release notes](https://github.com/gitpython-developers/GitPython/releases )
- [Changelog](https://github.com/gitpython-developers/GitPython/blob/main/CHANGES )
- [Commits](https://github.com/gitpython-developers/GitPython/compare/3.1.36...3.1.40 )
---
updated-dependencies:
- dependency-name: gitpython
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-24 09:01:16 -05:00
dependabot[bot]
5e42214cc0
Bump gcsfs from 2023.6.0 to 2023.10.0 ( #4468 )
...
Bumps [gcsfs](https://github.com/fsspec/gcsfs ) from 2023.6.0 to 2023.10.0.
- [Commits](https://github.com/fsspec/gcsfs/compare/2023.6.0...2023.10.0 )
---
updated-dependencies:
- dependency-name: gcsfs
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-24 06:44:38 -07:00
Curtis Morales
d529f2583d
RS-829 Add Focus to the list of app_names for mobile search revenue levers ( #4394 )
...
* Add Focus to the list of app_names for mobile search revenue levers
* Add country for Bing and DDG
2023-10-23 16:17:44 -04:00
Alexander
e1b5a2019a
Update clients_first_seen_28_days_later metadata to write to correct partition ( #4466 )
2023-10-23 15:03:48 -04:00
Daniel Thorn
8c499c5772
DENG-476 - Remove telemetry.main et al view overrides so that they will point at v5 ( #4456 )
2023-10-23 10:05:26 -07:00
akkomar
f99de5d9e5
FXA-6721 Add user-facing view for stage FxA accounts table ( #4413 )
2023-10-23 16:16:14 +02:00
Alekhya
a685e17279
CRemove the filter on messaging_system_event ( #4465 )
2023-10-23 09:53:10 -04:00
Alekhya
a72ffc4dab
Remove ETL checks ( #4460 )
...
Co-authored-by: akkomar <akkomar@users.noreply.github.com>
2023-10-23 08:59:18 -04:00
Alexander
c5dd137f26
Fix error_aggregates sample data references ( #4462 )
2023-10-19 15:43:47 -04:00
Alexander
2eabb130ca
DENG-1714 - Create clients_first_seen_28_days_later ( #4395 )
...
* DENG-1714 - Create clients_first_seen_28_days_later
* Regenerate DAG
2023-10-19 14:14:45 -04:00
Frank Bertsch
ac0af012c2
Add opt-in to running checks for backfill ( #4455 )
2023-10-18 17:34:58 -04:00
Alekhya
6f3d34ba67
DS3244 - Add derived datasets for review checker data ( #4447 )
...
* Add review checker derived datasets
* Add bqetl_review_checker dag
Fix
* Fix CI validate dag step
* Incorporate feedback from Alex
* Fix CI
* change client last seen to clients first seen
change client last seen to clients first seen
* fix dag
2023-10-18 16:57:15 -04:00
kik-kik
cd0f042de5
feat(DENG-1777): adding retention tables for fenix ( #4425 )
...
* Adding retention tables for firefox android / fenix
* regenerated bqetl_analytics_tables DAG
* fixing schema for fenix.funnel_retention_clients_week_2_v1
* fixed incorrect wrong table source for retention week 4 and format SQL applied
2023-10-18 11:56:42 -04:00
akkomar
7a36416554
Set project in init jobs ( #4453 )
...
This fixes https://github.com/mozilla/bigquery-etl/pull/4452
2023-10-18 16:58:18 +02:00
akkomar
0171f93596
Set project in init jobs ( #4452 )
2023-10-18 16:04:09 +02:00
akkomar
c3c5ecffd4
Don't set destination table for init jobs ( #4451 )
...
This reverts https://github.com/mozilla/bigquery-etl/pull/4193/files
By convention all but two init.sql jobs use `CREATE TABLE` statement. Setting destination table on a job that runs these queries causes an `BadRequest: 400 Cannot set destination table in jobs with DDL statements` error as observed in [1].
Apart from removing setting of destination_table this fixes two init queries.
[1] https://workflow.telemetry.mozilla.org/dags/copy_deduplicate/grid?dag_run_id=scheduled__2023-10-17T01%3A00%3A00%2B00%3A00&task_id=baseline_clients_first_seen&tab=logs
2023-10-18 14:45:22 +02:00
Daniel Thorn
3792797d1d
DENG-476 - Update clients daily and clients first seen to reference main_v5 ( #4438 )
2023-10-17 16:54:42 -07:00
Frank Bertsch
164ba19abf
Glean usage checks ( #4445 )
...
* WIP: Add checks for glean_usage
* Ignore pycache in autogenerated click cmds
* Move check to backfill command
* Remove view checks
2023-10-17 17:03:41 -04:00
Anna Scholtz
34c8cf35e7
Fix error_aggregates tests to reference main_v5 ( #4448 )
2023-10-17 13:16:40 -07:00
Alekhya
7f0b7d522a
DENG1546 - Fix bqetl_serp DAG failure ( #4429 )
2023-10-17 15:51:48 -04:00
wil stuckey
36cd455d2a
Add `json.from_map` UDF ( #4414 )
2023-10-17 14:00:25 -05:00
Anna Scholtz
358ea9d574
Fix Glean usage generation; don't skip query generation when tables don't exist ( #4446 )
2023-10-17 10:54:57 -07:00
Daniel Thorn
b692735c80
DENG-476 - Update FoG decision ETL to reference main_v5 ( #4436 )
2023-10-17 07:14:10 -07:00
Daniel Thorn
40ffe7f0b6
DENG-476 - Update UDF to reference main pings instead of main_v4 ( #4440 )
2023-10-16 15:49:58 -07:00
Daniel Thorn
3b14b881f8
DENG-476 - Remove unnecessary references to main_v4 in addons ETL ( #4434 )
2023-10-16 15:13:37 -07:00
Daniel Thorn
065f4c78f9
DENG-476 - Update latest versions to reference main_v5 ( #4441 )
2023-10-16 14:56:43 -07:00
Daniel Thorn
b63779faf1
DENG-476 - Update error aggregates to reference main_v5 ( #4439 )
2023-10-16 14:40:12 -07:00
Daniel Thorn
167d3a7a31
DENG-476 - Remove unnecessary reference to main_v4 from ssl ratios ( #4437 )
2023-10-16 14:17:25 -07:00
Alekhya
f0c53d7afe
DENG1546 - Fix bqetl serp DAG failure due to missing parameter ( #4426 )
...
* DENG1546 - Fix DAG failure due to missing parameter
* Fix CI issue
* Fix logic in parameter
2023-10-16 13:23:57 -04:00
Alekhya
f0af9885c4
DENG1546 - Fix BQ partition issue ( #4419 )
...
* DENG1546 - Fix BQ partition issue
FIX CI issue
fix CI issue
* Update sql_generators/serp_events/templates/metadata.yaml
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/serp_events/templates/metadata.yaml
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/serp_events/templates/metadata.yaml
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* fix dag generation
---------
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-10-14 13:12:44 -04:00
Daniel Thorn
4728884e3c
DS-2642 - Update tax address source in stripe payout report ( #4422 )
2023-10-13 13:16:12 -07:00
Daniel Thorn
f41c85a37d
DS-2642 - Add tax amount to stripe payout report ( #4421 )
2023-10-13 11:56:00 -07:00
Sean Rose
4bbbc32a5b
Put assert UDFs in `mozfun` project ( #4367 )
...
* Put assert UDFs in `mozfun` project.
* Tweak syntax in `assert.array_equals()` to avoid SQLGlot parsing error.
https://github.com/tobymao/sqlglot/issues/2348
* Fix SQL syntax error in `assert.struct_equals()` tests.
* Fix UDF dependency file path logic when deploying to stage.
* Change regular expressions in `parse_routine` module to allow quotes around routines' dataset and name.
2023-10-13 10:58:42 -07:00
Mikaël Ducharme
95e90efb4d
fix(ci): Manually set SSH configuration for multiple SSH keys. ( #4420 )
2023-10-13 11:46:23 -04:00
kik-kik
109b9f0835
feat(DENG-1599): added ETL checks related to firefox_ios_derived.new_profile_activation_v2 ( #4411 )
...
* added ETL checks related to firefox_ios_derived.new_profile_activation_v2
* regenerated bqetl_firefox_ios DAG
2023-10-13 16:42:02 +02:00
Eduardo Filho
83569d8211
Add sampling to glam-fog ( #4409 )
...
* Add sampling to glam-fog
* Simplify count logic
* Update bigquery_etl/glam/templates/clients_daily_histogram_aggregates_v1.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update bigquery_etl/glam/templates/clients_daily_scalar_aggregates_v1.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
---------
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-10-13 09:24:04 -04:00
Alekhya
22c44fc62a
DENG1546 -Add a derived dataset for serp events ( #4407 )
...
* Add a derived dataset for serp events
* Fix CI issues
Add bqetl_dag
Fix CI issues
* reapply missing updates
* Fix the query logic
Fix the query logic
* Fix sql query
* fix partition field
2023-10-12 19:38:29 -04:00
Yashika Khurana
148d4eb539
feat: Cirrus monitor query ( #4393 )
...
* feat: Cirrus monitor query
* Update sql_generators/experiment_monitoring/templates/experiments_daily_active_clients_v1/query.sql
Co-authored-by: Mike Williams <102263964+mikewilli@users.noreply.github.com>
* feat: Monitor cirrus events
* fix: Use events
* Update sql/moz-fx-data-shared-prod/telemetry_derived/experiments_daily_active_clients_v1/query.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/experiment_enrollment_aggregates_v1/query.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/experiment_events_live_v1/init.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/experiment_search_events_live_v1/init.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/experiments_daily_active_clients_v1/query.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/experiments_daily_active_clients_v1/query.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/templating.yaml
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql_generators/experiment_monitoring/templates/templating.yaml
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql/moz-fx-data-shared-prod/monitor_cirrus_derived/experiment_search_events_live_v1/init.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* Update sql/moz-fx-data-shared-prod/monitor_cirrus_derived/experiment_events_live_v1/init.sql
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
* feat: Update view to table
* feat: Update view to table
* use live tables instead of stable for live events
---------
Co-authored-by: Mike Williams <102263964+mikewilli@users.noreply.github.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
Co-authored-by: Mike Williams <mwilliams@mozilla.com>
2023-10-12 14:46:26 -07:00
Mikaël Ducharme
9a6d9d15f4
fix(ci): sync-dags-repo uses wrong branch for private-bigquery-etl ( #4417 )
2023-10-12 17:21:42 -04:00
Anna Scholtz
35ae323487
Funnel generators POC ( #4390 )
...
* Add funnel generation logic
* Example funnel config
* Fix funnel columns
* funnel generation dimensions
* Optimize segmenting generated funnels
* Add funnel generation docs
* Schedule generated funnels
* Skip DAGs with no tasks
* Add background info funnel generator
* Add funnel generation tests
* Fix join_previous_step_on
* Add funnel example config
2023-10-12 14:05:08 -07:00
Mikaël Ducharme
27a99ca947
fix(ci): Add missing SSH key in sync-dags-repo step ( #4416 )
2023-10-12 16:46:21 -04:00
Anna Scholtz
61da5cca03
Respect sql_dir in dryrun skip ( #4334 )
...
* Respect sql_dir in dryrun skip
* Update bigquery_etl/dryrun.py
Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>
* Update bigquery_etl/dryrun.py
Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>
* Set sql_dir when using Schema.from_query_file()
---------
Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>
2023-10-12 13:27:54 -07:00
Mikaël Ducharme
c187aeba06
chore(ci): update private-bigquery-etl git submodule in telemetry-airflow-dags repo ( #4415 )
2023-10-12 15:45:54 -04:00
Frank Bertsch
a5bcebc308
Add line numbers for clients_daily aggs ( #4412 )
2023-10-12 12:36:50 -04:00