Commit graph

213 commits

Author SHA1 Message Date
kik-kik 2e7d42b39d
feat: update bqetl_mobile_kpi_metrics DAG start date (#5731) 2024-06-04 19:03:25 +02:00
kik-kik ee9e489238
feat: add bqetl_mobile_kpi_metrics DAG and schedule mobile_kpi_metrics using it (#5715) 2024-06-04 14:47:23 +02:00
Chelsey Beck 4a060e8a12
Deng 3763 increase schedule frequency (#5590)
* changing to 8 hours to align with schedules

* updating schedule

* updating schedule and aligning checks

* updating error text
2024-05-29 13:25:50 -07:00
kik-kik 5b23aa1349
feat: move mobile engagement logic into mobile kpi support metrics generator (#5661)
* feat: move mobile_engagement SQL logic into mobile_kpi_support_metrics generator

* feat: remove mobile_engagement queries from telemetry and telemetry_erived

* feat: apply formatting to engagement_clients view and add checks template

* feat: add engagement view template and add engagement templates to the generation logic

* bug: fix engagement query errors

* feat: sneak in removing of retention views inside telemetry (mobile kpi generator will overwrite these anyways)

* feat: remove bqetl_mobile_engagement_model DAG from DAG list as no longer needed.

* feat: fix an engagement_v1 query template and removed unnecessary comment from the metadata template

* fix: update engagement templates to ensure valid query is rendered
2024-05-29 15:49:50 +02:00
Eduardo Filho 4fb6dcc6eb
chore(glam): accommodate for dag reschedule (#5664) 2024-05-27 14:12:44 -04:00
Sean Rose 3324aa05e8
Update `bqetl_subplat` DAG description with new expected Stripe report error message. (#5642) 2024-05-23 08:42:21 -07:00
Eduardo Filho 4225b75ab1
chore(glam): Adjust DAGs schedule and execution_delta (#5609)
* chore(glam): Adjust DAGs schedule and execution_delta according to upstream's new schedule

* Fix execution_delta
2024-05-17 18:18:07 -04:00
Katie Windau 760ce16de0
DENG-3288 Create Mobile Engagement Model Tables & Views (#5525) 2024-05-15 14:47:03 -05:00
Marlene Hirose 5ebb5dd418
Deng 3187 desktop retention model (#5496)
* initial commit for desktop_retention_clients view

* initial commit for desktop_retention_clients view - after formatting

* initial add of query script for telemetry_derived.retention_v1 table

* change view to table as I need to use the submission_date query parameter

* move view to query for desktop_retention_clients

* reformat file and add COALESCE on submission_date for new_profiles

* take out app_name from retention_v1 query. Change isp_name to isp, take out mozfun UDF on normalized_os_version in CTE and update main query from retention_clients

* run formatting on retention_v1/query.sql

* move files from retention to desktop_retention, retention_clietns to desktop_retention_clients

* add newline to end of schema.yaml file

* reinstate retention_v1 deprecated folder/metadata.yaml file

* refactor desktop_retention_clients_v1/desktop_retention metadata.yaml - add in clustering, take out extra space indent

* refactor metadata.yaml, add schema.yaml, change query.sql to pull from telemetry_derived.desktop_retention_clients_v1 not view

* add in metric_date for desktop_retention_v1 partition, take out require partition date filtering for both retention and retention_clients

* add desktop_retention_model to dags.yaml

* remove retention_clients_v1

* change column name is_new_profile to new_profile_metric_date

* take out 'app_name' from group by

* add telemetry_derived.desktop_retention_clients_v1 to shredder config
2024-05-14 13:50:15 -07:00
Leli e94c62f42f
Add description to the bqetl_braze DAG (#5566) 2024-05-14 20:01:33 +02:00
Katie Windau 7ad70e29e7
DS-3501 - Create New iOS App Campaign Reporting Table (#5515)
* DS-3501 initial commit

* DS-3501 add new dag

* DS-3501 update metadata.yaml

* DS-3501 - update metadata.yaml

* DS-3501 add backticks around column named date
2024-05-07 14:44:59 -05:00
Anna Scholtz 4227b5381e
Assign kwindau as owner of bqetl_google_analytics (#5492)
Co-authored-by: Katie Windau <153020235+kwindau@users.noreply.github.com>
2024-05-03 14:47:41 -05:00
Katie Windau 67e0c1ae9d
DENG-3186 new desktop engagement model (#5415)
* DENG-3186 new desktop engagement model

* DENG-3186 add distribution ID to eng client V1 query

* DENG-3186 - add distribution ID to the schema.yaml for eng client

* DENG-3186 add distribution ID to the agg table

* DENG-3186 adding locale to both client & agg tables

* DENG-3186 add app_version to client & agg tables

* DENG-3168 move lifecycle stage from engagement agg table to engagement agg view

* DENG-3186 - add new desktop_engagement_client_v1 table to shredder

* DENG-3186 update formatting of config.py

* DENG-3186 switch to get country, os, os version, and build # from clients last seen and remove join to clients daily
2024-04-24 11:57:11 -05:00
Leli 0d65e93aba
Deng 3471 suppression list out of campaign monitor api (#5416)
* DENG-3471 suppression list from campaign monitor

* Update sql/moz-fx-data-shared-prod/marketing_suppression_list_external/campaign_monitor_suppression_list_v1/metadata.yaml

Co-authored-by: Ben Wu <12437227+BenWu@users.noreply.github.com>

* after seans coder review

---------

Co-authored-by: Ben Wu <12437227+BenWu@users.noreply.github.com>
2024-04-24 13:55:43 +02:00
Leli ed76336c5e
DENG-3471 suppression list from campaign monitor (#5412)
* DENG-3471 suppression list from campaign monitor

* Update sql/moz-fx-data-shared-prod/marketing_suppression_list_external/campaign_monitor_suppression_list_v1/metadata.yaml

Co-authored-by: Ben Wu <12437227+BenWu@users.noreply.github.com>

---------

Co-authored-by: Ben Wu <12437227+BenWu@users.noreply.github.com>
2024-04-23 19:39:25 +02:00
Winnie Chan 7f4d216bef
DENG-2621: Add pageload derived tables (#5359)
* Added nightly, experiments and 1pct tables
2024-04-23 08:44:38 -07:00
Chelsey Beck ba9c6abda5
adding an active user filter to subscriptions (#5382)
* adding an active user filter to subscriptions

* changing schedule on dag
2024-04-17 13:12:56 +02:00
Leli 2b2d2a0e38
DENG-3412 braze sync hard bounce to bigquery with bqetl (#5375) 2024-04-16 19:54:33 +02:00
Chelsey Beck 6a93bc4059
Create Braze derived models (#5362)
* DENG-3008 removing misspelled file

* DENG-3008 adding suppressions model

* adding products model

* updating dataset

* adding newsletter model

* adding waitlist model

* removing create statement and adding modified timestamp

* updating name and description for newsletters

* updating description

* adding users model

* adding braze dag

* changing subscribed to boolean

* updating type to date

* updating to full refresh

* updating email id to external id for standardization

* updating format

* updating nullable fields

* updating format

* updating nullable fields

* adding subscription map model

* adding subscriptions model

* adding user profiles model

* updating format

* formatting

* removing last modified date in favor of only timestamp

* updating table name
2024-04-15 19:27:31 -07:00
Leli eac125c103
doc: DENG-3361 add documentation to telemetry dev cyle DAG (#5349) 2024-04-09 18:51:08 +02:00
Leli 6469993573
fix: DENG-3287 change start date for DAGs acoustic (#5335) 2024-04-05 16:39:52 +02:00
Leli c57ef86a4f
Deng 3287 move acoustic data from marketing to shared prod (#5316)
* DENG-3151 create acoustic_external dataset in BigQuery

* DENG-3287 move data from marketing to shared-prod

* acoustic_derived dataset
2024-04-03 16:26:44 +02:00
Leli 7057f83985
DENG-3161 create full suppression list Accoustic (#5246)
Co-authored-by: Marlene Hirose <92952117+Marlene-M-Hirose@users.noreply.github.com>
2024-03-20 17:25:55 +01:00
kik-kik 02ca92f9dd
fix: add missing impact tag to the bqetl_data_observability_test_data_copy DAG (#5236) 2024-03-19 18:28:24 +01:00
kik-kik 0184b7ecff
feat(DENG-2864): data-observability-dev define tables, schemas, and add ETL for updating them (#5222)
* feat: add fenix_derived test tables to the data-observability-dev project

* feat: add event_monitoring_live_v1 to data-observability-dev

* feat: add new temp DAG for running copy to data-observability-dev tasks

* fix: using fully qualified source table name in fenix_derived.events_daily_v1 inside data-observabiolity project

* fix: correct event queries used to copy data into data-observability-dev project
2024-03-19 17:10:59 +01:00
ksiegler1 e6f1f40f00
DENG-2095 Create users_services_daily table for accounts_backend (#4906) 2024-02-19 09:07:32 +01:00
Sean Rose 802da71a2c
Add ETLs and views for Google Search Console data (DENG-1733) (#4892)
* Add ETLs for historical Google Search Console data synced by Fivetran.

* Fix formatting of `CASE` subclauses like `WHEN` inside Jinja blocks.

* Add ETLs for current Google Search Console data exported directly to BigQuery.

* Add views for Google Search Console data.
2024-02-07 12:53:32 -08:00
Eduardo Filho 9fbfac3ac0
GLAM historical aggregates tables (#4795) 2024-01-10 16:29:35 -05:00
Katie Windau 1c4102abed
Update bqetl_google_analytics_derived_ga4 yaml configs (#4780)
* Update bqetl_google_analytics_derived_ga4 yaml configs

* Switch to using countifs for more readability

* reformat and switch to countifs for readability
2024-01-04 15:44:28 -06:00
Katie Windau 6220025220
Add new table www_site_metrics_summary_v2 using new GA4 data (#4776)
* Initial draft

* adding new GA4 dag

* fixing source for browser

* work in progress

* change group by to be explicit

* adding non_fx_sessions

* adding non-fx-sessions

* adding campaign to query

* adding new column to group by

* added ad_content column

* Adding downloads column

* Adding non fx downloads

* reformat the SQL

* Add owner to new DAG

* Update data types in the schema file

* Add missing comma

* Update query.sql

* Update start date and reduce from tier1 -> tier2

* Update sql/moz-fx-data-marketing-prod/ga_derived/www_site_metrics_summary_v2/metadata.yaml

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>

* Update sql/moz-fx-data-marketing-prod/ga_derived/www_site_metrics_summary_v2/metadata.yaml

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>

* Update sql/moz-fx-data-marketing-prod/ga_derived/www_site_metrics_summary_v2/query.sql

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>

* Update sql/moz-fx-data-marketing-prod/ga_derived/www_site_metrics_summary_v2/query.sql

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>

* Update sql/moz-fx-data-marketing-prod/ga_derived/www_site_metrics_summary_v2/query.sql

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>

* update source and medium columns to use last touch attribution instead of first touch attribution

* update group by

---------

Co-authored-by: Frank Bertsch <frank.bertsch@gmail.com>
2024-01-04 15:01:47 -06:00
Katie Windau 30a4e9ef30
Fixing DAG start time (#4762) 2023-12-29 11:17:45 -06:00
Katie Windau cb31189be4
DENG-2262 - update DAG run time from 8am UTC daily to 6:30pm UTC daily (#4756) 2023-12-28 14:35:02 -06:00
Katie Windau ebe5b4cfb4
DENG-2262 - Creating new desktop_installs_v1 table (#4754)
* DENG-2262 - add new DAG bqetl_desktop_installs_v1

* Initial commit for DENG-2262

* DENG-2262 - reformatted query.sql

* DENG-2262 add empty new line to end of view.sql
2023-12-28 13:15:04 -06:00
Leli c05aec0f9b
DENG-1728 adding glean_app metrics to bigquery-etl (#4720)
* adding telemetry_dev_cycle_derived to bigquery_etl

* Update dags.yaml

Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>

* changes after code review

* move to _external dataset

* rename table

* fix defaults

* schema from file

---------

Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>
2023-12-20 20:14:11 +01:00
Alexander d96618dce2
Bug 1815242 - Update bqetl_pocket triage notes (#4733) 2023-12-20 12:52:54 -05:00
rzhao b1c6c6d7c8
feat(): Added new data to mobile feature usage tables (logins data for iOS and Fenix) (#4648)
* update

* This is the modified codes following suggestions in https://github.com/mozilla/bigquery-etl/pull/4467. This pull request aims to update codes that create four mobile feature usage tables

* updated codes following comments from kik in 72d6d71910

* update feature usage table codes following suggestions

* added new data to mobile feature usage table (logins data for iOS) and modified to add unnest for nested values

* update codes based on suggestion from https://github.com/mozilla/bigquery-etl/pull/4648/files

* update metadata.yaml and dags.yaml according to suggestions

* update metadata.yaml and dags.yaml according to suggestions

* remove clustering and references in all the metadata files, add LEFT JOIN to all the UNNEST, modify SQL files to follow the incremental submission date format, and rename dau as events(metrics)_ping_client_count

* updated schema to reflect the name change for dau

* update according to comments on dec 15

* update the distinct client count name

* fix the yaml errors

* fix no new line error for dags.yaml

---------

Co-authored-by: Ruoxi Zhao <rzhao@rzhao-37509.local>
Co-authored-by: Ruoxi Zhao <rzhao@rzhao-37509.lan>
2023-12-15 15:08:58 -08:00
Katie Windau d5ffd4fe35
Fb 1866834 new use counter data (#4713)
* 1866834 - add new DAG bqetl_use_counter_analysis

* 1866834 - adding fenix & firefox use_counters_v1 tables and telemetry use_counters view

* 1866834 - fixing yaml file formatting

* 1866834 - fix dags.yaml format - remove trailing space

* 1866834 - add cast to numeric for rate to match schema for rate

* 1866834 - Remove unnecessary alias

* 1866834 - remove unnecessary alias

* Update and rename view.sql to view.sql

* 1866834 - test removing view

* 1866834 - update table names

* 1866834 - fixing fenix_and_firefox_use_counters view name and source table references
2023-12-15 13:13:36 -06:00
Daniel Thorn ffe8e304e2
Remove dthorn as dag owner (#4695) 2023-12-15 10:19:42 -08:00
Daniel Thorn b0bfc65052
DENG-965 - symbolicate and signaturize crash pings (#4642) 2023-12-12 08:57:52 -08:00
Anna Scholtz e8f3f759d5
Public GLAM datasets (#4606)
* Public GLAM datasets

* Remove Fenix GLAM datasets
2023-12-07 14:56:55 -08:00
kik-kik 377685cac9
fixing broken test for firefox_ios_derived.baseline_clients_yearly_v1 (#4645) 2023-12-04 10:48:56 -05:00
Anna Scholtz 7087dbff30
Separate Airflow tasks for glean_usage (#4588)
* Add support for assigning Airflow tasks to task groups

* Generate separate Airflow tasks for glean_usage

* Remove Airflow dependencies from old glean_usage tasks
2023-11-30 09:48:17 -08:00
Lucia 3db53758d2
Correct DAG description as DAG is currently active. (#4596)
Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-11-22 16:06:45 +01:00
Frank Bertsch cbb843e455
Add ga_sessions_v1 table & view (#4554)
* Add ga_sessions_v1 table & view

This table aggregates session-level data from GA.

* Rename nullify string func

* Apply suggestions from code review

Co-authored-by: Alexander <anicholson@mozilla.com>

* Add upstream backfill deps

* Move depends_on to correct section

---------

Co-authored-by: Alexander <anicholson@mozilla.com>
2023-11-16 15:58:33 -05:00
Lucia b3abfc42ce
Update scheduler of aggregates to run after upstreams. (#4503)
* Update scheduler of aggregates to run after upstreams.

* Update dags for new scheduler of analytics_aggregates

* Update dag bqetl_search

* Remove DAG.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-11-06 17:55:12 +01:00
Rebecca BurWei 73a5535b67
Urlbar events: nested (long) instead of wide (#4373)
* feat: urlbar events final release

* feat: new result types

* feat: add interaction and group

* fix: date

* fix: use BQ builtin for UUIDs

* Add the view_v2'

* Add new table to the DAG

* fix CI error

fix ci error

* remove teon brooks

* Incorporate feedback by Curtis

Incorporate feedback from Curtis

---------

Co-authored-by: Alekhya Kommasani <akommasani@mozilla.com>
Co-authored-by: Alekhya <88394696+alekhyamoz@users.noreply.github.com>
2023-10-25 12:55:00 -04:00
akkomar 66729aa702
FXA-6721 Setup import of accounts table from FxA production CloudSQL (#4423) 2023-10-25 09:50:25 +02:00
Sergio E. Betancourt 2b2697e8f3
[RS-826] New job to calculate newtab visits -> activity stream sessions (#4387)
* New job to calculate newtab visits -> activity stream sessions

* Removing newline chars at end of file

* Removing newline chars at end of file

* Removing newline chars at end of file

* Addressing comment suggestions

* Format

* Add bqetl_ads DAG

* Add ACL to nt_visits_to_sessions_conversion_factors_daily_v1

* Add metadata files

* Add view to dry_run skip list

* Oops, fix the view

---------

Co-authored-by: Curtis Morales <cmorales@mozilla.com>
2023-10-24 12:51:14 -04:00
Alekhya 6f3d34ba67
DS3244 - Add derived datasets for review checker data (#4447)
* Add review checker derived datasets

* Add bqetl_review_checker dag

Fix

* Fix CI validate dag step

* Incorporate feedback from Alex

* Fix CI

* change client last seen to clients first seen

change client last seen to clients first seen

* fix dag
2023-10-18 16:57:15 -04:00
Anna Scholtz 35ae323487
Funnel generators POC (#4390)
* Add funnel generation logic

* Example funnel config

* Fix funnel columns

* funnel generation dimensions

* Optimize segmenting generated funnels

* Add funnel generation docs

* Schedule generated funnels

* Skip DAGs with no tasks

* Add background info funnel generator

* Add funnel generation tests

* Fix join_previous_step_on

* Add funnel example config
2023-10-12 14:05:08 -07:00