116 commits

Author SHA1 Message Date
Ben Wu 09d1ee7ca5
Add shell completion script for bqetl (#6372) 2024-10-21 15:18:16 +00:00
Lucia 8096006532
Reference to the shredder mitigation process during backfills. (#6233)
* Reference to the shredder mitigation process during backfills.

* missing dash
2024-09-20 13:57:19 +00:00
Leli a6c14b7333
docs: add note about no_triage while under development (#5761) 2024-06-06 20:01:54 +02:00
Winnie Chan 4ebf436f05
DENG-3851 Updated managed backfill docs with slack channel for notifications (#5630)
* Updated docs

* Update docs/cookbooks/creating_a_derived_dataset.md

Co-authored-by: Alexander <anicholson@mozilla.com>

---------

Co-authored-by: Alexander <anicholson@mozilla.com>
2024-05-21 12:36:14 -07:00
Frank Bertsch d98a068f84
Add missing references to docs (#5618) 2024-05-20 09:40:17 -04:00
Alexander fbf9894251
docs: DENG-809 add instructions for managed backfills (#5377)
* docs: DENG-809 add instructions for managed backfills

* Apply suggestions from code review

Co-authored-by: Winnie Chan <10429026+wwyc@users.noreply.github.com>

* Additional doc updates

---------

Co-authored-by: Winnie Chan <10429026+wwyc@users.noreply.github.com>
2024-04-17 13:13:53 -04:00
Anna Scholtz eac0ac80c2
Remove telemetry_derived init.sql files (#5342)
* Remove init.sql files for telemetry_derived queries

* Remove init.sql for events_daily

* Remove init.sql from skip lists

* Remove init.sql references from tooling

* Add schema for baseline_clients_first_seen
2024-04-10 15:36:30 -07:00
kik-kik bc8ddd3582
feat(): Upgrade BQETL Python to 3.11 and pre-commit hooks (#5195)
* upgrade pre-commit hooks

* Upgrade Python to 3.11
2024-03-14 14:25:30 +01:00
Sean Rose 25e4c224a0
Support waiting for tables/partitions to exist before running ETLs (#5039)
* Implement `depends_on_tables_existing` and `depends_on_table_partitions_existing` scheduling metadata.

* Refactor repeated timedelta validation logic into `validate_timedelta_string` utility function.

* Replace Google Search Console empty-check ETLs with `depends_on_table_partitions`.
2024-02-15 14:38:52 -08:00
Anna Scholtz b0387fb7de
Remove referenced_tables usages (#4834)
* Remove referenced_tables usages

* Resolve check dependencies when generating DAGs

* Add tests for checks automatically resolving dependencies

* Incorporate feedback for removing referenced_tables

* Use depends_on for empty_checks

* More depends_on and fixes
2024-01-22 12:41:05 -08:00
Alexander 7a80984757
DENG-1193 Deprecate generated dataset docs (#4657) 2023-12-15 12:17:26 -05:00
kik-kik 85216b9c2a
added an option to matches_pattern to be able to specify percentage threshold / tolerance for failures for the check (#4716) 2023-12-15 15:44:49 +01:00
Leli 699b325a0e
add accepted values check (#4650)
* add accepted values check

* added semicolon
2023-12-15 13:34:04 +01:00
Alexander 463dc15bf1
Support shared-prod UDFs (#4708) 2023-12-14 13:45:13 -05:00
kik-kik 34d6a463dc
feat(DENG-2175): added matches_pattern etl checks macro and updated tests to use it (#4683)
* added matches_pattern etl checks macro and updated tests to use it

* added matches_pattern etl check macro to the data checks docs
2023-12-13 16:10:34 +01:00
kik-kik 2db2f3e2de
feat(DENG-2156): added value_length check and updated some of the ETL checks to use the macro (#4672)
* added value_length check and updated some of the ETL checks to use the macro

* added the new check macro to the data checks docs

* implemented lelilia feedback from PR#4672

* simplified the sql logic for the value_length check
2023-12-11 16:31:25 +01:00
Mike Williams 191f935d97
add triage/confidential tag to docs (#4678) 2023-12-11 09:56:38 -05:00
Jan-Erik Rediger a26eaf5e59
Docs: Link to section on the same page
The separate chapter was removed in #4293
2023-12-06 12:20:57 +01:00
Leli f84a097524
Feat/deng 2046/migrating telemetry derived active users aggregates v1 dim checks to etl checks (#4641)
* Migrated DIM checks over to ETL checks for telemetry_derived.active_users_aggregates_v1

* rewrite

* code review suggestions

* add doc

* rename

---------

Co-authored-by: kik-kik <kignasiak@mozilla.com>
2023-12-04 17:19:48 +01:00
Leli 9c1d19fc0b
change docs (#4629) 2023-11-30 17:03:54 +01:00
Marlene Hirose f02a25b4a0
add --project_id command, take out extraneous dashes in start and end commands in creating dataset cookbook (#4626) 2023-11-29 13:45:30 -08:00
Anna Scholtz dffbb3afef
Remove generated DAGs from main (#4507) 2023-11-01 15:05:59 -07:00
Sean Rose 4bbbc32a5b
Put assert UDFs in `mozfun` project (#4367)
* Put assert UDFs in `mozfun` project.

* Tweak syntax in `assert.array_equals()` to avoid SQLGlot parsing error.
  https://github.com/tobymao/sqlglot/issues/2348

* Fix SQL syntax error in `assert.struct_equals()` tests.

* Fix UDF dependency file path logic when deploying to stage.

* Change regular expressions in `parse_routine` module to allow quotes around routines' dataset and name.
2023-10-13 10:58:42 -07:00
kik-kik 42bfa1409e
Minor tweaks to the data checks docs (#4309) 2023-09-20 10:57:39 +02:00
Anna Scholtz 0b9b833ce3
Add more docs for updating clients_daily (#4316) 2023-09-18 11:02:06 -07:00
Alekhya 01ff67a29f
Rename min_rows check macro to min_row_count (#4301) 2023-09-14 15:06:41 -04:00
Anna Scholtz 9a6a2df03b
Updating and merging data check docs (#4293) 2023-09-13 14:48:47 -07:00
Mike Williams 4bb4c5b52f
fix DENG-1510: add dryrun flag to schema update command (#4270) 2023-09-08 14:58:44 -04:00
Lucia 27262acdfd
Default DAG for bqetl queries (#4143)
* DENG-1314 Implement changes to bqetl and create default DAG.

* DENG-1314. Update Documentation.

* DENG-1314. Dummy query to enable generating DAG and run tests.

* DENG-1314. Update tests.

* Update bigquery_etl/cli/query.py

Raise exception when scheduling information is missing.

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>

* DENG-1314. Update tests.

* DS-3054. Update query creation to set bqetl_default as default value for --dag. Update tests.

* Default task and tests update.

* Default task and tests update.

* 3650 - Remove default DAG option, update DAG template comment & tests.

* 3650 - Condition for DAG warning.

* 3650 - Update docs.

* Clarification on sql/moz-fx-data-shared-prod/analysis/bqetl_default_task_v1/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* Update docs/cookbooks/creating_a_derived_dataset.md

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-29 14:32:52 +02:00
kik-kik 7afc4c44f1
docs(DENG-960): bqetl data checks cli docs (#4200)
* Small tweaks made to the cli cmds comments / help display for data checks

* added usage docs to data_checks reference docs

* Apply suggestions from code review provided by scholtzan

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-24 17:24:23 +02:00
kik-kik 6c27be8a19
added data checks docs to mkdocs.yml to ensure they are included in our docs page (#4189) 2023-08-16 17:48:58 +02:00
kik-kik de35707b1d
docs(DENG-961): Added reference doc for data checks and a cookbook for adding new data checks (#4162)
* added reference doc for data checks and a cookbook for adding new data checks

* Apply suggestions from code review by ascholtz

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-09 17:12:55 +02:00
Anna Scholtz d5a6dc97f4
Docs for new `bqetl_project.yaml` (#4018)
* Add ConfigLoader and move dry run skip to bqetl_project.yaml

* Update bqetl configuration docs
2023-07-10 09:50:22 -07:00
Curtis Morales b7b1b835ba
Add trigger_rule as an option for generated airflow tasks (#3772)
* Add trigger_rule as an option for generated airflow tasks

* Add test

* Move trigger rule options to enum and add to documentation
2023-06-27 13:58:52 -04:00
Leli d98ae6bf2e
changing the formatting for documentation in cookbooks (#3970)
* changing the formatting for documentation in cookbooks

* change to 1. 1. 1.
2023-06-27 18:41:33 +02:00
Lucia 5dc9e405a8
Deng 1000 docs how to change control (#3903)
* DENG-1000. Update docs with change control guide.

* DENG-1000. Update docs with change control guide.

* DENG-1000. Add sample PR to docs.

* DENG-1000. Add sample PR to docs.

* DENG-1000. Fix syntax of MD file.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-06-05 15:53:16 +00:00
Lucia dfef07ee40
DENG-1000. Update docs with change control guide. (#3901)
* DENG-1000. Update docs with change control guide.

* DENG-1000. Update docs with change control guide.

* DENG-1000. Add sample PR to docs.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-06-05 14:53:20 +00:00
Winnie Chan 071c53e4cb
DENG-803/805: Create & Validate backfill cli commands (#3760)
* Added backfill create and validate cli command

---------

Co-authored-by: Alexander <anicholson@mozilla.com>
Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>
2023-06-01 10:06:09 -07:00
Daniel Thorn a0d810275b
Remove java dependency in favor of sqlglot (#3755) 2023-05-17 14:56:42 -07:00
Sean Rose 45836abc69
Fix metrics docs formatting (#3704)
* Remove trailing whitespace to satisfy pre-commit check.

* Remove blank line that's breaking docs formatting.

* Re-add blank lines lost in copy+paste.
2023-04-05 08:07:13 -07:00
Anna Scholtz 48d8c7603d
Metric hub integration - rewrite SSL ratios to use metrics (#3698)
* Add metrics.data_source()

* Rewrite SSL ratios to use metrics

* Fix docs formatting
2023-04-04 15:41:44 -07:00
Anna Scholtz eb5d63a8f5
Metric-hub doc tweaks (#3700) 2023-04-04 10:18:06 -07:00
Anna Scholtz d737925da1
Metric hub integration docs (#3697)
* Add metrics.data_source()

* Docs for referencing metrics

* Add docs for using metrics in queries
2023-04-04 09:56:47 -07:00
Anna Scholtz 34f778c090
Stage deploy docs (#3648)
* Docs for stage deploys

* Update cookbook for updating table schemas

* Docs for running stage deploys locally

* Update common workflow failures
2023-03-10 09:10:32 -08:00
Alexander 60c85e7c54
Revert CI changes for private UDFs and add stub documentation - DENG-735 (#3652)
* Revert "CI fixes for supporting private UDFs in bigquery-etl - DENG-735 (#3631)"

This reverts commit edcfe758f7.

* Added stub UDF for monetized_search

* Add docs for using a private internal UDF
2023-03-10 11:41:46 -05:00
kik-kik 414a3c5d77
Fixed typo in airflow_tags.md (#3589)
* Fixed typo in airflow_tags.md

As Sean pointed out, the tag provided in the docs was incorrect. This change fixes that.

* Apply suggestions from code review

Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>

---------

Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>
2023-02-16 11:10:58 +01:00
kik-kik 67ac75a04b
Added description for triage/record_only tag (#3576) 2023-02-10 10:00:31 -05:00
Leli 17c4fb35b1
add to documentation cookbook for testing (#3544) 2023-01-30 18:52:41 +01:00
akkomar 41fb32d7ef
Add a documentation note on faster single test execution (#3541) 2023-01-30 15:09:00 +01:00
Anna Scholtz 22e54ccb5f
[Bug 1812301] Publish only string typed labels (#3530)
* [Bug 1812301] Publish only string typed labels

* Document label publishing
2023-01-26 09:24:49 -08:00