Commit Graph

114 Commits

Author SHA1 Message Date
Leli a6c14b7333
docs: add note about no_triage while under development (#5761) 2024-06-06 20:01:54 +02:00
Winnie Chan 4ebf436f05
DENG-3851 Updated managed backfill docs with slack channel for notifications (#5630)
* Updated docs

* Update docs/cookbooks/creating_a_derived_dataset.md

Co-authored-by: Alexander <anicholson@mozilla.com>

---------

Co-authored-by: Alexander <anicholson@mozilla.com>
2024-05-21 12:36:14 -07:00
Frank Bertsch d98a068f84
Add missing references to docs (#5618) 2024-05-20 09:40:17 -04:00
Alexander fbf9894251
docs: DENG-809 add instructions for managed backfills (#5377)
* docs: DENG-809 add instructions for managed backfills

* Apply suggestions from code review

Co-authored-by: Winnie Chan <10429026+wwyc@users.noreply.github.com>

* Additional doc updates

---------

Co-authored-by: Winnie Chan <10429026+wwyc@users.noreply.github.com>
2024-04-17 13:13:53 -04:00
Anna Scholtz eac0ac80c2
Remove telemetry_derived init.sql files (#5342)
* Remove init.sql files for telemetry_derived queries

* Remove init.sql for events_daily

* Remove init.sql from skip lists

* Remove init.sql references from tooling

* Add schema for baseline_clients_first_seen
2024-04-10 15:36:30 -07:00
kik-kik bc8ddd3582
feat(): Upgrade BQETL Python to 3.11 and pre-commit hooks (#5195)
* upgrade pre-commit hooks

* Upgrade Python to 3.11
2024-03-14 14:25:30 +01:00
Sean Rose 25e4c224a0
Support waiting for tables/partitions to exist before running ETLs (#5039)
* Implement `depends_on_tables_existing` and `depends_on_table_partitions_existing` scheduling metadata.

* Refactor repeated timedelta validation logic into `validate_timedelta_string` utility function.

* Replace Google Search Console empty-check ETLs with `depends_on_table_partitions`.
2024-02-15 14:38:52 -08:00
Anna Scholtz b0387fb7de
Remove referenced_tables usages (#4834)
* Remove referenced_tables usages

* Resolve check dependencies when generating DAGs

* Add tests for checks automatically resolving dependencies

* Incorporate feedback for removing referenced_tables

* Use depends_on for empty_checks

* More depends_on and fixes
2024-01-22 12:41:05 -08:00
Alexander 7a80984757
DENG-1193 Deprecate generated dataset docs (#4657) 2023-12-15 12:17:26 -05:00
kik-kik 85216b9c2a
added an option to matches_pattern to be able to specify percentage threshold / tolerance for failures for the check (#4716) 2023-12-15 15:44:49 +01:00
Leli 699b325a0e
add accepted values check (#4650)
* add accepted values check

* added semicolon
2023-12-15 13:34:04 +01:00
Alexander 463dc15bf1
Support shared-prod UDFs (#4708) 2023-12-14 13:45:13 -05:00
kik-kik 34d6a463dc
feat(DENG-2175): added matches_pattern etl checks macro and updated tests to use it (#4683)
* added matches_pattern etl checks macro and updated tests to use it

* added matches_pattern etl check macro to the data checks docs
2023-12-13 16:10:34 +01:00
kik-kik 2db2f3e2de
feat(DENG-2156): added value_length check and updated some of the ETL checks to use the macro (#4672)
* added value_length check and updated some of the ETL checks to use the macro

* added the new check macro to the data checks docs

* implemented lelilia feedback from PR#4672

* simplified the sql logic for the value_length check
2023-12-11 16:31:25 +01:00
Mike Williams 191f935d97
add triage/confidential tag to docs (#4678) 2023-12-11 09:56:38 -05:00
Jan-Erik Rediger a26eaf5e59
Docs: Link to section on the same page
The separate chapter was removed in #4293
2023-12-06 12:20:57 +01:00
Leli f84a097524
Feat/deng 2046/migrating telemetry derived active users aggregates v1 dim checks to etl checks (#4641)
* Migrated DIM checks over to ETL checks for telemetry_derived.active_users_aggregates_v1

* rewrite

* code review suggestions

* add doc

* rename

---------

Co-authored-by: kik-kik <kignasiak@mozilla.com>
2023-12-04 17:19:48 +01:00
Leli 9c1d19fc0b
change docs (#4629) 2023-11-30 17:03:54 +01:00
Marlene Hirose f02a25b4a0
add --project_id command, take out extraneous dashes in start and end commands in creating dataset cookbook (#4626) 2023-11-29 13:45:30 -08:00
Anna Scholtz dffbb3afef
Remove generated DAGs from main (#4507) 2023-11-01 15:05:59 -07:00
Sean Rose 4bbbc32a5b
Put assert UDFs in `mozfun` project (#4367)
* Put assert UDFs in `mozfun` project.

* Tweak syntax in `assert.array_equals()` to avoid SQLGlot parsing error.
  https://github.com/tobymao/sqlglot/issues/2348

* Fix SQL syntax error in `assert.struct_equals()` tests.

* Fix UDF dependency file path logic when deploying to stage.

* Change regular expressions in `parse_routine` module to allow quotes around routines' dataset and name.
2023-10-13 10:58:42 -07:00
kik-kik 42bfa1409e
Minor tweaks to the data checks docs (#4309) 2023-09-20 10:57:39 +02:00
Anna Scholtz 0b9b833ce3
Add more docs for updating clients_daily (#4316) 2023-09-18 11:02:06 -07:00
Alekhya 01ff67a29f
Rename min_rows check macro to min_row_count (#4301) 2023-09-14 15:06:41 -04:00
Anna Scholtz 9a6a2df03b
Updating and merging data check docs (#4293) 2023-09-13 14:48:47 -07:00
Mike Williams 4bb4c5b52f
fix DENG-1510: add dryrun flag to schema update command (#4270) 2023-09-08 14:58:44 -04:00
Lucia 27262acdfd
Default DAG for bqetl queries (#4143)
* DENG-1314 Implement changes to bqetl and create default DAG.

* DENG-1314. Update Documentation.

* DENG-1314. Dummy query to enable generating DAG and run tests.

* DENG-1314. Update tests.

* Update bigquery_etl/cli/query.py

Raise exception when scheduling information is missing.

Co-authored-by: Daniel Thorn <dthorn@mozilla.com>

* DENG-1314. Update tests.

* DS-3054. Update query creation to set bqetl_default as default value for --dag. Update tests.

* Default task and tests update.

* Default task and tests update.

* 3650 - Remove default DAG option, update DAG template comment & tests.

* 3650 - Condition for DAG warning.

* 3650 - Update docs.

* Clarification on sql/moz-fx-data-shared-prod/analysis/bqetl_default_task_v1/metadata.yaml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* Update docs/cookbooks/creating_a_derived_dataset.md

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-29 14:32:52 +02:00
kik-kik 7afc4c44f1
docs(DENG-960): bqetl data checks cli docs (#4200)
* Small tweaks made to the cli cmds comments / help display for data checks

* added usage docs to data_checks reference docs

* Apply suggestions from code review provided by scholtzan

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-24 17:24:23 +02:00
kik-kik 6c27be8a19
added data checks docs to mkdocs.yml to ensure they are included in our docs page (#4189) 2023-08-16 17:48:58 +02:00
kik-kik de35707b1d
docs(DENG-961): Added reference doc for data checks and a cookbook for adding new data checks (#4162)
* added reference doc for data checks and a cookbook for adding new data checks

* Apply suggestions from code review by ascholtz

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

---------

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2023-08-09 17:12:55 +02:00
Anna Scholtz d5a6dc97f4
Docs for new `bqetl_project.yaml` (#4018)
* Add ConfigLoader and move dry run skip to bqetl_project.yaml

* Update bqetl configuration docs
2023-07-10 09:50:22 -07:00
Curtis Morales b7b1b835ba
Add trigger_rule as an option for generated airflow tasks (#3772)
* Add trigger_rule as an option for generated airflow tasks

* Add test

* Move trigger rule options to enum and add to documentation
2023-06-27 13:58:52 -04:00
Leli d98ae6bf2e
changing the formatting for documentation in cookbooks (#3970)
* changing the formatting for documentation in cookbooks

* change to 1. 1. 1.
2023-06-27 18:41:33 +02:00
Lucia 5dc9e405a8
Deng 1000 docs how to change control (#3903)
* DENG-1000. Update docs with change control guide.

* DENG-1000. Update docs with change control guide.

* DENG-1000. Add sample PR to docs.

* DENG-1000. Add sample PR to docs.

* DENG-1000. Fix syntax of MD file.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-06-05 15:53:16 +00:00
Lucia dfef07ee40
DENG-1000. Update docs with change control guide. (#3901)
* DENG-1000. Update docs with change control guide.

* DENG-1000. Update docs with change control guide.

* DENG-1000. Add sample PR to docs.

---------

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2023-06-05 14:53:20 +00:00
Winnie Chan 071c53e4cb
DENG-803/805: Create & Validate backfill cli commands (#3760)
* Added backfill create and validate cli command

---------

Co-authored-by: Alexander <anicholson@mozilla.com>
Co-authored-by: kik-kik <42538694+kik-kik@users.noreply.github.com>
2023-06-01 10:06:09 -07:00
Daniel Thorn a0d810275b
Remove java dependency in favor of sqlglot (#3755) 2023-05-17 14:56:42 -07:00
Sean Rose 45836abc69
Fix metrics docs formatting (#3704)
* Remove trailing whitespace to satisfy pre-commit check.

* Remove blank line that's breaking docs formatting.

* Re-add blank lines lost in copy+paste.
2023-04-05 08:07:13 -07:00
Anna Scholtz 48d8c7603d
Metric hub integration - rewrite SSL ratios to use metrics (#3698)
* Add metrics.data_source()

* Rewrite SSL ratios to use metrics

* Fix docs formatting
2023-04-04 15:41:44 -07:00
Anna Scholtz eb5d63a8f5
Metric-hub doc tweaks (#3700) 2023-04-04 10:18:06 -07:00
Anna Scholtz d737925da1
Metric hub integration docs (#3697)
* Add metrics.data_source()

* Docs for referencing metrics

* Add docs for using metrics in queries
2023-04-04 09:56:47 -07:00
Anna Scholtz 34f778c090
Stage deploy docs (#3648)
* Docs for stage deploys

* Update cookbook for updating table schemas

* Docs for running stage deploys locally

* Update common workflow failures
2023-03-10 09:10:32 -08:00
Alexander 60c85e7c54
Revert CI changes for private UDFs and add stub documentation - DENG-735 (#3652)
* Revert "CI fixes for supporting private UDFs in bigquery-etl - DENG-735 (#3631)"

This reverts commit edcfe758f7.

* Added stub UDF for monetized_search

* Add docs for using a private internal UDF
2023-03-10 11:41:46 -05:00
kik-kik 414a3c5d77
Fixed typo in airflow_tags.md (#3589)
* Fixed typo in airflow_tags.md

As Sean pointed out, the tag provided in the docs was incorrect. This change fixes that.

* Apply suggestions from code review

Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>

---------

Co-authored-by: Sean Rose <1994030+sean-rose@users.noreply.github.com>
2023-02-16 11:10:58 +01:00
kik-kik 67ac75a04b
Added description for triage/record_only tag (#3576) 2023-02-10 10:00:31 -05:00
Leli 17c4fb35b1
add to documentation cookbook for testing (#3544) 2023-01-30 18:52:41 +01:00
akkomar 41fb32d7ef
Add a documentation note on faster single test execution (#3541) 2023-01-30 15:09:00 +01:00
Anna Scholtz 22e54ccb5f
[Bug 1812301] Publish only string typed labels (#3530)
* [Bug 1812301] Publish only string typed labels

* Document label publishing
2023-01-26 09:24:49 -08:00
Sean Rose d01ce2fc8a
Fix docs formatting (#3523)
* Use `mdx_truly_sane_lists` to fix nested list formatting in the docs (by default, mkdocs requires nested lists to be indented by four spaces, which isn't how most nested lists in our docs have been formatted).
* Enable syntax highlighting for code blocks in the docs.
* Add support for GitHub Flavored Markdown task lists in the docs.
2023-01-24 08:24:35 -08:00
Sean Rose afea1c35b9
Remove dependencies on the `mozdata` project from ETL (#3496)
Having dependencies on things in `mozdata` can cause issues with deployments, as deploying things to `mozdata` is usually a separate secondary step.
2023-01-23 08:59:04 -08:00