Граф коммитов

15 Коммитов

Автор SHA1 Сообщение Дата
kik-kik 681ac4b73a
feat(): Adding ETL for monitoring_airflow datasets (#3204)
* added sql logic for monitoring_airflow

* bqetl_monitoring_airflow added to dags.yaml

* added .probe_cache/ to gitignore

* generated bqetl_monitoring_airflow dag

* added monitoring_airflow_derived to dryrun ignore as it fails to access fivetran data referenced in this dataset

* moved airflow views and sql files to monitoring dataset

* manually triggering fivetran load of airflow metadata as suggested by @ascholtzan in PR#3204

* added schemas as requested by @scholtzan in PR#3204

* added descriptions to airflow_monitoring datasets

* fixed airflow_dag view folder name

* corrected dryrun ignore for monitoring_derived/airflow*
2022-10-12 10:57:24 +01:00
Lucia da36b8b3f8
Update bigquery-etl cookbooks (#2935)
* Ignore .idea/ folder in pycharm

* Make instructions to add field generic with example. Add recommendations to delete field.

* Reformat numbered list for consistency.

* Update instructions based on PR feedback.

Co-authored-by: Lucia Vargas <lvargas@mozilla.com>
2022-05-04 11:10:34 +02:00
William Lachance 9f21097a49
Remove .vscode settings and add to gitignore (#1929)
.vscode's settings.json can get cluttered with installation-specific
settings, which can lead to confusing pull requests. Instead, let's use
the approach outlined here:

https://stackoverflow.com/a/48387809

This provides a set of defaults that people can use, which we can
extend over time.
2021-03-30 12:25:05 -04:00
Daniel Thorn f0ec77e27d
Use zetasql to find sql dependencies (#1802) 2021-02-17 11:48:40 -08:00
Jeff Klukas 3512fb6ff7
Publish generated views and queries to a generated-sql branch (#1775)
* Add CI task to push content to generated-sql branch

Fixes #1742

The
[`generated-sql`](https://github.com/mozilla/bigquery-etl/tree/generated-sql)
branch now exists and you can browse the contents. See, for example,
[telemetry.main](https://github.com/mozilla/bigquery-etl/tree/generated-sql/sql/moz-fx-data-shared-prod/telemetry/main)

Follow-ups for which I'll file issues:

- This doesn't currently publish the generated Glean baseline ETL queries
  and views; we'll need to update that logic to use probe-scraper metadata
  rather than listing tables in BigQuery (due to creds) to integrate it.
- Docs publishing should reference this generated content rather
2021-02-10 09:42:58 -05:00
Linh Nguyen dd10f7a7f7
Generate summaries of derived tables (fixes #1589) (#1735)
* Generate summaries of derived tables

* Generate mozdata docs

* Address requested changes
2021-02-04 13:05:26 -08:00
Linh Nguyen be961f10a0
Add links to metadata.yaml editor and source dictory (#1242) (#1618)
* Add links to edit and source directory

* Update docs/mkdocs.yml

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

Co-authored-by: jailang <lynjailang@gmail.com>
Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-12-15 11:52:46 -08:00
Jeff Klukas d539fafb59
Bug 1635906 Add bqetl support for scripts and script for AET lookup (#1323)
* Bug 1635906 Add bqetl support for scripts and script for AET lookup

There are some code changes here for DAG generation and for testing.

* Apply suggestions from code review

Co-authored-by: Anna Scholtz <anna@scholtzan.net>

* DAG fixups

Co-authored-by: Anna Scholtz <anna@scholtzan.net>
2020-09-23 15:28:25 -04:00
Daniel Thorn 8ca73c2b60
Rewrite script/format_sql in python (#640) 2020-01-06 16:17:41 -08:00
Daniel Thorn 469c03ec10
Add script to format sql (#173) 2019-09-18 17:48:53 -07:00
Marina Samuel ba8b51a327 MDV2 client_probe_counts templates and queries. 2019-09-11 11:24:28 -04:00
Anna Scholtz 7207a4e52f Move SQL templates to templates/ and add generated SQL 2019-06-25 08:07:26 -07:00
Anna Scholtz f420c1b2a9 Update .gitignore
Co-Authored-By: Jeff Klukas <jeff@klukas.net>
2019-06-25 08:07:26 -07:00
Anna Scholtz a6661c5896 Trigger SQL query generation in pytest and update CircleCI config 2019-06-25 08:07:26 -07:00
Daniel Thorn 79070068ad
Add first test (#9) 2019-03-07 12:43:21 -08:00