Anna Scholtz
654401a36d
Fix intermittent verify-dags-up-to-date failures
2020-08-18 15:07:35 -07:00
Jeff Klukas
a92a16c92b
Add mozfun.norm.fenix_app_info UDF ( #1226 )
2020-08-18 16:21:42 -04:00
Anna Scholtz
ceb223e1d8
Use git diff for verifying DAGs in CI
2020-08-18 12:31:38 -07:00
Anna Scholtz
9b8172eb1e
Update integration tests for query scheduling
2020-08-18 12:31:38 -07:00
Anna Scholtz
155e41c734
Sort generated Airflow tasks and dependencies by name
2020-08-18 12:31:38 -07:00
Anna Scholtz
f5c9c4c612
Remove project ID option from generate_airflow_dags script
2020-08-18 12:31:38 -07:00
Anna Scholtz
6d1e00aa42
Verify if generated DAGs are up-to-date
2020-08-18 12:31:38 -07:00
Anna Scholtz
486224ecda
Update experiment_search_aggregates_v1 init destination table
...
Co-authored-by: Daniel Thorn <dthorn@mozilla.com>
2020-08-18 12:23:03 -07:00
Anna Scholtz
75caa0ed36
Add cumulative search counts for experiment search aggregates
2020-08-18 12:23:03 -07:00
Anna Scholtz
f1579104af
Experiments search aggregates live view
2020-08-18 12:23:03 -07:00
Anna Scholtz
4d5b5652e7
experiment_search_aggregates dataset
2020-08-18 12:23:03 -07:00
Anthony Miyaguchi
222e04b081
Fix #1232 - Ignore glam_etl directory when publishing views ( #1234 )
2020-08-18 11:33:50 -07:00
Anthony Miyaguchi
ca2204625d
Add views for logical Fenix app ids in GLAM ETL ( #1221 )
...
* Add views for logical app ids
* Add new generated sql
* Update generate_glean_sql script to handle logical apps
* Update logical app view for partitiontime
* Make sure to generate view for all of the app ids
* Update last versions to be logical app id agnostic
* Add formatting for black
* Fix linting error
* Update bigquery_etl/glam/generate.py
Co-authored-by: Ben Wu <benjaminwu124@gmail.com>
* Add "all" option to STAGE
* Add new metrics added since last PR
Co-authored-by: Ben Wu <benjaminwu124@gmail.com>
2020-08-17 15:05:15 -07:00
Ben Wu
c63bd7e3cd
Add decoded stub installer to structured docid count ( #1231 )
2020-08-17 16:57:02 -04:00
Anna Scholtz
a6ef677b7f
Use cached_property for DryRun
2020-08-14 11:08:15 -07:00
Anna Scholtz
47ff0d88f3
Refactor dry run and update integration tests
2020-08-14 11:08:15 -07:00
Anna Scholtz
58b3cf31b7
Add payload_bytes_* tables as referenced_tables for monitoring datasets
2020-08-14 11:08:15 -07:00
Anna Scholtz
d73b21ff6e
Update DAGs
2020-08-14 11:08:15 -07:00
Anna Scholtz
b1a9b4a781
Add dry run tests
2020-08-14 11:08:15 -07:00
Anna Scholtz
0a5adf91ed
Refactor Airflow DAG generation to use dry run
2020-08-14 11:08:15 -07:00
Anna Scholtz
e5c8946171
Dry run get referenced tables
2020-08-14 11:08:15 -07:00
Anna Scholtz
53c856b781
Fix GCS integration test issues
2020-08-13 12:17:23 -07:00
William Lachance
1af07bdb1e
Use qualified table names and udf functions ( #1227 )
...
This makes backfilling inside a sandbox project possible
2020-08-12 16:13:22 -04:00
Ben Wu
4bb8eff997
Add distinct docid count queries ( #1225 )
2020-08-11 18:38:42 -04:00
Jeff Klukas
1c71e70838
Bug 1654330 AMO download stats by UTM params ( #1224 )
2020-08-11 13:32:10 -04:00
Jeff Klukas
285efeb991
Add burnham test report table ( #1220 )
...
* Add burnham test report table
For https://github.com/mozilla/burnham/issues/39
* Add test_log_url and test_duration_millis
* Apply suggestions from code review
Co-authored-by: Raphael Pierzina <raphael@hackebrot.de>
Co-authored-by: Raphael Pierzina <raphael@hackebrot.de>
2020-08-10 09:50:27 -04:00
Jesse McCrosky
3c6d63739c
Fix formatting.
2020-08-10 09:39:50 -04:00
Jesse McCrosky
7298d43c32
Add missing array length call.
2020-08-10 09:39:50 -04:00
Jesse McCrosky
b57f24ff1b
Add addon and cert error features
2020-08-10 09:39:50 -04:00
asiOvOtus
6a83b6223e
Fix mozfun format in generated docs ( #1212 )
2020-08-07 16:35:44 -04:00
William Lachance
5702d96dae
Bug 1646362 - Accessibility inspector derived dataset ( #1171 )
...
Co-authored-by: Jeff Klukas <jklukas@mozilla.com>
2020-08-07 16:27:06 -04:00
William Lachance
4633b5fda4
Provide extra context when dag generation fails due to a malformed query ( #1222 )
2020-08-07 14:40:14 -04:00
Jeff Klukas
5bab4e9d26
Bug 1655146 Add impression_stats_by_experiment
...
This table will be a bit larger than impression_stats_flat, about 1 TB/day
uncompressed. But it will be much more efficient for the class of queries
that are concerned with evaluating performance of a single experiment.
2020-08-07 12:54:10 -04:00
Anthony Miyaguchi
36b7c184e6
Add script to backfill glam tables for a glean product ( #1108 )
...
* Add backfill script for glean products
* Specify product correctly and add target dataset
* Add product to example
* Use datetime.fromisoformat
2020-08-06 15:48:40 -07:00
Ben Wu
a58821eaae
Add build date udf mapping for fenix_nightly ( #1218 )
2020-08-06 15:32:45 -04:00
Jeff Klukas
ac182ca56b
Add parsed_x_source_tags to metadata in user-facing views
...
As discussed in https://bugzilla.mozilla.org/show_bug.cgi?id=1657360#c1
2020-08-06 15:25:48 -04:00
Ben Wu
c7eae14b25
Add submission_date to clients_scalar_aggregates ( #1216 )
2020-08-06 14:28:53 -04:00
Jeff Klukas
d5d64359f6
Bug 1657360 Exclude pings with "automation" tag from stable
...
We will also need to update monitoring queries to account for this when
counting unique document_ids in decoded and live tables.
2020-08-06 12:56:15 -04:00
asiOvOtus
2acb30c9b0
Rewrite duplicated map udfs to mozfun shims ( #1211 )
...
* Rewrite duplicated map udfs to mozfun shims
* Format get_key_with_null.sql
2020-08-04 13:26:13 -07:00
jmccrosky
8192ee323f
Fix URI comparison ( #1205 )
...
* Fix URI comparison
No need to backfill for this now. Just fixing for future use.
* Fix formatting.
Co-authored-by: Jeff Klukas <jklukas@mozilla.com>
2020-08-03 09:49:54 -04:00
Jeff Klukas
0969a51673
Include sample_id in feature_usage ( #1201 )
...
Used as clustering key and allows sampling
2020-08-03 09:38:02 -04:00
Ben Miroglio
cc06758a9a
Add tagged_search current to LTV ( #1210 )
2020-07-31 15:48:34 -04:00
Ben Miroglio
8d6c2f47c1
Add sqrt to LTV calc ( #1202 )
...
* Add sqrt to LTV calc
* Format query
Co-authored-by: Frank Bertsch <fbertsch@mozilla.com>
2020-07-31 10:48:20 -04:00
Ben Wu
019666b51b
Add queries for exported app store data ( #1207 )
2020-07-29 18:02:16 -04:00
Anna Scholtz
a39e168a2d
Update query scheduling section
2020-07-29 14:11:00 -07:00
asiOvOtus
111382b37a
Rewrite duplicated bits28 udfs to mozfun shims ( #1208 )
2020-07-29 11:17:26 -07:00
Anna Scholtz
6dedaca002
Improve query CLI based on feedback
2020-07-29 08:26:24 -07:00
Anna Scholtz
8171e30010
Refactor CLI
2020-07-29 08:26:24 -07:00
Anna Scholtz
bb47afedd5
Docs for query CLI command
2020-07-29 08:26:24 -07:00
Anna Scholtz
fee4f44e9a
Fix CLI version
2020-07-29 08:26:24 -07:00