Commit Graph

1618 Commits

Author SHA1 Message Date
Frank Bertsch e71840cec6
Android event types init (#1289)
* Init SQL for event_types_v1

* Fix comparison of differently-sized lists

* Add support for tests of init stmts

* Include metadata for event_types_v1

* Add tests for event_types init

* Reformat SQL

* Run black

* Skip invalid unicode sections in event_code_points_to_string

* Allow for init-only queries

* Partition events_daily_v1 by submission-date

This is not strictly required, but will aid in
debugging and reruns.

* Add assertion for not null

* Lint

* Alias events ping name

* Ignore time_ms event property
2020-09-10 12:12:56 -04:00
Anna Scholtz 7566b7c3f0 Add CLI UDF tests 2020-09-09 14:10:44 -07:00
Anna Scholtz f04d7bf507 Add UDF CLI command for publishing UDFs 2020-09-09 14:10:44 -07:00
Anna Scholtz e87bb3f4d4 Add CLI command for validating UDFs 2020-09-09 14:10:44 -07:00
Anna Scholtz cb8675fb92 CLI command for listing UDF information 2020-09-09 14:10:44 -07:00
Anna Scholtz 4e9e1619c6 Add CLI command for creating UDFs 2020-09-09 14:10:44 -07:00
Anna Scholtz 278a7b790b UDF CLI setup 2020-09-09 14:10:44 -07:00
Frank Bertsch 51a7631e09
Bug 1649871 - Include rolled up event types (#1294)
* Bug 1649871 - Include rolled up event types

* Reformat sql
2020-09-09 11:43:57 -04:00
Frank Bertsch de97647d11
Don't include most recent month in revenue calc (#1219) 2020-09-09 11:35:29 -04:00
Frank Bertsch 55f3ab5cb5
Use submission_date for partition pruning (#1257) 2020-09-09 10:59:38 -04:00
Ben Miroglio 23491d5054
Add ltv view without revenue fields (#1203)
* Add ltv view without revenue fields

* Change view to query

* Add submission_date filter

* Ignore normalized ltv export in dryrun

Co-authored-by: Frank Bertsch <fbertsch@mozilla.com>
2020-09-09 10:50:47 -04:00
Jeff Klukas 25363c44fc Remove mozfun reference in mozfun UDF definition
Airflow's publishing of mozfun definitions failed last night due to
https://github.com/mozilla/bigquery-etl/pull/1287

Long-term we should enforce that mozfun function definitions don't directly
reference mozfun.
2020-09-09 09:32:51 -04:00
Jeff Klukas fe0afb6ae0
Tolerate 10-digit Fenix app_build values when choosing channel (#1270)
See https://github.com/mozilla/bigquery-etl/pull/1250 for proposed logic of how
to parse a date from the 10-digit format, but this change should be reliable
for the specific case of determining whether we are before or after the
2020-07-03 epoch where naming of apps and channels changed.

This will affect GLAM and GUD.

Reverts https://github.com/mozilla/bigquery-etl/pull/1249

I will plan to run a backfill for GUD on the same day I merge this, and then
communicate out the change to data users, since channel breakdowns in GUD
will change for Fenix and Firefox Preview.
2020-09-09 08:53:06 -04:00
dependabot[bot] 9cb1b70b1f
Bump attrs from 20.1.0 to 20.2.0 (#1284) 2020-09-09 01:25:34 +00:00
dependabot[bot] b86f64fbfb
Bump gitpython from 3.1.7 to 3.1.8 (#1283) 2020-09-09 01:18:42 +00:00
Anna Scholtz ac56b5b1f6 Fix publishing UDFs 2020-09-08 12:28:16 -07:00
dependabot[bot] f25a096536
Bump pytest-black from 0.3.10 to 0.3.11 (#1282) 2020-09-08 16:46:47 +00:00
Felix Lawrence a8fc87b91c
Add `clients_last_seen.days_interacted_bits`. (#1280)
* Add `clients_last_seen.days_any_interaction_bits`.

This column is a 28-day bit array containing whether the client has any active ticks (thus nonzero active hours) associated with each of the last 28 submission dates.

* Change column name.

Follow the grammatical pattern for these column names, as pointed out by @jklukas.

* Update tests

* Reconcile order to satisfy IF

Co-authored-by: Jeff Klukas <jklukas@mozilla.com>
2020-09-02 16:21:32 -04:00
Daniel Thorn 26c67c7ee8
Upgrade to pytest 6.0.1 (#1281)
Also upgrade and fix pytest plugins
2020-09-02 11:30:14 -07:00
Anna Scholtz 6c7ad57b55 Prefix test data UDFs with test_ 2020-09-02 10:24:38 -07:00
Anna Scholtz 7564f8e46c Fix collecting UDFs for testing 2020-09-02 10:24:38 -07:00
Anna Scholtz 15a02710d2 Remove old parse_udf file 2020-09-02 10:24:38 -07:00
Anna Scholtz 3bdb418588 Add tests for parsing UDFs 2020-09-02 10:24:38 -07:00
Anna Scholtz 40e8a7ad91 Reformat UDFs 2020-09-02 10:24:38 -07:00
Anna Scholtz 56a475c572 Fix validating docs 2020-09-02 10:24:38 -07:00
Anna Scholtz 437cf67aa2 Refactor parse_udf 2020-09-02 10:24:38 -07:00
Anna Scholtz 0080ff8867 Refactor publish_udfs script 2020-09-02 10:24:38 -07:00
Anna Scholtz 2b29d24f59 Migrate UDFs to new format 2020-09-02 10:24:38 -07:00
dependabot[bot] 04014b483e
Bump google-cloud-storage from 1.27.0 to 1.31.0 (#1279) 2020-09-01 18:17:15 +00:00
dependabot[bot] c565d2e96b
Bump smart-open from 1.10.0 to 2.1.1 (#1278) 2020-09-01 17:20:29 +00:00
dependabot[bot] 357431e1b7
Bump google-cloud-bigquery from 1.24.0 to 1.27.2 (#1277) 2020-09-01 16:22:14 +00:00
dependabot[bot] 08369555f5
Bump attrs from 19.3.0 to 20.1.0 (#1275) 2020-09-01 16:12:43 +00:00
dependabot[bot] 288d6ed8db
Bump gitpython from 3.1.2 to 3.1.7 (#1274) 2020-09-01 16:05:43 +00:00
dependabot[bot] bf8983784a
Bump typing from 3.7.4.1 to 3.7.4.3 (#1267) 2020-09-01 15:57:55 +00:00
Anna Scholtz f0c3ad8fda
Fix writing to temporary table when publishing JSON (#1273)
* Fix writing to temporary table when publishing JSON

* Update test_script_incremental_query test

* Ensure temporary artifacts are deleted

* Wrap deletion of artifacts into try finally block
2020-09-01 08:51:15 -07:00
Frank Bertsch 3da15a3001
Prepare search_clients_last_seen for 100% backfill (#1261)
* Move null search engines to Other

* Remove sample_id limit
2020-09-01 09:46:35 -04:00
dependabot[bot] 5ed271b8b4
Bump pytest-black from 0.3.8 to 0.3.10 (#1265) 2020-08-31 21:10:47 +00:00
Anna Scholtz 34e2c21405 Add CLI command to initialize destination tables for queries 2020-08-31 12:47:58 -07:00
dependabot[bot] 4cd100e064
Bump pyyaml from 5.3 to 5.3.1 (#1263)
Bumps [pyyaml](https://github.com/yaml/pyyaml) from 5.3 to 5.3.1.
- [Release notes](https://github.com/yaml/pyyaml/releases)
- [Changelog](https://github.com/yaml/pyyaml/blob/master/CHANGES)
- [Commits](https://github.com/yaml/pyyaml/compare/5.3...5.3.1)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2020-08-31 12:22:56 -07:00
Daniel Thorn dd29a862c2
Migrate to new dependabot (#1262) 2020-08-31 12:00:48 -07:00
Anna Scholtz 3725a64e31 Check error output for entrypoint tests 2020-08-31 09:25:45 -07:00
Anna Scholtz 99b11cbf93 Install bq for integration tests in CircleCI 2020-08-31 09:25:45 -07:00
Anna Scholtz 6970b73a21 Add integration tests for entrypoint run query 2020-08-31 09:25:45 -07:00
Anna Scholtz debd57c662 Fix entrypoint run_query call 2020-08-27 21:08:14 -07:00
Anna Scholtz ffaaa2ab26 Call bigquery_etl.run_query from script/run_query 2020-08-27 14:48:32 -07:00
Anna Scholtz bebe352335 Add tests for run_query 2020-08-27 14:48:32 -07:00
Anna Scholtz e2007fe9fc Add CLI tests 2020-08-27 14:48:32 -07:00
Anna Scholtz 043c28dd4a Add cost and last_updated information to query info CLI 2020-08-27 14:48:32 -07:00
Anna Scholtz 9b587bf374 Add backfill command 2020-08-27 14:48:32 -07:00
Anna Scholtz 04cbf80eab Add format command to CLI 2020-08-27 14:48:32 -07:00