TobKed
75484bc37e
[AIRFLOW-5446] Rewrite Google KMS Hook to Google Cloud Python ( #6065 )
2019-09-10 14:12:01 +02:00
Jarek Potiuk
455eb41ba6
[AIRFLOW-5437] Better python version detection/explanation. ( #6060 )
...
We have fairly complex python version detection in our CI scripts.
They have to handle several cases:
1) Running builds on DockerHub (we cannot pass different environment
variables there, so we detect python version based on the image
name being build (airflow:master-python3.7 -> PYTHON_VERSION=3.7)
2) Running builds on Travis CI. We use python version determined
from default python3 version available on the path. This way we
do not have to specify PYTHON_VERSION separately in each job,
we just specify which host python version is used for that job.
This makes a nice UI experience where you see python version in
Travis UI.
3) Running builds locally via scripts where we can pass PYTHON_VERSION
as environment variable.
4) Running builds locally for the first time with Breeze. By default
we determine the version based on default python3 version we have
in the host system (3.5, 3.6 or 3.7) and we use this one.
5) Selecting python version with Breeze's --python switch. This will
override python version but it will also store the last used version
of python in .build directory so that it is automatically used next
time.
This change adds necessary explanations to the code that works for
all the cases and fixes some of the edge-cases we had. It also
extracts the code to common directory.
2019-09-10 10:16:10 +02:00
Fokko Driesprong
b21be80235
[AIRFLOW-5402] Remove deprecated logger ( #6006 )
...
* [AIRFLOW-5402] Remove deprecated logger
* Remove the related test
* Less is more
2019-09-09 15:18:38 +02:00
Kamil Breguła
6b2a290ddf
[AIRFLOW-5443] Use alpine image in Kubernetes's sidecar ( #6059 )
2019-09-09 14:42:06 +02:00
Tomek
d31c09236c
[AIRFLOW-3601] Update operators to BigQuery to support location ( #6020 )
2019-09-09 11:10:37 +02:00
Jacob Ferriero
6db9f86925
[AIRFLOW-4983] Add ability for DataflowPythonOperator to submit jobs w/ python3 ( #5602 )
2019-09-09 08:02:38 +02:00
Tomek
0ffc14c30a
[AIRFLOW-5425] Use logging not printing in LoggingCommandExecutor ( #6032 )
...
This will show proper message in log (yellow color) which will be harder to miss.
2019-09-08 10:36:50 +02:00
Kamil Breguła
867cd0f714
[AIRFLOW-5368] Display DAG from the CLI
2019-09-07 18:13:07 +02:00
Kamil Breguła
cb6dfdfabe
[AIRFLOW-2842] Add GoogleCloudStorageSynchronizeBuckets operator
2019-09-07 17:20:41 +02:00
Tomek Urbaszek
ae0d03eba7
[AIRFLOW-5401] Add support for project_id from connection in GKE
...
This change adds support for reading project_id value from
connection configuration GKE operators.
2019-09-07 17:14:36 +02:00
Kamil Breguła
004f353342
[AIRFLOW-3511][AIRFLOW-3512] Add Cloud Memorystore integration
2019-09-07 17:12:52 +02:00
Tomek
1b1d79c0a6
[AIRFLOW-4964] Add BigQuery Data Transfer Hook and Operator ( #5769 )
...
* [AIRFLOW-4964] Add BigQuery Data Transfer Hook and Operator
2019-09-06 12:48:42 +02:00
Kamil Breguła
0076e17a91
[AIRFLOW-4758] Add GcsToGDriveOperator operator ( #5822 )
...
* [AIRFLOW-4758] Add GcsToGDriveOperator operator
2019-09-06 12:45:15 +02:00
Mohannad Albanayosy
6b82b9ef91
[AIRFLOW-5318] Option to specify location of the new BQ dataset ( #5923 )
2019-09-06 12:41:40 +02:00
TobKed
f4b64b3557
[AIRFLOW-5412] Add get_conn/get_client to hooks tests ( #6018 )
...
* [AIRFLOW-5412] Add get_conn/get_client to hooks tests
2019-09-06 12:20:38 +02:00
Diederik Greveling
dd36d90cad
[AIRFLOW-5049] Add validation for src_fmt_configs in bigquery hook ( #5671 )
...
* AIRFLOW-5049 Add validation for src_fmt_configs in bigquery hook
Adds validation for the src_fmt_configs arguments in the bigquery hook. Otherwise wrong src_fmt_configs would be silently ignored which is non-desireable.
* [AIRFLOW-5049] Update - Add validation for src_fmt_configs in bigquery hook
Adds a common method for validating the src_ftm_configs
2019-09-06 08:50:26 +02:00
Tobias Kaymak
4ab6982f0b
[AIRFLOW-5072] gcs_hook should download once ( #5685 )
...
When a user supplied a filename the expected behaviour is that airflow
downloads the file and does not return it's content as a string.
2019-09-05 17:54:07 +02:00
Tomek
756069766d
[AIRFLOW-3804] Extend MySQL to GCS operator tests ( #5993 )
...
This commit adds additional test related to problem mentioned in initial
issue https://issues.apache.org/jira/browse/AIRFLOW-3804\#
2019-09-05 17:49:19 +02:00
Tomek
0948bcf868
[AIRFLOW-5319] Add system tests for Dataproc ( #5924 )
...
* [AIRFLOW-5319] Add system tests for Dataproc
2019-09-05 17:44:32 +02:00
Kamil Breguła
7018f47460
[AIRFLOW-5361] Add system tests for BigQuery ( #5968 )
2019-09-05 17:43:04 +02:00
Tomek
ad61f3272a
[AIRFLOW-5332] Add system tests for Datastore ( #5937 )
2019-09-05 17:35:31 +02:00
TobKed
0f548b8cf2
[AIRFLOW-5403] Fix input check in GKE Operator ( #6004 )
2019-09-05 16:48:14 +02:00
Tomek
9165739e90
[AIRFLOW-5409] Fix BigQuery hook tests ( #6017 )
2019-09-05 15:37:07 +02:00
Kamil Breguła
57f77e0c09
[AIRFLOW-5308] Pass credentials object to pandas_gbq ( #5911 )
2019-09-05 12:55:06 +02:00
Kamil Breguła
a4ebd9d2eb
[AIRFLOW-5358] Improved mocking of the project ID ( #5964 )
...
* [AIRFLOW-5358] Improved mocking of the project ID
2019-09-05 12:49:45 +02:00
davlum
17d4179db2
[AIRFLOW-4851] Refactor K8S codebase with k8s API models ( #5481 )
...
* [AIRLFOW-4851] refactor Airflow kubernetes
* [AIRFLOW-4851] refactor Airflow k8s models
* [AIRFLOW-4851] Fix linting and tests
* Refactor and add some tests
* [AIRLFOW-4851] Add assertions to PodOperator tests
2019-09-04 16:24:31 -05:00
Tomek
9143893c99
[AIRFLOW-5347] Add system tests for GoogleCloudStorage ( #5951 )
2019-09-04 22:21:15 +02:00
Zacharya
c9e2d04fde
[AIRFLOW-4085] FileSensor now takes glob patterns for `filepath` ( #5358 )
2019-09-04 11:02:28 +01:00
Igor Khrol
ba9e521e71
[AIRFLOW-5344] Add --proxy-user parameter to SparkSubmitOperator ( #5948 )
...
`spark2-submit` supports `--proxy-user` parameter which should be handled by SparkSubmitOperator.
```
$ spark2-submit --help 2>&1 | grep proxy
--proxy-user NAME User to impersonate when submitting the application.
```
2019-09-04 10:47:09 +01:00
Omkar Desai
13843151fb
[Airflow-4668] Make airflow/contrib/utils Pylint compatible ( #5916 )
2019-09-04 09:23:57 +01:00
Tomek
6e53281af6
[AIRFLOW-5399] Add invoke operator for GCP Functions ( #5995 )
2019-09-04 09:07:45 +02:00
Tomek
fa9e91227b
[AIRFLOW-5386] Move Google Dataproc to core ( #5986 )
...
For more information check AIP-21.
2019-09-03 23:00:38 +02:00
Hao Liang
f497d1d5aa
[AIRFLOW-4858] Deprecate "Historical convenience functions" in airflow.configuration ( #5495 )
...
1. Issue old conf method deprecation warnings properly and remove current old conf method usages.
2. Unify the way to use conf as `from airflow.configuration import conf`
2019-09-03 17:08:55 +01:00
Tomek
2ae5256df6
[AIRFLOW-5388] Add airflow version label to newly created buckets ( #5987 )
...
To tag and track GCP resources spawned from Airflow, we have
been adding airflow specific label(s) to GCP API service calls
whenever possible and applicable.
2019-09-02 20:54:55 +02:00
Tomek
7ee2b99229
[AIRFLOW-5314] Create test for new import paths ( #5920 )
2019-09-01 02:51:28 +02:00
Tomek
7a59358ffd
[AIRFLOW-5356] Fix GCP Datastore unit tests ( #5961 )
...
This commit adds additional mocking for Datastore tests which
tests fallback_to_default_project_id decorator.
2019-08-30 18:17:28 +02:00
Kaxil Naik
6138208749
[AIRFLOW-5335] Simplify GCSHook test ( #5958 )
2019-08-30 16:50:54 +01:00
Tomek
bc5c8f32a1
[AIRFLOW-5320] Add system tests for PubSub ( #5925 )
2019-08-30 15:28:12 +02:00
Kamil Breguła
1b69a0b120
[AIRFLOW-5353] Simplify GKEClusterHook ( #5960 )
2019-08-30 14:26:48 +02:00
Kamil Breguła
7c0d9b46ce
[AIRFLOW-5303] Use project_id from GCP credentials ( #5907 )
2019-08-30 01:11:34 +02:00
Kaxil Naik
890adde7bd
[AIRFLOW-5350] Fix bug in the num_retires field in BigQueryHook ( #5955 )
2019-08-29 23:05:10 +01:00
Kaxil Naik
b420ada05b
[AIRFLOW-5351] Move all GCP Cloud SQL tests in 1 file ( #5956 )
2019-08-29 23:04:00 +01:00
Tomek
5fb9972672
[AIRFLOW-5330] Add project_id to Datastore hook and operators ( #5935 )
2019-08-29 23:04:39 +02:00
Tomek
8eeedf538c
[AIRFLOW-5346] Add system tests for GKECluster ( #5947 )
2019-08-29 22:54:30 +02:00
Tomek
88a94cb1bb
[AIRFLOW-5341] Use more precise mock of time.sleep ( #5950 )
...
Some tests were mocking 'time.sleep' instead of using more precise
'airflow.module.path.time.sleep' mock.
2019-08-29 22:52:10 +02:00
Géraud
3d14a51cdb
[AIRFLOW-4833] Allow to set Jinja env options in DAG declaration ( #5943 )
2019-08-29 20:06:17 +01:00
Kaxil Naik
b1d3d55da9
[AIRFLOW-5335] Update GCSHook methods so they need min IAM perms ( #5939 )
2019-08-29 16:40:33 +01:00
Igor Dralyuk
fd544eb0cb
[AIRFLOW-5118] Add ability to specify optional components in DataprocClusterCreateOperator ( #5821 )
2019-08-28 21:12:30 +02:00
Michal Brys
fb0910e847
[AIRFLOW-5275] Add support for template parameters in DataprocWorkflowTemplateInstantiateOperator ( #5877 )
2019-08-28 21:06:30 +02:00
Tomek
382b21e234
[AIRFLOW-5322] Fix flaky test - GCP Transfer Service hook ( #5931 )
2019-08-28 15:56:38 +02:00