Граф коммитов

633 Коммитов

Автор SHA1 Сообщение Дата
Kaxil Naik 401003e8e3 [AIRFLOW-5740] Fix Transient failure in Slack test (#6407)
* [AIRFLOW-5740] Fix Transient failure in Slack test

The transient failure is caused by Dict Ordering

* We were comparing string

We were comparing strings
2019-10-24 19:35:42 +02:00
TobKed 44b6a271fb [AIRFLOW-5665] Add path_exists method to SFTPHook (#6344)
Co-Authored-By: Kamil Breguła <mik-laj@users.noreply.github.com>
2019-10-21 18:28:58 +02:00
MinJae Kwon 636aeb010a [AIRFLOW-5693] Support the "blocks" component for the Slack messages (#6364)
- fix doc string issues
2019-10-20 13:46:14 +02:00
dstandish 0790ede702 [AIRFLOW-4574] SSHHook private_key may only be supplied in extras (#6163)
* discussion on original PR suggested removing private_key option as init param
* with this PR, can still provide through extras, but not as init param
* also add support for private_key in tunnel -- missing in original PR for this issue
* remove test related to private_key init param
* use context manager to auto-close socket listener so tests can be re-run
2019-10-18 10:34:08 +01:00
JohannesGuenther d4e282d9b2 [AIRFLOW-5126] Read aws_session_token in extra_config of the aws hook (#6303) 2019-10-16 16:22:07 +01:00
TobKed 032200a495 [AIRFLOW-5622] Improve creating directories in SFTPHook (#6287)
* [AIRFLOW-XXX] Improve creating directories in SFTPHook

* fixup! [AIRFLOW-XXX] Improve creating directories in SFTPHook
2019-10-09 19:07:53 -07:00
Kamil Breguła 560c01e0ff
[AIRFLOW-5587] Move airflow.contrib.task_runner.cgroup_task_runner to core (#6248) 2019-10-07 14:20:45 +02:00
Kevin Yang d719e1fd67 [AIRFLOW-5362] Reorder imports (#5944) 2019-10-02 16:30:03 +01:00
Tomek Urbaszek 1eda51c501 [AIRFLOW-5502] Move GCP base hook to core 2019-09-26 14:34:59 +02:00
Kamil Breguła fd8de3e48e [AIRFLOW-5555] Remove Hipchat integration (#6184) 2019-09-26 10:45:14 +08:00
Peng Hou 2d19a41208 [AIRFLOW-3388] Add support to Array Jobs for AWS Batch Operator (#6153) 2019-09-22 11:11:38 +02:00
Kamil Breguła 86b4caac9a
[AIRFLOW-5434] Use hook to provide credentials in GKEPodOperator (#6050) 2019-09-21 22:06:19 +02:00
Tomek 8f04ebe669 [AIRFLOW-5499] Move GCP utils to core (#6122) 2019-09-21 20:59:36 +02:00
TobKed 31f19fbd6e [AIRFLOW-5514] No implicit optional flag for mypy (#6141) 2019-09-18 22:02:36 +02:00
dstandish fa8e18a2a8 [AIRFLOW-4574] add option to provide private_key in SSHHook (#6104) 2019-09-17 16:26:23 +02:00
Kamil Breguła e32663383d
[AIRFLOW-XXX] Display Azure integration in table (#6132) 2019-09-17 14:17:07 +02:00
Jarek Potiuk 4780105798 [AIRFLOW-5256] Related pylint changes for common licences in python files (#5786) 2019-09-17 13:16:32 +02:00
Tomek 686fac044c [AIRFLOW-5498] Move GCP Discovery hook to core (#6121) 2019-09-17 12:42:50 +02:00
Kamil Breguła 61d3f0c615 [AIRFLOW-5427] Add system tests for Dataflow 2019-09-15 20:25:58 +02:00
Fokko Driesprong a87f3dfc0c [AIRFLOW-5451] SparkSubmitHook don't set default namespace (#6072)
* [AIRFLOW-5451] SparkSubmitHook don't set default namespace

We only want to set the namespace if it isn't default.
https://spark.apache.org/docs/latest/running-on-kubernetes.html#configuration

The default is already set by Spark, therefore we don't want to pass
it if is default. This also allows us to pass the namespace over the
conf dict. Otherwise the namespace would be set twice.

* Fix tests as well
2019-09-12 09:44:13 +02:00
Tomek Urbaszek b07cb439a0 [AIRFLOW-5457] Move GCS operators to core 2019-09-11 17:22:04 +01:00
Tomek Urbaszek 766ebfe780 [AIRFLOW-5455] Move BigQuery operators to core
For more information check AIP-21.
2019-09-11 17:22:04 +01:00
Felix Uellendall 25c53b0a10
[AIRFLOW-4588] Add GoogleDiscoveryApiHook and GoogleApiToS3Transfer (#5335)
- add documentation to integration.rst
The hook provides:
- a get_conn function to authenticate to the Google API via an airflow connection
- a query function to dynamically query all data available for a specific endpoint and given parameters. (You are able to either retrieve one page of data or all data)
The transfer operator provides:
- basic transfer between google api and s3
- passing an xcom variable to dynamically set the endpoint params for a request
- exposing the response data to xcom, but raises exception when it exceeds MAX_XCOM_SIZE

Co-authored-by: louisguitton <louisguitton@users.noreply.github.com>
2019-09-11 10:40:04 +02:00
Fokko Driesprong dd175fa8db [AIRFLOW-5390] Remove provide context (#5990) 2019-09-10 15:17:03 +02:00
Jarek Potiuk 455eb41ba6
[AIRFLOW-5437] Better python version detection/explanation. (#6060)
We have fairly complex python version detection in our CI scripts.

They have to handle several cases:

1) Running builds on DockerHub (we cannot pass different environment
   variables there, so we detect python version based on the image
   name being build (airflow:master-python3.7 -> PYTHON_VERSION=3.7)

2) Running builds on Travis CI. We use python version determined
   from default python3 version available on the path. This way we
   do not have to specify PYTHON_VERSION separately in each job,
   we just specify which host python version is used for that job.
   This makes a nice UI experience where you see python version in
   Travis UI.

3) Running builds locally via scripts where we can pass PYTHON_VERSION
   as environment variable.

4) Running builds locally for the first time with Breeze. By default
   we determine the version based on default python3 version we have
   in the host system (3.5, 3.6 or 3.7) and we use this one.

5) Selecting python version with Breeze's --python switch. This will
   override python version but it will also store the last used version
   of python in .build directory so that it is automatically used next
   time.

This change adds necessary explanations to the code that works for
all the cases and fixes some of the edge-cases we had. It also
extracts the code to common directory.
2019-09-10 10:16:10 +02:00
Tomek d31c09236c [AIRFLOW-3601] Update operators to BigQuery to support location (#6020) 2019-09-09 11:10:37 +02:00
Tomek 0ffc14c30a [AIRFLOW-5425] Use logging not printing in LoggingCommandExecutor (#6032)
This will show proper message in log (yellow color) which will be harder to miss.
2019-09-08 10:36:50 +02:00
Kamil Breguła cb6dfdfabe [AIRFLOW-2842] Add GoogleCloudStorageSynchronizeBuckets operator 2019-09-07 17:20:41 +02:00
Kamil Breguła 004f353342 [AIRFLOW-3511][AIRFLOW-3512] Add Cloud Memorystore integration 2019-09-07 17:12:52 +02:00
Tomek 1b1d79c0a6 [AIRFLOW-4964] Add BigQuery Data Transfer Hook and Operator (#5769)
* [AIRFLOW-4964] Add BigQuery Data Transfer Hook and Operator
2019-09-06 12:48:42 +02:00
Kamil Breguła 0076e17a91 [AIRFLOW-4758] Add GcsToGDriveOperator operator (#5822)
* [AIRFLOW-4758] Add GcsToGDriveOperator operator
2019-09-06 12:45:15 +02:00
Mohannad Albanayosy 6b82b9ef91 [AIRFLOW-5318] Option to specify location of the new BQ dataset (#5923) 2019-09-06 12:41:40 +02:00
TobKed f4b64b3557 [AIRFLOW-5412] Add get_conn/get_client to hooks tests (#6018)
* [AIRFLOW-5412] Add get_conn/get_client to hooks tests
2019-09-06 12:20:38 +02:00
Diederik Greveling dd36d90cad [AIRFLOW-5049] Add validation for src_fmt_configs in bigquery hook (#5671)
* AIRFLOW-5049 Add validation for src_fmt_configs in bigquery hook

Adds validation for the src_fmt_configs arguments in the bigquery hook. Otherwise wrong src_fmt_configs would be silently ignored which is non-desireable.

* [AIRFLOW-5049] Update - Add validation for src_fmt_configs in bigquery hook

Adds a common method for validating the src_ftm_configs
2019-09-06 08:50:26 +02:00
Tobias Kaymak 4ab6982f0b [AIRFLOW-5072] gcs_hook should download once (#5685)
When a user supplied a filename the expected behaviour is that airflow
downloads the file and does not return it's content as a string.
2019-09-05 17:54:07 +02:00
Tomek 0948bcf868 [AIRFLOW-5319] Add system tests for Dataproc (#5924)
* [AIRFLOW-5319] Add system tests for Dataproc
2019-09-05 17:44:32 +02:00
Kamil Breguła 7018f47460 [AIRFLOW-5361] Add system tests for BigQuery (#5968) 2019-09-05 17:43:04 +02:00
Tomek ad61f3272a [AIRFLOW-5332] Add system tests for Datastore (#5937) 2019-09-05 17:35:31 +02:00
Tomek 9165739e90 [AIRFLOW-5409] Fix BigQuery hook tests (#6017) 2019-09-05 15:37:07 +02:00
Kamil Breguła 57f77e0c09 [AIRFLOW-5308] Pass credentials object to pandas_gbq (#5911) 2019-09-05 12:55:06 +02:00
Kamil Breguła a4ebd9d2eb [AIRFLOW-5358] Improved mocking of the project ID (#5964)
* [AIRFLOW-5358] Improved mocking of the project ID
2019-09-05 12:49:45 +02:00
Tomek 9143893c99 [AIRFLOW-5347] Add system tests for GoogleCloudStorage (#5951) 2019-09-04 22:21:15 +02:00
Zacharya c9e2d04fde [AIRFLOW-4085] FileSensor now takes glob patterns for `filepath` (#5358) 2019-09-04 11:02:28 +01:00
Igor Khrol ba9e521e71 [AIRFLOW-5344] Add --proxy-user parameter to SparkSubmitOperator (#5948)
`spark2-submit` supports `--proxy-user` parameter which should be handled by SparkSubmitOperator.

```
$ spark2-submit --help 2>&1 | grep proxy
  --proxy-user NAME           User to impersonate when submitting the application.
```
2019-09-04 10:47:09 +01:00
Omkar Desai 13843151fb [Airflow-4668] Make airflow/contrib/utils Pylint compatible (#5916) 2019-09-04 09:23:57 +01:00
Tomek fa9e91227b [AIRFLOW-5386] Move Google Dataproc to core (#5986)
For more information check AIP-21.
2019-09-03 23:00:38 +02:00
Hao Liang f497d1d5aa [AIRFLOW-4858] Deprecate "Historical convenience functions" in airflow.configuration (#5495)
1. Issue old conf method deprecation warnings properly and remove current old conf method usages.
2. Unify the way to use conf as `from airflow.configuration import conf`
2019-09-03 17:08:55 +01:00
Tomek 2ae5256df6 [AIRFLOW-5388] Add airflow version label to newly created buckets (#5987)
To tag and track GCP resources spawned from Airflow, we have
been adding airflow specific label(s) to GCP API service calls
whenever possible and applicable.
2019-09-02 20:54:55 +02:00
Kaxil Naik 6138208749
[AIRFLOW-5335] Simplify GCSHook test (#5958) 2019-08-30 16:50:54 +01:00
Tomek bc5c8f32a1 [AIRFLOW-5320] Add system tests for PubSub (#5925) 2019-08-30 15:28:12 +02:00