Граф коммитов

6916 Коммитов

Автор SHA1 Сообщение Дата
Tomek ee3fdacdd9 [AIRFLOW-5185] Move GCP Video Intelligence to core (#5794)
This commit moves GCP Video Intelligence from contrib to core.
For more information check AIP-21.
2019-08-17 19:26:38 +01:00
Rudison Lacerda a45ec674e9 [AIRFLOW-XXX] Add Raízen to list of airflow users (#5848) 2019-08-17 19:25:10 +01:00
Tomek f5204cd1e4 [AIRFLOW-5187] Move GCP Container to core (#5793)
This commit moves GCP Container from contrib to core.
For more information check AIP-21.
2019-08-17 19:23:55 +01:00
Jarek Potiuk e515fee768 [AIRFLOW-5237] Less verbose output for CI builds (#5840) 2019-08-17 15:36:20 +01:00
Chao-Han Tsai 46e5fb1a06 [AIRFLOW-XXX] Add Chao-Han to committer list (#5846) 2019-08-16 21:28:19 -07:00
Cooper Gillan a47292d3d4 [AIRFLOW-5239] Fix listing of pylint test scripts (#5844)
List the two separate pylint scripts for use inside the Docker
containers in CONTRIBUTING.md.
2019-08-16 23:26:33 -04:00
Qingping Hou e6a20acb13 [AIRFLOW-5140] fix all missing type annotation errors from dmypy (#5664) 2019-08-16 18:49:32 +01:00
Felix Uellendall 6d27ced85a [AIRFLOW-5056] Add argument to filter mails in ImapHook and related operators (#5672)
- changes the order of arguments for `has_mail_attachment`, `retrieve_mail_attachments` and `download_mail_attachments`
- add `get_conn` function
- refactor code
- fix pylint issues
- add imap_mail_filter arg to ImapAttachmentToS3Operator
- add mail_filter arg to ImapAttachmentSensor
- remove superfluous tests
- changes the order of arguments in the sensors + operators __init__
2019-08-16 18:31:29 +01:00
bingqinzhou 462ab88f02 [AIRFLOW-5152] Change back autodetect default value from False to True in GoogleCloudStorageToBigQueryOperator. (#5771)
Set autodetect default value from false to be true to avoid breaking downstream
services using GoogleCloudStorageToBigQueryOperator but not aware of the newly
added autodetect field.

This is to fix the current regression introduced by #3880
2019-08-16 18:22:02 +01:00
Daniel Imberman ef7e37c90d [AIRFLOW-5235] Fixes bug where K8s CI does not properly create user (#5838) 2019-08-16 16:01:01 +01:00
Jarek Potiuk 5cfe9c29a0
[AIRFLOW-5233] Fixed consistency in whitespace (tabs/eols) + common problems (#5835)
* [AIRFLOW-5233] Fixed consistency in whitespace (tabs/eols) + common problems
2019-08-16 10:35:34 -04:00
Tomek e2d76b9215 [AIRFLOW-5184] Move GCP Natural Language to core (#5792) 2019-08-16 11:21:18 +01:00
Kamil Breguła 676dafa1db
[AIRFLOW-XXX] Remove 'Setup Test Environment using MySQL' article (#5833) 2019-08-16 08:56:14 +02:00
Jarek Potiuk 4d491f394e
[AIRFLOW-5225] Consistent licence for all JS files (#5827) 2019-08-15 20:06:18 -04:00
Jarek Potiuk 3fac1bd04a
[AIRFLOW-5229] Consistent licences to all other files (#5831) 2019-08-15 20:02:53 -04:00
Jarek Potiuk 2ea2c539d8
[AIRFLOW-5227] Consistent licence for .sql files (#5829) 2019-08-15 19:59:35 -04:00
Kevin Yang 203e6e891e
[AIRFLOW-4285] Update task dependency context defination and usage (#5079) 2019-08-15 15:06:12 -07:00
Kaxil Naik e550afc762 [AIRFLOW-XXX] Remove duplicate lines from CONTRIBUTING.md (#5830) 2019-08-15 09:13:12 -04:00
Jarek Potiuk 70e937a8d8
[AIRFLOW-5161] Static checks are run automatically in pre-commit hooks (#5777) 2019-08-14 20:53:28 -04:00
Kamil Breguła 84b65d835e
[AIRFLOW-XXX] Group references in one section (#5776) 2019-08-14 23:57:50 +02:00
Tomek 40745aa225 [AIRFLOW-5183] Preprare documentation for new GCP import paths (#5791) 2019-08-14 19:07:43 +02:00
Ash Berlin-Taylor 4e03d2390f
[AIRFLOW-5179] Remove top level __init__.py (#5818)
The recent commit 3724c2aa to master introduced a __init__.py file in
the project root folder, which basically breaks all imports in local
development (`pip install -e .`) as it turns the project root into a
package.

[ci skip]
2019-08-14 15:07:53 +01:00
Daniel Frank eeac82318a [AIRFLOW-5210] Make finding template files more efficient (#5815)
For large DAGs, iterating over template fields to find template files can be time intensive.
Save this time for tasks that do not specify a template file extension.
2019-08-14 15:20:00 +02:00
Kaxil Naik 34fbd029f7
[AIRFLOW-5209] Bump Sphinx version to fix doc build (#5814) 2019-08-14 10:39:40 +01:00
damon09273@gmail.com 7935e9378c [AIRFLOW-5211] Add pass_value to template_fields for BigQueryValueCheckOperator (#5816) 2019-08-14 11:29:44 +02:00
Kamil Breguła 877e42d884
[AIRFLOW-5133] Keep original env state in provide_gcp_credential_file (#5747) 2019-08-14 11:26:10 +02:00
Ryan Yuan 09b9610bee [AIRFLOW-4908] Implement BigQuery Hooks/Operators for update_dataset, patch_dataset and get_dataset (#5546)
Implement BigQuery Hooks/Operators for update_dataset, patch_dataset and get_dataset
2019-08-14 11:19:26 +02:00
wmorris75 80bd5ff4f6 [AIRFLOW-5052] Added the include_deleted params to salesforce make_query (#5717) 2019-08-13 23:30:38 +01:00
Kaxil Naik 44eb89d672
[AIRFLOW-4222] Add cli autocomplete for bash & zsh (#5789) 2019-08-13 22:34:33 +01:00
Akshesh Doshi 3e2a02751c [AIRFLOW-4843] Allow orchestration via Docker Swarm (SwarmOperator) (#5489)
* [AIRFLOW-4843] Allow orchestration via Docker Swarm (SwarmOperator)

Add support for running Docker containers via Docker Swarm
which allows the task to run on any machine (node) which
is a part of your Swarm cluster

More details: https://issues.apache.org/jira/browse/AIRFLOW-4843

Built with <3 at Agoda!
2019-08-13 16:29:32 +02:00
Maulik Soneji a7a39003f9 [AIRFLOW-5165] make number of dataproc masters configurable (#5781) 2019-08-13 11:43:16 +02:00
Joel Croteau e71fe2c844 [AIRFLOW-5114] Fix gcp_transfer_hook behavior with default operator arguments (#5727)
`GCPTransferServiceHook.wait_for_transfer_job` defeaults its `timeout`
parameter to 60 and assumes it is an integer or at least comparable to
one. This is a problem as some of the built-in operators that use it
like `S3ToGoogleCloudStorageTransferOperator` and
`GoogleCloudStorageToGoogleCloudStorageTransferOperator` default their
`timeout` param to `None`, and when they call this method with their
default value, it causes an error. Fix this by allowing
`wait_for_transfer_job` to accept a timeout of `None` and fill in
appropriate defaults. This also adds functionality to allow it to take
a `timedelta` instead of an integer, allows seconds to be any real, as
there is really no need for them to actually be an integer, and fixes
the counting of time for determining timeout to be a bit more accurate.
2019-08-13 09:58:32 +02:00
Kamil Breguła dd5811987e
[AIRFLOW-5131] Create scopes property in GoogleCloudBaseHook (#5745) 2019-08-13 04:21:22 +02:00
Joel Croteau 1cf8bc4ae2 [AIRFLOW-5104] Set default schedule for GCP Transfer operators (#5726)
The GCS Transfer Service REST API requires that a schedule be set, even for
one-time immediate runs. This adds code to
`S3ToGoogleCloudStorageTransferOperator` and
`GoogleCloudStorageToGoogleCloudStorageTransferOperator` to set a default
one-time immediate run schedule when no `schedule` argument is passed.
2019-08-13 04:05:28 +02:00
Felix Uellendall 571ffcc97f [AIRFLOW-4686] Make dags Pylint compatible (#5753) 2019-08-13 03:26:50 +02:00
Kamil Breguła 60a032f4b8
[AIRFLOW-5124] Add gzip support for S3ToGoogleCloudStorageOperator (#5736) 2019-08-12 19:14:04 +02:00
Tomek 02cf7d6ecf [AIRFLOW-5135] Use gapic ClientInfo in GoogleCloudBaseHook (#5749)
Use from google.api_core.gapic_v1.client_info import ClientInfo
because this object inherits from
from google.api_core.client_info import ClientInfo
and implements one additional method used by Python SDKs.
2019-08-12 16:52:36 +02:00
Tomek 5c2031ee16 [AIRFLOW-4230] BigQuery schema update options should be a list (#5766) 2019-08-12 16:44:37 +02:00
Kaxil Naik 1211675651
[AIRFLOW-5169] Pass GCP Project ID explicitly to StorageClient in GCSHook (#5783) 2019-08-12 14:59:37 +01:00
Mohannad Albanayosy fae5b37f6b [AIRFLOW-5153] Option to force delete non-empty BQ datasets (#5768)
This gives the option to delete the dataset as well as its tables.
2019-08-12 12:40:15 +01:00
Jonathan Lange c4a9d8b92a [AIRFLOW-5100] Respect safe_mode configuration setting when parsing DAG files (#5757)
The scheduler calls `list_py_file_paths` to find DAGs to schedule. It does so
without passing any parameters other than the directory. This means that
it *won't* discover DAGs that are missing the words "airflow" and "DAG" even
if DAG_DISCOVERY_SAFE_MODE is disabled.

Since `list_py_file_paths` will refer to the configuration if
`include_examples` is not provided, it makes sense to have the same behaviour
for `safe_mode`.
2019-08-12 12:14:07 +01:00
Bas Harenslak 47dd4c99a7 [AIRFLOW-4835] Refactor operator render_template (#5461)
- Refactors `BaseOperator.render_template()` and removes `render_template_from_field()`. The functionality could be greatly simplified into a single `render_template()` function.
- Removes six usage.
- Improves performance by removing two `hasattr` calls and avoiding recreating Jinja environments.
- Removes the argument `attr` to `render_template()` which wasn't used.
- Squashes multiple similar tests into two parameterized tests.
- Adheres to 110 line length.
- Adds support for templating sets.
- Adds Pydoc.
- Adds typing.
2019-08-12 10:06:02 +01:00
Jarek Potiuk 8cf0635793
[AIRFLOW-5159] Checklicence image is not built when not needed (#5774) 2019-08-10 18:11:51 +03:00
iS2.co 2dc4e8450a [AIRFLOW-XXX] Add iS2.co to list of airflow users (#5772)
[AIRFLOW-XXX] Add iS2.co to list of airflow users
2019-08-10 08:56:37 +02:00
Bijan Moudi 2b326db037 Adding AloPeyk to the list of companies using Apache Airflow (#5773) 2019-08-10 08:55:47 +02:00
Daniel Imberman e5726c761d
[AIRFLOW-5139] Allow custom ES configs (#5760)
* AIRFLOW-5139 Allow custom ES configs

While attempting to create a self-signed TLS connection between airflow
and ES, we discovered that airflow does now allow users to modify the
SSL state of the elasticsearchtaskhandler. This commit will allow users
to define ES settings in the airflow.cfg
2019-08-09 16:35:05 -05:00
Zhou Fang f6356f2cbb [AIRFLOW-5088][AIP-24] Add DAG serialization using JSON (#5701)
It implements the method proposed in AIP-24 to serialize DAG. It will be used in DAG persistency in DB to solve webserver scalability issue.
2019-08-09 17:53:14 +01:00
kaxil 974ef9cca3 Revert "[AIRFLOW-5148] Add Google Analytics to the Airflow doc website (#5763)"
This reverts commit 502ed749fe.
2019-08-08 23:16:27 +01:00
Jarek Potiuk a4e3295e19
[AIRFLOW-5143] Caching works for Checklicence images (#5762) 2019-08-08 23:22:57 +02:00
Kaxil Naik 502ed749fe
[AIRFLOW-5148] Add Google Analytics to the Airflow doc website (#5763) 2019-08-08 21:42:48 +01:00