Commit graph

6566 commits

Author SHA1 Message Date
Aaron Niskode-Dossett 9d3cbfd2ce AIRFLOW-4740 Accept string `end_date` in DAG default_args (#5381)
A DAG will accept, in its default_args, a start_date as simple as 2019-06-01. If it detects
a string, it converts it to a richer type. However, it did not accept a similar string for
end_date; instead, an exception was thrown.

That's a very confusing user experience. end_date should be as permissive as start_date.
2019-06-18 16:22:50 +01:00
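
The behaviour described in this commit can be illustrated with a minimal sketch. It is not Airflow's actual code: `coerce_dates` and `DATE_KEYS` are hypothetical names, and the sketch assumes plain `YYYY-MM-DD` strings.

```python
from datetime import datetime

# Hypothetical helper, not Airflow's implementation: treat start_date and
# end_date the same way when they arrive as strings in default_args.
DATE_KEYS = ("start_date", "end_date")


def coerce_dates(default_args):
    """Return a copy of default_args with string dates parsed to datetime."""
    coerced = dict(default_args)
    for key in DATE_KEYS:
        value = coerced.get(key)
        if isinstance(value, str):
            # Assumption: dates are given as YYYY-MM-DD.
            coerced[key] = datetime.strptime(value, "%Y-%m-%d")
    return coerced


args = coerce_dates({"start_date": "2019-06-01", "end_date": "2019-06-30"})
```

Both keys go through the same parsing path, so neither is stricter than the other.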
Ron van Weverwijk 6eb326255d [AIRFLOW-4781] Add the ability to specify ports in kubernetesOperator (#5410)
* [AIRFLOW-4781] Added the ability to specify ports in kubernetesOperator

* [AIRFLOW-4781] Added the ability to specify ports in kubernetesOperator

* [AIRFLOW-4781] Added the ability to specify ports in kubernetesOperator

added docstring

* [AIRFLOW-4781] Added the ability to specify ports in kubernetesOperator

add typehints

Co-Authored-By: Fokko Driesprong <fokko@driesprong.frl>

* [AIRFLOW-4781] Added the ability to specify ports in kubernetesOperator

fixed docstrings and typehints
2019-06-18 13:09:15 +02:00
Joshua Carp 929b8fd187 [AIRFLOW-4423] Improve date handling in mysql to gcs operator. (#5196)
* Handle TIME columns
* Ensure DATETIME and TIMESTAMP columns treated as UTC
2019-06-18 11:18:31 +02:00
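
A rough sketch of the two bullet points above, under stated assumptions: `convert_value` is a hypothetical helper, naive date and datetime values are interpreted as UTC, and TIME values may arrive either as `datetime.time` or as a non-negative `datetime.timedelta`, depending on the driver.

```python
import calendar
from datetime import date, datetime, time, timedelta


def convert_value(value):
    """Hypothetical converter for MySQL temporal values before export."""
    if isinstance(value, (datetime, date)):
        # Interpret the naive value as UTC and return epoch seconds.
        return calendar.timegm(value.timetuple())
    if isinstance(value, timedelta):
        # Assumption: the driver returned a TIME column as a timedelta.
        total = int(value.total_seconds())
        return "{:02d}:{:02d}:{:02d}".format(
            total // 3600, total % 3600 // 60, total % 60
        )
    if isinstance(value, time):
        return value.strftime("%H:%M:%S")
    return value
```

Using `calendar.timegm` rather than `time.mktime` keeps the result independent of the local time zone of the machine running the export.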
Tao Feng d0f6a805c2 [AIRFLOW-4716] Instrument dag loading time duration (#5350) 2019-06-18 09:51:28 +08:00
Tomek 5c73a8cc8c [AIRFLOW-4765] Fix DataProcPigOperator execute method (#5426) 2019-06-17 17:29:16 +02:00
Fokko Driesprong 8644a6f94c [AIRFLOW-4777] Simplify python_requires in setup (#5405)
From Airflow 2.0 we support Python 3.5+, but not yet Python 4.0
2019-06-17 16:23:36 +02:00
dstandish 09a0afe9ad [AIRFLOW-4798] obviate interdependencies for dagbag and TI tests (#5422) 2019-06-17 15:39:07 +02:00
Ryan Williams 9d54a6f1f9 [AIRFLOW-4800] fix GKEClusterHook ctor calls (#5424) 2019-06-17 15:21:52 +02:00
Tomek 2c0af40c93 [AIRFLOW-4766] Add autoscaling option for DataprocClusterCreateOperator (#5425) 2019-06-17 12:56:01 +02:00
dstandish 90b18000cd [AIRFLOW-4799] don't mutate self.env in BashOperator execute method (#5421)
* in tests using bash operator repeatedly, env is populated with contents of environment.
* on subsequent runs, render_templates will try to render contents of env.
* this produces unpredictable behavior where missing template error may be thrown, or env paths may be replaced with "template file" contents
2019-06-17 10:32:44 +02:00
Joshua Carp 9cfeb31fc1 [AIRFLOW-4795] Upgrade alembic to latest release. (#5411) 2019-06-17 09:56:37 +02:00
Aaron Niskode-Dossett 2d18f0738f AIRFLOW-4793 Add signature_name to mlengine operator (#5417)
* AIRFLOW-4793 Add signature_name to mlengine operator

* Add pydoc
2019-06-14 22:43:51 +02:00
jj-ian 3b29290ecc [AIRFLOW-3211] Reattach to GCP Dataproc jobs upon Airflow restart (#4083)
* [AIRFLOW-3211] Reattach to GCP Dataproc jobs upon Airflow restart

This change allows Airflow to reattach to existing Dataproc jobs upon
scheduler restart, preventing duplicate job submissions. Previously,
if the Airflow scheduler restarts while it's running a job on GCP
Dataproc, it'll lose track of that job, mark the task as failed, and
eventually retry. However, the jobs may still be running on Dataproc
and maybe even finish successfully. So when Airflow retries and reruns
the job, the same job will run twice. This can result in issues like
delayed workflows, increased costs, and duplicate data.

* [AIRFLOW-3211] Fixed flake8 formatting

* Update test with new GCP_PROJECT convention

* More flake8 cleanups
2019-06-14 13:59:25 +02:00
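
The reattach idea can be sketched as "look up before submitting". Everything below is hypothetical: `FakeJobClient` is an in-memory stand-in rather than the Dataproc API, and the sketch assumes each task maps to a deterministic job ID.

```python
class FakeJobClient:
    """Hypothetical in-memory stand-in for a job service client."""

    def __init__(self):
        self.jobs = {}
        self.submissions = 0

    def find(self, job_id):
        # Return the job state, or None if the job is unknown.
        return self.jobs.get(job_id)

    def submit(self, job_id):
        self.submissions += 1
        self.jobs[job_id] = "RUNNING"


def submit_or_reattach(client, job_id):
    """Submit a job only if no live job with the same ID exists."""
    if client.find(job_id) in ("PENDING", "RUNNING"):
        return "reattached"
    client.submit(job_id)
    return "submitted"


client = FakeJobClient()
# The second call simulates a retry after a scheduler restart.
outcomes = [submit_or_reattach(client, "task-2019-06-14") for _ in range(2)]
```

The retry finds the running job and does not submit a second one, which is the duplicate-run case the commit message describes.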
Chris McLennon d89c17d5a5 [AIRFLOW-4750] Log identified zombie task instances (#5389) 2019-06-13 09:44:24 +01:00
Hitesh Gupta d94197d16a [AIRFLOW-5409] Added name under Who uses Apache Airflow for tracking purpose. (#5409)
* Update README.md

* Corrected Sequence

Corrected Sequence in who uses airflow section
2019-06-12 22:49:15 -07:00
Dan MacTough 34056f8fd2 [AIRFLOW-4760] Fix zip-packaged DAGs disappearing from DagBag when reloaded (#5404) 2019-06-11 13:25:11 +01:00
dstandish 63f3694003 [AIRFLOW-4756] add ti.state to ti.start_date as criteria for gantt (#5399)
* viewing gantt chart of running dagrun with a task that failed initially but was cleared would result in json encode error
* when you clear a TI it only nulls out the state -- not the start_date. so these cleared TIs would still be added to `tis` and thus to gantt but would have no state, and json conversion does not like None type for state.
* we should check state in addition to start_date to handle this case
2019-06-11 13:15:05 +01:00
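
A minimal sketch of the extra check, using a hypothetical `TI` tuple in place of the real task instance model:

```python
from collections import namedtuple
from datetime import datetime

# Hypothetical, simplified task instance record.
TI = namedtuple("TI", ["task_id", "start_date", "state"])


def gantt_rows(task_instances):
    """Keep only task instances that have both a start_date and a state."""
    return [ti for ti in task_instances if ti.start_date and ti.state]


tis = [
    TI("extract", datetime(2019, 6, 11, 12, 0), "success"),
    # A cleared TI: state was nulled out but start_date was kept.
    TI("load", datetime(2019, 6, 11, 12, 5), None),
    TI("report", None, None),
]
rows = gantt_rows(tis)
```

Filtering on `start_date` alone would keep the cleared `load` task, whose `None` state is the value the JSON conversion rejected.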
Chris Mitchell d1dceff69a [AIRFLOW-4463] Handle divide-by-zero errors in short retry intervals (#5243) 2019-06-11 11:36:29 +01:00
dstandish 68c07c9de2 [AIRFLOW-3057] add prev_*_date_success to template context (#5372)
* Two new variables are added to template context: prev_execution_date_success and prev_start_date_success.
* These return the execution / start dates for the same task in the prior successful DAG run, without regard to TI status.
* Lazy evaluation is employed so that query to look up prev_ti is not executed unnecessarily.
2019-06-11 11:14:34 +01:00
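
The lazy-evaluation point can be sketched as follows. `LazyValue` and `lookup_prev_success_date` are hypothetical names; the actual change may well use a transparent proxy object instead of an explicit `get()`.

```python
class LazyValue:
    """Hypothetical wrapper that runs its factory once, on first access."""

    def __init__(self, factory):
        self._factory = factory
        self._computed = False
        self._value = None

    def get(self):
        if not self._computed:
            self._value = self._factory()
            self._computed = True
        return self._value


calls = []


def lookup_prev_success_date():
    # Stand-in for the database query that finds the previous successful run.
    calls.append("query")
    return "2019-06-10T00:00:00"


context = {"prev_execution_date_success": LazyValue(lookup_prev_success_date)}
value = context["prev_execution_date_success"].get()
value_again = context["prev_execution_date_success"].get()
```

Building the context costs no query; the lookup runs once, and only if a template actually reads the variable.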
Joshua Carp dadca8c1eb [AIRFLOW-4759] Batch queries in set_state API. (#5403)
Setting a `dagrun` to success or failure calls set_state for each task in 
the dag, running multiple database queries for each one. We can reduce
the number of queries, and improve performance for the associated
endpoints, by setting the states of all relevant tasks in the same query.
2019-06-11 10:57:22 +01:00
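
The batching idea, sketched with the standard-library `sqlite3` module rather than Airflow's SQLAlchemy models. The table layout and the `set_states` helper are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE task_instance (task_id TEXT, state TEXT)")
conn.executemany(
    "INSERT INTO task_instance VALUES (?, ?)",
    [("a", "running"), ("b", "queued"), ("c", "running")],
)


def set_states(conn, task_ids, state):
    """Update all given tasks with a single UPDATE statement."""
    task_ids = list(task_ids)
    if not task_ids:
        return 0
    placeholders = ", ".join("?" for _ in task_ids)
    sql = "UPDATE task_instance SET state = ? WHERE task_id IN ({})".format(
        placeholders
    )
    cursor = conn.execute(sql, [state] + task_ids)
    return cursor.rowcount


updated = set_states(conn, ["a", "b"], "success")
```

One statement with an `IN` clause replaces a query per task, so the number of round trips no longer grows with the size of the DAG.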
Victor Villas 5db23c6b3d [AIRFLOW-4737] Increase and document celery queue name limit (#5383) 2019-06-11 10:54:25 +01:00
Jarek Potiuk f710a0db49 [AIRFLOW-4757] Selectively disable missing docstrings for tests (#5400) 2019-06-10 17:51:35 +02:00
Jarek Potiuk 4b7667d2ad [AIRFLOW-4659] Fix pylint problems for api module (#5398) 2019-06-10 13:53:52 +02:00
Gordon Ball 201e67100c [AIRFLOW-4731] Fix GCS hook with google-storage-client 1.16 (#5368)
google-storage-client 1.16 introduced a breaking change where the
signature of client.get_bucket changed from (bucket_name) to
(bucket_or_name). Calls with named arguments to this method now fail.
This commit makes all calls positional to work around this.
2019-06-10 11:44:31 +01:00
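
Why positional calls survive a parameter rename, shown with two hypothetical functions that only mimic the old and new signatures described above (they are not the Google client):

```python
def get_bucket_old(bucket_name):
    """Mimics the signature before the rename."""
    return "bucket:" + bucket_name


def get_bucket_new(bucket_or_name):
    """Mimics the signature after the rename."""
    return "bucket:" + str(bucket_or_name)


def call_by_keyword(get_bucket):
    # Keyword calls are tied to the parameter name.
    try:
        return get_bucket(bucket_name="logs")
    except TypeError:
        return None


def call_positionally(get_bucket):
    # Positional calls depend only on argument order.
    return get_bucket("logs")
```

`call_by_keyword(get_bucket_new)` returns `None` because the keyword no longer exists, while `call_positionally` works with both versions.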
Ash Berlin-Taylor 3b8e99166c [AIRFLOW-3729] Fix pylint issues from #4554
That PR was opened before Pylint was added to the repo so we merged with
a green check that then broke things
2019-06-10 10:54:37 +01:00
Bruno Campos 0224105343 [AIRFLOW-3729] Support "DownwardAPI" in env variables for KubernetesPodOperator (#4554)
https://kubernetes.io/docs/tasks/inject-data-application/environment-variable-expose-pod-information/#the-downward-api
2019-06-10 10:26:06 +01:00
Jarek Potiuk f0ec5fee2a [AIRFLOW-4755] Fixed default DOCKERHUB_USER (#5397) 2019-06-10 08:52:23 +02:00
Jarek Potiuk 54d41dff6e [AIRFLOW-4754] Fixed failure when no .git repo is found (#5396) 2019-06-09 21:24:04 +02:00
Bas Harenslak 6dd3f31a04 [AIRFLOW-4689] Make setup.py Pylint compatible (#5395) 2019-06-09 08:35:22 -07:00
Jarek Potiuk 78c592ad86 [AIRFLOW-4116] Dockerfile now supports CI image build on DockerHub (#4937) 2019-06-09 07:58:58 -07:00
Jarek Potiuk 2fd7567070 [AIRFLOW-4753] fixes pylint for json log formatter (#5393) 2019-06-09 16:40:18 +02:00
Bas Harenslak 189bbfd85d [AIRFLOW-4670] Make airflow/example_dags Pylint compatible (#5361) 2019-06-09 07:36:33 -07:00
Jarek Potiuk 3891de68af [AIRFLOW-4752] Add missing * in build exclusion and generated config (#5392) 2019-06-09 07:35:17 -07:00
Bas Harenslak 3e99121dc0 [AIRFLOW-4364] Allow module names to begin with 0-9 and max 60 chars (#5391) 2019-06-09 06:58:27 -07:00
Bas Harenslak 02ef974e4b [AIRFLOW-4669] Make airflow/dag Pylint compatible (#5362) 2019-06-09 00:26:44 -07:00
Marcin Szymański 6db86c3d55 [AIRFLOW-4743] Add environment variables support to SSHOperator (#5385)
* [AIRFLOW-4743] Add environment variables support to SSHOperator
2019-06-08 23:13:25 -07:00
Kamil Breguła 6cf722182a [AIRFLOW-4473] Move Papermill guide (#5371) 2019-06-08 23:06:06 -07:00
Bas Harenslak 18b27387f5 [AIRFLOW-4725] Fix setup.py PEP440 & Sphinx-PyPI-upload dependency (#5363) 2019-06-08 22:41:56 -07:00
kurtqq 827d6d40c5 [AIRFLOW-XXX] fix typos in README.md (#5390)
fix typos
2019-06-08 13:02:19 +02:00
Tao Feng abb506ba5b [AIRFLOW-XXX] Fix WS-2019-0032 (#5384) 2019-06-07 12:05:50 +01:00
Chao-Han Tsai 591a9e8346 [AIRFLOW-4521] Don't load the whole DagBag in Pause dag actions (#5342) 2019-06-07 11:34:57 +01:00
Ghislain Antony Vaillant f1734e57dd [AIRFLOW-XXX] Add missing word in concepts documentation (#5380) 2019-06-07 08:47:16 +05:30
aoen f19db28760 [AIRFLOW-3160] (Unrevert) Load latest_dagruns asynchronously (#5339) 2019-06-06 23:19:14 +05:30
Kamil Breguła 8e87e89373 [AIRFLOW-4738] Enforce exampleinclude for example DAGs (#5375) 2019-06-06 15:25:12 +01:00
Philippe Gagnon 68e8c998ad [AIRFLOW-XXX] Add .github/SECURITY.md (#5329)
This commit adds a .github/SECURITY.md file that defines the
contents of the "Policy" tab in the new "Security" section of
the GitHub interface.

Currently the Policy tab obtains its content from the
docs/security.rst file, which contains technical, non-policy
related information. This commit retains the
"Reporting Vulnerabilities" section of docs/security.rst, which
is relevant, and strips the extraneous content.
2019-06-06 09:33:54 +01:00
Luke Bodeen 9371ef437d [AIRFLOW-XXX] changing docutap to experity as it underwent merger (#5338)
[ci skip]
2019-06-06 09:32:27 +01:00
Takuya Sato 481a33b498 [AIRFLOW-4501] Register pendulum datetime converter for sqla+pymysql (#5190) 2019-06-06 09:13:40 +01:00
Kamil Breguła e90f398af6 [AIRFLOW-XXX] Add Aizhamal Nurmamat kyzy to contributors list (#5370) 2019-06-06 12:53:46 +05:30
Luis Gustavo Couto Wacker 89b5d607b0 [AIRFLOW-XXX] Add StoneCo to section "Currently using Airflow" (#5377) 2019-06-06 12:52:02 +05:30
OmerJog 9b58041574 [AIRFLOW-1381] Allow setting host temporary directory in DockerOperator (#5369)
Allow user to specify temporary directory to use on the host machine;
default settings will cause an error on OS X due to the standard
temporary directory not being shared to Docker.

Based on PR #2418 by benjamin@techcitylabs.com. Closes #2418 #4315
2019-06-05 13:07:26 +01:00
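
A small sketch of the idea using only the standard library. `make_host_tmp` and the `airflowtmp` prefix are illustrative assumptions, not the operator's confirmed code, and `shared_root` stands in for a host path that Docker is allowed to mount.

```python
import os
import tempfile


def make_host_tmp(host_tmp_dir=None):
    """Create a temporary directory, optionally under host_tmp_dir.

    With host_tmp_dir=None the platform default location is used.
    """
    return tempfile.TemporaryDirectory(prefix="airflowtmp", dir=host_tmp_dir)


# Simulate a directory that is shared with Docker.
with tempfile.TemporaryDirectory() as shared_root:
    with make_host_tmp(shared_root) as tmp_path:
        same_parent = os.path.samefile(os.path.dirname(tmp_path), shared_root)
```

Passing `dir=` places the temporary directory under a path the user controls, which matters when the default temporary location is not shared with the Docker daemon.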