Граф коммитов

27 Коммитов

Автор SHA1 Сообщение Дата
Kamil Breguła 304cb9e685 [AIRFLOW-3744] Abandon the use of obsolete aliases of methods (#4568) 2019-01-22 15:45:12 -08:00
Joshua Carp 5710ef2615 [AIRFLOW-3367] Run celery integration test with redis broker. (#4207) 2018-12-03 09:19:24 +00:00
Kevin Yang 1d53f93966 [AIRFLOW-2761] Parallelize enqueue in celery executor (#4234) 2018-11-28 22:23:44 +00:00
Vardan Gupta 8f15661001 [AIRFLOW-3136] Add retry_number to TaskInstance.key to avoid race condition (#3994)
We were seeing an intermittent issue where executor reports task instance finished while task says it's in queue state, it was due to a race condition between scheduler which was clearing event_buffer in _process_executor_events method in jobs.py executor was about to put next_retry task's status as running which was failed in previous try. So, we thought to add retry_number as the member of TaskInstance key property.
2018-10-31 20:40:31 +00:00
Xiaodong fdda0240c3 [AIRFLOW-3239] Fix test recovery further (#4074)
Prepend "test_" for
- tests/executors/dask_executor.py
- tests/security/kerberos.py
2018-10-21 11:50:53 +01:00
Fokko Driesprong 0e8394fd23 [AIRFLOW-3190] Make flake8 compliant (#4035)
Enforce Flake8 over the entire project
2018-10-12 22:22:52 +01:00
dima-asana 7115883faf [AIRFLOW-3168] More resillient database use in CI (#4014)
Make sure mysql is available before calling it in CI
2018-10-11 09:55:15 +01:00
Fokko Driesprong 491fd743da [AIRFLOW-2918] Remove unused imports 2018-09-21 13:21:42 -07:00
Fokko Driesprong 79f8ee1415 [AIRFLOW-2918] Fix Flake8 violations (#3931) 2018-09-21 15:25:54 +01:00
yrqls21 9b82fcb5fb [AIRFLOW-2156] Parallelize Celery Executor task state fetching (#3830) 2018-09-11 09:12:18 -07:00
Yingbo Wang 9a510086f1 [AIRFLOW-2930] Fix celery excecutor scheduler crash (#3784)
Caused by an update in PR #3740.
execute_command.apply_async(args=command, ...)
-command is a list of short unicode strings and the above code pass multiple
arguments to a function defined as taking only one argument.
-command = ["airflow", "run", "dag323",...]
-args = command = ["airflow", "run", "dag323", ...]
-execute_command("airflow","run","dag3s3", ...) will be error and exit.
2018-08-28 14:33:22 +02:00
Xiaodong 3a023459ee [AIRFLOW-2921][AIRFLOW-2922] Fix bugs in CeleryExecutor (#3773)
Bug-1:
if a task state becomes either SUCCESS or FAILURE or REVOKED,
it will be removed from self.tasks() and self.last_state().
However, because line 108 is not indented properly,
this task will be added back to self.last_state() again.

Bug-2:
When the state is updated, it's referring to the latest
state `task.state` rather than variable `state`.
This may result in dead-lock if the state changed from
`STARTED` to `SUCCESS` after the if-elif-else block
started.

Test case is updated for fix to bug-1.
2018-08-22 11:06:09 +02:00
bolkedebruin 48229328f2
[AIRFLOW-2888] Remove shell=True and bash from task launch (#3740)
shell=True is a security risk. Bash is not required to launch
tasks and will consume extra resources.
2018-08-15 07:13:02 +02:00
Marius van Niekerk e95a1251b7 [AIRFLOW-2240][DASK] Added TLS/SSL support for the dask-distributed scheduler.
As of 0.17.0 dask distributed has support for
TLS/SSL.

[dask] Added TLS/SSL support for the dask-
distributed scheduler.

As of 0.17.0 dask distributed has support for
TLS/SSL.

Add a test for tls under dask distributed

Closes #2683 from mariusvniekerk/dask-ssl
2018-04-18 09:45:52 -07:00
Carl Johan Gustavsson 32c5f445e4 [AIRFLOW-610] Respect _cmd option in config before defaults
The command versions of config parameters were
overriden by the
default config. E.g sql_alchemy_conn got the
default value even
when sql_alchemy_conn_cmd was specified.

Closes #3029 from cjgu/airflow-610
2018-04-16 10:12:08 +02:00
Bolke de Bruin c7a472ed6b [AIRFLOW-2287] Fix incorrect ASF headers
Closes #3219 from bolkedebruin/fix_header
2018-04-14 09:13:23 +02:00
Bolke de Bruin 51180d370c [AIRFLOW-XXX] Upgrade to python 3.5 and disable dask tests
Dask tests seem to create issues down the line.
2017-12-19 15:26:49 +01:00
Fokko Driesprong 30076f1e45 [AIRFLOW-1840] Make celery configuration congruent with Celery 4
Explicitly set the celery backend from the config
and align the config
with the celery config as this might be confusing.

Closes #2806 from Fokko/AIRFLOW-1840-Fix-celery-
config
2017-12-11 18:56:29 +01:00
Bolke de Bruin 9624f5f24e [AIRFLOW-1809] Update tests to use timezone aware objects 2017-11-27 15:54:27 +01:00
Bolke de Bruin 2abead7049 [AIRFLOW-1641] Handle executor events in the scheduler
While in Backfills we do handle the executor
state,
we do not in the Scheduler. In case there is an
unspecified
error (e.g. a timeout, airflow command failure)
tasks
can get stuck.

Closes #2715 from bolkedebruin/AIRFLOW-1641
2017-10-27 11:02:24 +02:00
Edgar Rodriguez cdfced3248 [AIRFLOW-1631] Fix local executor unbound parallelism
Before, if unlimited parallelism was used passing
`0` for the
parallelism value, the local executor would stall
execution since no
worker was being created, violating the
BaseExecutor contract on the
parallelism option.

Now, if unbound parallelism is used, processes
will be created on demand
for each task submitted for execution.

Closes #2658 from edgarRd/erod-localexecutor-fix
2017-10-17 11:39:22 -07:00
Bolke de Bruin 65f3b468a2 [AIRFLOW-1527] Refactor celery config
The celery config is currently part of the celery executor definition.
This is really inflexible for users wanting to change it. In addition
Celery 4 is moving to lowercase.

Closes #2542 from bolkedebruin/upgrade_celery
2017-09-25 11:19:16 -07:00
Fokko Driesprong eb2f589099 [AIRFLOW-1604] Rename logger to log
In all the popular languages the variable name log
is the de facto
standard for the logging. Rename LoggingMixin.py
to logging_mixin.py
to comply with the Python standard.

When using the .logger a deprecation warning will
be emitted.

Closes #2604 from Fokko/AIRFLOW-1604-logger-to-log
2017-09-19 10:17:14 +02:00
Bolke de Bruin 1082671184 [AIRFLOW-1308] Disable nanny usage for Dask
Nanny is deprecated and results in build errors.

Closes #2366 from bolkedebruin/fix_dask
2017-06-15 09:44:16 -04:00
Bolke de Bruin d79ed74783 [AIRFLOW-910] Use parallel task execution for backfills
The refactor to use dag runs in backfills caused a
regression
in task execution performance as dag runs were
executed
sequentially. Next to that, the backfills were non
deterministic
due to the random execution of tasks, causing root
tasks
being added to the non ready list too soon.

This updates the backfill logic as follows:
* Parallelize execution of tasks
* Use a leave first execution model
* Replace state updates from the executor by task
based only

Closes #2107 from bolkedebruin/AIRFLOW-910
2017-03-11 09:40:57 -08:00
Jeremiah Lowin fe7881656f [AIRFLOW-862] Fix Unit Tests for DaskExecutor
Unit tests were inadvertently disabled for
DaskExecutor

Closes #2076 from jlowin/fix-dask-tests
2017-02-19 09:30:01 +01:00
Jeremiah Lowin 6e22102782 [AIRFLOW-862] Add DaskExecutor
Adds a DaskExecutor for running Airflow tasks
in Dask clusters.

Closes #2067 from jlowin/dask-executor
2017-02-12 16:06:31 -05:00