Граф коммитов

213 Коммитов

Автор SHA1 Сообщение Дата
Pier-Luc Caron St-Pierre 386ece44fc [AIRFLOW-XXX] Clarify documentation related to autodetect parameter in GCS_to_BQ Op (#5294) 2019-05-24 22:36:47 +01:00
Martijn van de Grift 3f276fd4fb [AIRFLOW-4471] Dataproc operator templated fields improvements (#5250) 2019-05-09 10:56:08 +02:00
Kaxil Naik 85899b3aee
[AIRFLOW-4334] Remove deprecated GCS features & Rename built-in params (#5087) 2019-04-18 15:38:41 +01:00
Kaxil Naik e26e340e7c [AIRFLOW-4313] Remove the Mesos executor (#5115)
* [AIRFLOW-4313] Remove the Mesos executor

* Update UPDATING.md
2019-04-17 18:28:58 +08:00
Fokko Driesprong c63ddccf8d [AIRFLOW-3934] Increase standard Dataproc PD size (#4749) 2019-04-15 19:02:47 +01:00
Kaxil Naik c6efd01264 [AIRFLOW-4255] Make GCS Hook Backwards compatible (#5089)
* [AIRFLOW-4255] Make GCS Hook Backwards compatible

* Update UPDATING.md

* Add option to stop warnings

* Update test_gcs_hook.py

* Add tests
2019-04-14 21:22:06 +02:00
Felix Uellendall 3eb2f547ac [AIRFLOW-3993] Add tests for salesforce hook (#4829)
- refactor code
- update docs
- change sign_in to get_conn
- add salesforce to devel_all packages
- add note to UPDATING.md

Co-Authored-By: mik-laj <mik-laj@users.noreply.github.com>
2019-04-14 21:07:43 +02:00
OmerJog 8ed8346c18 [AIRFLOW-2421] HTTPHook verifies HTTPS certificats by default (#4855)
Change the default value of verify from False to True
2019-04-11 13:33:33 +01:00
Kaxil Naik ec7c67ff95
[AIRFLOW-4255] Replace Discovery based api with client based for GCS (#5054) 2019-04-09 19:46:00 +01:00
Ash Berlin-Taylor e8cd3e23e0 [AIRFLOW-XXX] CHANGELOG and UPDATING for 1.10.3 2019-04-06 10:04:23 +01:00
Felix Uellendall b93f2649ae [AIRFLOW-4220] Change CloudantHook to a new major version and add tests (#5023)
- upgrade cloudant version from `>=0.5.9,<2.0` to `>=2.0`
- remove the use of the `schema` attribute in the connection
- remove `db` function since the database object can also be retrieved by calling `cloudant_session['database_name']`
- update docs
- refactor code
2019-04-05 23:12:13 +01:00
Kaxil Naik e732006fdd revert [AIRFLOW-4122] Remove chain function
Reverts 2 commits:
- ee71a8bb10
- 430efc9afb
2019-04-05 23:08:26 +01:00
Felix Uellendall 55aca52d1b [AIRFLOW-4014] Change DatastoreHook and add tests (#4842)
- update default used version for connecting to the Admin API from v1beta1 to v1
- move the establishment of the connection to the function calls instead of the hook init
- change get_conn signature to be able to pass an is_admin arg to set an admin connection
- rename GoogleCloudBaseHook._authorize function to GoogleCloudBaseHook.authorize
- rename the `partialKeys` argument of function `allocate_ids` to `partial_keys`.
- add tests
- update docs
- refactor code

Move version attribute from get_conn to __init__

- revert renaming of authorize function
- improve docs
- refactor code
2019-03-31 20:56:13 +02:00
saurabh gulati 06d2f53a32 [AIRFLOW-4172] Fix changes for driver class path option in Spark Subm… (#4992)
* [AIRFLOW-4172] Fix changes for driver class path option in Spark Submit Operator

* [AIRFLOW-4172] Fix changes for driver class path option in Spark Submit
2019-03-31 18:55:46 +02:00
Jiajie Zhong ffe1412d5e [AIRFLOW-4062] Improve docs on install extra package commands (#4966)
Some command for installing extra packages like
`pip install apache-airflow[devel]` cause error
in special situation/shell, We should clear them
by add quotation like
`pip install 'apache-airflow[devel]'`
2019-03-25 12:14:43 +00:00
Ash Berlin-Taylor 1c43cde65c
[AIRFLOW-3743] Unify different methods of working out AIRFLOW_HOME (#4705)
There were a few ways of getting the AIRFLOW_HOME directory used
throughout the code base, giving possibly conflicting answer if they
weren't kept in sync:

- the AIRFLOW_HOME environment variable
- core/airflow_home from the config
- settings.AIRFLOW_HOME
- configuration.AIRFLOW_HOME

Since the home directory is used to compute the default path of the
config file to load, specifying the home directory Again in the config
file didn't make any sense to me, and I have deprecated that.

This commit makes everything in the code base use
`settings.AIRFLOW_HOME` as the source of truth, and deprecates the
core/airflow_home config option.

There was an import cycle form settings -> logging_config ->
module_loading -> settings that needed to be broken on Python 2 - so I
have moved all adjusting of sys.path in to the settings module

(This issue caused me a problem where the RBAC UI wouldn't work as it
didn't find the right webserver_config.py)
2019-03-25 11:10:28 +00:00
Ryan Yuan e92f09b565 [AIRFLOW-3987] Unify GCP's Connection IDs (#4818) 2019-03-25 11:03:26 +00:00
Ash Berlin-Taylor c159e8e391
Revert "[AIRFLOW-4062] Improve docs on install extra package commands (#4897)" (#4965)
This reverts commit d4655c506e as it causes doc test warnings/failures.
2019-03-24 12:05:23 +00:00
Jiajie Zhong d4655c506e [AIRFLOW-4062] Improve docs on install extra package commands (#4897)
Some command for installing extra packages are
`pip install apache-airflow[devel]` we should
clear install extra package command to
`pip install 'apache-airflow[devel]'`

[ci skip]
2019-03-24 11:34:59 +00:00
Kamil Breguła 4c6a591a90 [AIRFLOW-3659] Create Google Cloud Transfer Service Operators (#4792)
Co-authored-by: Antoni Smolinski <antoni.smolinski@polidea.com>
2019-03-23 22:52:26 +00:00
Tao Feng 430efc9afb [AIRFLOW-XXX] Note removal, not deprecation of chain in UPDATING.md (#4953) 2019-03-21 16:27:29 +00:00
zhongjiajie ee71a8bb10 [AIRFLOW-4122] Remove chain function (#4940)
* [AIRFLOW-4122] Remove chain function

Bit operation like `>>` or `<<` are suggested
to set dependency, which visual and easier to
explain. and have multiple ways is confusion

* change UPDATING.md as recommend
[ci skip]
2019-03-19 19:27:39 -07:00
Kristian Yrjölä 781a82f638 [AIRFLOW-3997] Extend Variable.get so it can return None when var not found (#4819)
This will not change existing regular functions in the `Variable` class. If
variable `foo` doesn't exist:

```
foo = Variable.get("foo")
-> KeyError
```

For passing `default_var=None` to get, `None` is returned instead:
```
foo = Variable.get("foo", default_var=None)
if foo is None:
    handle_missing_foo()
```
2019-03-13 16:14:28 +00:00
Tao Feng dda309e662
[AIRFLOW-4020] Remove viewer DAG edit permissions (#4845) 2019-03-05 15:21:26 -08:00
Ash Berlin-Taylor 0230055190 [AIRFLOW-3353] Upgrade Redis client (#4834)
Now that Celery/Kombu have updated and work with RedisPy 3.x (they in
fact force us to use 3.2) we should re-introduce this change.
2019-03-04 19:04:14 +01:00
Xiaodong 6abcdfd496 [AIRFLOW-3793] Decommission configuration items for Flask-Admin web UI & related codes (#4637) 2019-03-04 15:13:29 +00:00
Kamil Breguła a84fb73627 [AIRFLOW-3867] Rename GCP's subpackage (#4690) 2019-02-27 15:21:57 +01:00
Joshua Carp 4e88726a8d [AIRFLOW-3932] Optionally skip dag discovery heuristic. (#4746) 2019-02-23 08:55:34 -08:00
Ryan Yuan 5c170f0594 [AIRFLOW-3933] Fix various typos (#4747)
Fix typos
2019-02-21 11:50:05 +01:00
marengaz 067a1e3f4a [AIRFLOW-3249] Make all take the same named `do_xcom_push` flag (#4345) 2019-02-15 15:25:41 +00:00
Felix 9c11f41d31 [AIRFLOW-XXX] Fix headlines in UPDATING.md (#4697) 2019-02-12 16:01:20 +01:00
Felix 20c7eeb0a6 [AIRFLOW-XXX] Add notes for imports and sensors (#4698)
- Add note for the removal of deprecated import mechanism
- Add note for changes to sensor imports
2019-02-12 16:00:13 +01:00
Kamil Breguła 6d229b240b [AIRFLOW-3707] Group subpackages/extras by cloud providers (#4524) 2019-02-08 10:23:32 +00:00
Tao Feng 2f70347bdc
[AIRFLOW-3771] Minor refactor securityManager (#4594) 2019-01-26 22:49:58 -08:00
Kaxil Naik 0b1e453033 [AIRFLOW-XXX] Update the UPDATING.md file for 1.10.2 2019-01-23 01:03:45 +00:00
Andrew Stahlman 9f8ca32da4 [AIRFLOW-3752] Add/remove user from role via CLI (#4572)
* [AIRFLOW-3752] Add/remove user from role via the CLI

Update the `users` subcommand to enable 2 new actions:

- `--add-role`: Make the user a member of the given role
- `--remove-role`: Remove the user's membership in the given role

For installations that use an external identity provider (e.g., Google
OAuth) the username is typically a long ID string. For the sake of
convenience, we allow the CLI operator to reference the target user
via either their `username` or their `email` (but not both).

* Update argparse spec

Accidentally left off this update to the argparse spec in the last
commit.

* Add unit tests

* Fix lint failures
2019-01-22 14:29:15 -08:00
Jarek Potiuk 383ec374ea [AIRFLOW-3713] Updated documentation for GCP optional project_id (#4541) 2019-01-17 09:37:07 -08:00
Verdan Mahmood c030729dcb [AIRFLOW-3303] Deprecate old UI in favor of FAB (#4339) 2019-01-14 14:33:45 +00:00
bolkedebruin 69adaee25c [AIRFLOW-3692] Remove ENV variables to avoid GPL (#4506) 2019-01-13 12:34:00 +00:00
Felix ff6ccee748 [AIRFLOW-3594] Unify different License Header 2019-01-11 19:17:20 +01:00
Fokko Driesprong 327860fe4f [AIRFLOW-3515] Remove the run_duration option (#4320) 2019-01-08 10:40:10 +00:00
Tao Feng 67572025cc [AIRFLOW-3612] Remove incubation/incubator mention (#4419) 2019-01-05 14:05:25 +00:00
aoen ecc88a5326 [AIRFLOW-3581] Fix next_ds/prev_ds semantics for manual runs (#4385) 2018-12-30 22:31:11 -08:00
Camilo Aguilar fb52d7b7ca Update UPDATING.md (#4348)
[AIRFLOW-XXX] Add section to Updating.md regarding timezones
2018-12-26 21:25:35 +01:00
Joshua Carp 71129cd210 [AIRFLOW-3540] Respect environment config when looking up config file. (#4340) 2018-12-19 10:28:56 -08:00
Kaxil Naik e44fc88397
[AIRFLOW-3447] Add 2 options for ts_nodash Macro (#4323) 2018-12-15 23:13:36 +00:00
thomasbrockmeier 9d87552ceb [AIRFLOW-1552] Airflow Filter_by_owner not working with password_auth (#4276)
Local users were always a superuser, this adds a column to the DB (and defaults to false,
which is going to cause a bit of an upgrade pain for people, but defaulting to not being an
admin is the only secure default.)
2018-12-15 15:27:10 +00:00
Daniel Lamblin f8027d4a4f [AIRFLOW-XXX] Correct typos in UPDATING.md (#4242)
Started with "habe", "serever" and "certificiate" needing to be:
  "have", "server", and "certificate".
Ran a check, ignoring British and US accepted spellings.
Kept jargon. EG admin, aync, auth, backend, config, dag, s3, utils, etc.
Took exception to: "num of dag run" meaning "number of dag runs",
  "upness" is normally for quarks,
  "url" being lower-case, and
  sftp example having an excess file ending.
Python documentation writes "builtin" hyphenated, cases "PYTHONPATH".
Gave up on mixed use of "dag" and "DAG" as well as long line lengths.
2018-11-26 22:14:24 -08:00
Ash Berlin-Taylor b9fc03ea1a [AIRFLOW-2779] Add license headers to doc files (#4178)
This adds ASF license headers to all the .rst and .md files with the
exception of the Pull Request template (as that is included verbatim
when opening a Pull Request on Github which would be messy)
2018-11-13 15:01:44 +01:00
Ash Berlin-Taylor 4f4ed1d8af [AIRFLOW-XXX] Airflow 1.10.1 release notes in UPDATING.md
Simply adding a header in the right place, as through dumb luck we had
the changes in the right order.
2018-11-12 17:32:48 +00:00
bolkedebruin 2c4b0eab7d [AIRFLOW-3164] Verify server certificate when connecting to LDAP (#4006) 2018-11-09 13:58:34 +00:00
Kevin Yang 75e2288a3f [Airflow-2760] Decouple DAG parsing loop from scheduler loop (#3873) 2018-10-26 09:37:10 +01:00
Ash Berlin-Taylor 3e47e3ab52 [AIRFLOW-3197] EMRHook is missing new parameters of the AWS API (#4044)
Allow passing any params to the CreateJobFlow API, so that we don't have
to stay up to date with AWS api changes.
2018-10-13 08:25:57 +02:00
Greg Neiheisel 1582da88ee [AIRFLOW-3177] Change scheduler_heartbeat from gauge to counter (#4027)
This updates the scheduler_heartbeat metric from a gauge to a counter to
better support the statsd_exporter for usage with Prometheus. A counter
allows users to track the rate of the heartbeat, and integrates with the
exporter better. A crashing or down scheduler will no longer emit the
metric, but the statsd_exporter will continue to show a 1 for the metric
value. This fixes that issue because a counter will continually change,
and the lack of change indicates an issue with the scheduler.

Add statsd change notice in UPDATING.md
2018-10-12 07:51:13 +02:00
Joshua Carp fb5ffd146a [AIRFLOW-3103][AIRFLOW-3147] Update flask-appbuilder (#3937) 2018-10-04 08:20:24 +01:00
Kaxil Naik 33a1fe76c5
[AIRFLOW-XXX] Add Delete for CLI Example in UPDATING.md 2018-09-28 17:40:33 +01:00
Kaxil Naik 4c572a4b2d [AIRFLOW-3130] Add CLI docs for users command 2018-09-28 08:49:36 -07:00
Eric Chang 2656319165 [AIRFLOW-3111] Fix instructions in UPDATING.md and remove comment (#3944)
artifacts in default_airflow.cfg

- fixed incorrect instructions in UPDATING.md regarding core.log_filename_template and elasticsearch.elasticsearch_log_id_template
- removed comments referencing "additional curly braces" from
default_airflow.cfg since they're irrelevant to the rendered airflow.cfg
2018-09-25 09:26:41 -07:00
Ash Berlin-Taylor 748ad939f1 [AIRFLOW-XXX] Move Dag level access control out of 1.10 section (#3882)
It isn't in 1.10 (and wasn't in this section when the PR was created).
2018-09-11 10:57:48 -07:00
yrqls21 9b82fcb5fb [AIRFLOW-2156] Parallelize Celery Executor task state fetching (#3830) 2018-09-11 09:12:18 -07:00
Dan Davydov 7142ae0732 [AIRFLOW-2895] Prevent scheduler from spamming heartbeats/logs
Reverts most of AIRFLOW-2027 until the issues with
it can be fixed.

Closes #3747 from
aoen/revert_min_file_parsing_time_commit
2018-08-20 09:14:36 -04:00
bolkedebruin 48229328f2
[AIRFLOW-2888] Remove shell=True and bash from task launch (#3740)
shell=True is a security risk. Bash is not required to launch
tasks and will consume extra resources.
2018-08-15 07:13:02 +02:00
Ash Berlin-Taylor d9fecba14c [AIRFLOW-XXX] Updating instructions about logging changes in 1.10 (#3715)
We had a few other logging changes that weren't mentioned in here that
meant previous logs were not viewable anymore.
2018-08-07 22:34:56 +02:00
bolkedebruin c37fc0b6ba
[AIRFLOW-2817] Force explicit choice on GPL dependency (#3660)
By default one of Apache Airflow's dependencies pulls in a GPL
library. Airflow should not install (and upgrade) without an explicit choice.

This is part of the Apache requirements as we cannot depend on Category X
software.
2018-08-01 11:25:31 +02:00
Taylor D. Edmiston 6d31c9e416 [AIRFLOW-2832] Lint and resolve inconsistencies in Markdown files (#3670)
Clean up the Markdown files and make the formatting consistent
2018-08-01 09:50:23 +02:00
Jacob Hayes fcd51f362c [AIRFLOW-2716] Replace async and await py3.7 keywords
Closes #3578 from JacobHayes/py37-keywords
2018-07-29 11:56:41 +02:00
Tao feng f3f2eb323f [AIRFLOW-2267] Airflow DAG level access
Make sure you have checked _all_ steps below.

### JIRA
- [x] My PR addresses the following [Airflow JIRA]
(https://issues.apache.org/jira/browse/AIRFLOW/)
issues and references them in the PR title. For
example, "\[AIRFLOW-XXX\] My Airflow PR"
    -
https://issues.apache.org/jira/browse/AIRFLOW-2267
    - In case you are fixing a typo in the
documentation you can prepend your commit with
\[AIRFLOW-XXX\], code changes always need a JIRA
issue.

### Description
- [x] Here are some details about my PR, including
screenshots of any UI changes:
 Provide DAG level access for airflow.  The detail
design could be found at https://docs.google.com/d
ocument/d/1qs26lE9kAuCY0Qa0ga-80EQ7d7m4s-590lhjtMB
jmxw/edit#

### Tests
- [x] My PR adds the following unit tests __OR__
does not need testing for this extremely good
reason:
Unit tests are added.

### Commits
- [x] My commits all reference JIRA issues in
their subject lines, and I have squashed multiple
commits if they address the same issue. In
addition, my commits follow the guidelines from
"[How to write a good git commit
message](http://chris.beams.io/posts/git-
commit/)":
    1. Subject is separated from body by a blank line
    2. Subject is limited to 50 characters
    3. Subject does not end with a period
    4. Subject uses the imperative mood ("add", not
"adding")
    5. Body wraps at 72 characters
    6. Body explains "what" and "why", not "how"

- [x] Passes `git diff upstream/master -u --
"*.py" | flake8 --diff`

Closes #3197 from feng-tao/airflow-2267
2018-07-16 13:13:42 -07:00
Kevin Yang e9babff4eb [AIRFLOW-2463] Make task instance context available for hive queries
[AIRFLOW-2463] Make task instance context
available for hive queries

update UPDATING.md, please squash

Closes #3405 from yrqls21/kevin_yang_add_context
2018-07-11 10:28:06 +02:00
Eric Lo 985a433a8d [AIRFLOW-2696] Setting UTF-8 as default mime_charset mail
update UPDATING.md

Closes #3559 from lxneng/feature/utf8_mime_charset
2018-07-01 10:05:56 +02:00
Ash Berlin-Taylor a4592f91eb [AIRFLOW-1840] Support back-compat on old celery config
The new names are in-line with Celery 4, but if
anyone upgrades Airflow
without following the UPDATING.md instructions
(which we probably assume
most people won't, not until something stops
working) their workers
would suddenly just start failing. That's bad.

This will issue a warning but carry on working as
expected. We can
remove the deprecation settings (but leave the
code in config) after
this release has been made.

Closes #3549 from ashb/AIRFLOW-1840-back-compat
2018-06-27 22:07:31 +02:00
Hugo Prudente 6c3c8f4459 [AIRFLOW-2542][AIRFLOW-1790] Rename AWS Batch Operator queue to job_queue
- Improved the retries times to jobs below 60s
- Renamed property queue to job_queue to prevent
AWS Batch and CeleryExecutor queue conflict
- Added Breaking Chain note for the UPDATING.md
master
- Fixed operator infinit loop
- Added documentation warning about the Breaking
chain
- Fixed the commit parameter to keep it on Airflow
guidelines
- Fixed logging typo
- rebased with master

Changes to be committed:
	modified:   ../../../UPDATING.md
	modified:   awsbatch_operator.py
	modified:   ../../../tests/contrib/operators/test_
awsbatch_operator.py

Closes #3436 from hprudent/master
2018-06-19 10:00:47 +02:00
niels d62a037671 [AIRFLOW-2539][AIRFLOW-2359] Move remaing log config to configuration file
Closes #3435 from
NielsZeilemaker/env_logging_filename
2018-06-15 13:25:26 +02:00
Kaxil Naik b220fe60d5 [AIRFLOW-2513] Change `bql` to `sql` for BigQuery Hooks & Ops
- Change `bql` to `sql` for BigQuery Hooks &
Operators for consistency

Closes #3454 from kaxil/consistent-bq-lang
2018-06-04 10:04:03 +01:00
Tao feng 2a55ffe0cd [AIRFLOW-2086][AIRFLOW-2393] Customize default dagrun number in tree view
Closes #3279 from feng-tao/reduce-tree-view

This introduces a new configuration variable to set the default
number of dag runs displayed in the tree view. For large DAGs, this
could cause timeouts in the webserver.
2018-05-09 08:45:17 -07:00
Fokko Driesprong 97ab9e762c [AIRFLOW-1812] Update logging example
The logging has changed, therefore we should also
update the
updating.md guide

Closes #2784 from Fokko/AIRFLOW-1812-update-
logging-example
2018-05-03 22:58:44 -07:00
Philippus Baalman ad28decc74 [AIRFLOW-1933] Fix some typos
Closes #2474 from Philippus/patch-1
2018-04-29 20:08:48 -07:00
Tao feng 64d9501667 [AIRFLOW-74] SubdagOperators can consume all celeryd worker processes
Closes #3251 from feng-tao/airflow-74
2018-04-24 10:13:25 -07:00
DerekRoy 8e83e2b3ef [AIRFLOW-2350] Fix grammar in UPDATING.md
Closes #3248 from r39132/patch-1
2018-04-21 08:34:16 +02:00
Sathyaprakash Govindasamy a148043107 [AIRFLOW-2254] Put header as first row in unload
Currently, data is ordered by first column in
descending order
Header row comes as first only if the first column
is integer
This fix puts header as first row regardless of
first column data type

Closes #3180 from sathyaprakashg/AIRFLOW-2254
2018-04-16 10:21:22 +02:00
Dan Davydov 3c4f1fd9e6 [AIRFLOW-2027] Only trigger sleep in scheduler after all files have parsed
Closes #2986 from aoen/ddavydov--open_source_disab
le_unecessary_sleep_in_scheduler_loop
2018-04-09 10:22:11 +02:00
Taylor D. Edmiston 9bdcc4760a [AIRFLOW-2282] Fix grammar in UPDATING.md
Also remove trailing whitespace.
2018-04-04 17:28:53 -04:00
Tao feng bf86b89439 [AIRFLOW-2233] Update updating.md to include the info of hdfs_sensors renaming
Closes #3145 from feng-tao/airflow-2233
2018-03-31 11:16:46 +02:00
Joy Gao 05e1861e24 [AIRFLOW-1433][AIRFLOW-85] New Airflow Webserver UI with RBAC support
Closes #3015 from jgao54/rbac
2018-03-23 09:18:48 +01:00
Fokko Driesprong bb287ecf5b [AIRFLOW-2226] Rename google_cloud_storage_default to google_cloud_default
The Google cloud operators uses both
google_cloud_storage_default and
google_cloud_default as a default conn_id. This is
confusing and the
google_cloud_storage_default conn_id isnt
initialized by default in db.py
Therefore we rename the
google_cloud_storage_default to
google_cloud_default for simplicity and
convenience

Closes #3141 from Fokko/airflow-2226
2018-03-19 22:02:10 +01:00
Bolke de Bruin a1d5551777 [AIRFLOW-1895] Fix primary key integrity for mysql
sla_miss and task_instances cannot have NULL
execution_dates. The timezone
 migration scripts forgot to set this properly. In
addition to make sure
MySQL does not set "ON UPDATE CURRENT_TIMESTAMP"
or MariaDB "DEFAULT
0000-00-00 00:00:00" we now check if
explicit_defaults_for_timestamp is turned
on and otherwise fail an database upgrade.

Closes #2969, #2857

Closes #2979 from bolkedebruin/AIRFLOW-1895
2018-01-27 09:01:10 +01:00
fenglu-g cc9295fe37 [AIRFLOW-1953] Add labels to dataflow operators
Closes #2913 from fenglu-g/master
2018-01-03 11:16:39 -08:00
Joy Gao c0dffb57c2 [AIRFLOW-1821] Enhance default logging config by removing extra loggers
Closes #2793 from jgao54/logging-enhancement
2017-12-22 14:07:29 +01:00
Fokko Driesprong 30076f1e45 [AIRFLOW-1840] Make celery configuration congruent with Celery 4
Explicitly set the celery backend from the config
and align the config
with the celery config as this might be confusing.

Closes #2806 from Fokko/AIRFLOW-1840-Fix-celery-
config
2017-12-11 18:56:29 +01:00
Ash Berlin-Taylor 98df0d6e3b [AIRFLOW-1795] Correctly call S3Hook after migration to boto3
In the migration of S3Hook to boto3 the connection
ID parameter changed
to `aws_conn_id`. This fixes the uses of
`s3_conn_id` in the code base
and adds a note to UPDATING.md about the change.

In correcting the tests for S3ToHiveTransfer I
noticed that
S3Hook.get_key was returning a dictionary, rather
then the S3.Object as
mentioned in it's doc string. The important thing
that was missing was
ability to get the key name from the return a call
to get_wildcard_key.

Closes #2795 from
ashb/AIRFLOW-1795-s3hook_boto3_fixes
2017-11-18 14:07:38 +01:00
Fokko Driesprong 635ab01a76 [AIRFLOW-1731] Set pythonpath for logging
Before initializing the logging framework, we want
to set the python
path so the logging config can be found.

Closes #2721 from Fokko/AIRFLOW-1731-import-
pythonpath
2017-10-27 16:02:56 +02:00
Dan Davydov 21e94c7d15 [AIRFLOW-1697] Mode to disable charts endpoint 2017-10-10 11:33:50 -07:00
Chris Riccomini ebe715c565 [AIRFLOW-1691] Add better Google cloud logging documentation
Closes #2671 from criccomini/fix-log-docs
2017-10-09 10:32:34 -07:00
Crystal Qian dd861f8cd0 [AIRFLOW-1323] Made Dataproc operator parameter names consistent
Closes #2636 from cjqian/1323
2017-10-03 11:15:27 +02:00
Fokko Driesprong 3c3a65a3fe [AIRFLOW-1611] Customize logging
Change the configuration of the logging to make
use of the python
logging and make the configuration easy
configurable. Some of the
settings which are now not needed anymore since
they can easily
be implemented in the config file.

Closes #2631 from Fokko/AIRFLOW-1611-customize-
logging-in-airflow
2017-10-02 17:14:01 +02:00
Fokko Driesprong a7a518902d [AIRFLOW-1582] Improve logging within Airflow
Clean the way of logging within Airflow. Remove
the old logging.py and
move to the airflow.utils.log.* interface. Remove
setting the logging
outside of the settings/configuration code. Move
away from the string
format to logging_function(msg, *args).

Closes #2592 from Fokko/AIRFLOW-1582-Improve-
logging-structure
2017-09-13 09:36:58 +02:00
Dan Davydov 4cf904cf5a [AIRFLOW-855] Replace PickleType with LargeBinary in XCom
PickleType in Xcom allows remote code execution.
In order to deprecate
it without changing mysql table schema, change
PickleType to LargeBinary
 because they both maps to blob type in mysql. Add
"enable_pickling" to
function signature to control using ether pickle
type or JSON. "enable_pickling"
 should also be added to core section of
airflow.cfg

Picked up where https://github.com/apache
/incubator-airflow/pull/2132 left off. Took this
PR, fixed merge conflicts, added
documentation/tests, fixed broken tests/operators,
and fixed the python3 issues.

Closes #2518 from aoen/disable-pickle-type
2017-08-15 12:24:07 -07:00
AllisonWang 6825d97b82 [AIRFLOW-1443] Update Airflow configuration documentation
This PR updates Airflow configuration
documentations to include a recent change to split
task logs by try number #2383.

Closes #2467 from AllisonWang/allison--update-doc
2017-08-09 14:49:56 -07:00
Bolke de Bruin 3927723263 Fix new SSH documentation 2017-07-20 22:12:31 +02:00
Jay fe0edeaab5 [AIRFLOW-756][AIRFLOW-751] Replace ssh hook, operator & sftp operator with paramiko based
Closes #1999 from jhsenjaliya/AIRFLOW-756
2017-07-20 22:07:45 +02:00
Younghee Kwon c450b60878 [AIRFLOW-1338][AIRFLOW-782] Add GCP dataflow hook runner change to UPDATING.md
Closes #2326 from yk5/df-python
2017-06-23 15:07:45 -07:00
Chris Riccomini cb336464cc [AIRFLOW-XXX] Updating CHANGELOG, README, and UPDATING after 1.8.1 release 2017-05-09 13:20:31 -07:00
Jeremiah Lowin 4da3611c46 [AIRFLOW-886] Pass result to post_execute() hook
The post_execute() hook should receive
the Operator result in addition to the
execution context.
2017-02-18 18:38:58 -05:00
Jeremiah Lowin 6e22102782 [AIRFLOW-862] Add DaskExecutor
Adds a DaskExecutor for running Airflow tasks
in Dask clusters.

Closes #2067 from jlowin/dask-executor
2017-02-12 16:06:31 -05:00
Bolke de Bruin b56e642247 Add known issue of 'num_runs' 2017-02-10 14:54:46 +01:00
Bolke de Bruin e63cb1fced Add pool upgrade issue description 2017-02-09 16:10:17 +01:00
Bolke de Bruin c64832718b [AIRFLOW-789] Update UPDATING.md
Closes #2011 from bolkedebruin/AIRFLOW-789
2017-02-01 15:52:50 +00:00
Alex Van Boxel 7e691d3f60 Update upgrade documentation for Google Cloud
Closes #1979 from alexvanboxel/pr/doc_gcloud
2017-01-10 09:03:44 +01:00
Jeremiah Lowin 9a61a5bd58 [AIRFLOW-31][AIRFLOW-200] Add note to updating.md
AIRFLOW-31 and AIRFLOW-200 deprecated the old important mechanism and should be noted in UPDATING.md

Closes #1643 from jlowin/patch-1
2016-07-06 10:41:46 +02:00
Rob Froetscher 8d501b0cea [AIRFLOW-171] Add upgrade notes on email and S3 to 1.7.1.2
Closes #1587 from rfroetscher/upgrading_readme
2016-06-14 12:27:58 +02:00
Bolke de Bruin bd414161da Use os.execvp instead of subprocess.Popen for the webserver
subprocess.Popen forks before doing execv. This makes it difficult
for some manager daemons (like supervisord) to send kill signals.
This patch uses os.execve directly. os.execve takes over the current
process and thus responds correctly to signals

* Resolves residue in ISSUE-852
2016-04-21 16:23:11 +02:00
Bolke de Bruin e8c1144bb8 Add consistent and thorough signal handling and logging
Airflow spawns childs in the form of a webserver, scheduler, and executors.
If the parent gets terminated (SIGTERM) it needs to properly propagate the
signals to the childs otherwise these will get orphaned and end up as
zombie processes. This patch resolves that issue.

In addition Airflow does not store the PID of its services so they can be
managed by traditional unix systems services like rc.d / upstart / systemd
and the likes. This patch adds the "--pid" flag. By default it stores the
PID in ~/airflow/airflow-<service>.pid

Lastly, the patch adds support for different log file locations: log,
stdout, and stderr (respectively: --log-file, --stdout, --stderr). By
default these are stored in ~/airflow/airflow-<service>.log/out/err.

* Resolves ISSUE-852
2016-04-06 20:40:43 +02:00
Jeremiah Lowin 10ee622330 Deprecate *args and **kwargs in BaseOperator
BaseOperator silently accepts any arguments. This deprecates the
behavior with a warning that says it will be forbidden in Airflow 2.0.

This PR also turns on DeprecationWarnings by default, which in turn
revealed that inspect.getargspec is deprecated. Here it is replaced by
`inspect.signature` (Python 3) or `funcsigs.signature` (Python 2).

Lastly, this brought to attention that example_http_operator was
passing an illegal argument.
2016-04-05 10:04:55 +02:00
Bence Nagy e1fd48b2ec Set dags_are_paused_at_creation's default value to True 2016-03-31 10:53:43 +02:00