Граф коммитов

264 Коммитов

Автор SHA1 Сообщение Дата
Moe Nadal 667a26ce49 [AIRFLOW-1551] Add operator to trigger Jenkins job
Closes #2553 from moe-nadal-ck/AIRFLOW-1551/AddJenkinsOperator
2018-02-27 11:51:49 +01:00
Marcos Bernardelli beadcd327c [AIRFLOW-2125] Using binary package psycopg2-binary
Closes #3055 from bern4rdelli/AIRFLOW-2125
2018-02-24 16:16:00 +01:00
Bolke de Bruin 772dbae298 [AIRFLOW-1927] Convert naive datetimes for TaskInstances
TaskInstances are sometimes instantiated outside
core
Airflow with naive datetimes. In case this happens
we
now default to using the time zone of the DAG if
that
is available or the default system time zone.

Closes #2946 from bolkedebruin/AIRFLOW-1927
2018-02-06 17:26:08 +01:00
Fokko Driesprong e76cda0ff5 [AIRFLOW-2038] Add missing kubernetes dependency for dev
When doing initdb, it fails on the kubernetes
dependency from the
examples

Closes #2978 from Fokko/fd-fix-dependencies
2018-02-05 20:52:28 +01:00
Bolke de Bruin 1e36b37b68 [AIRFLOW-1755] Allow mount below root
This enables Airflow and Celery Flower to live
below root. Draws on the work of Geatan Semet
(@Stibbons).

This closes #2723 and closes #2818

Closes #2952 from bolkedebruin/AIRFLOW-1755
2018-01-19 18:54:26 +01:00
Bolke de Bruin 88130a5d7e [AIRFLOW-2003] Use flask-caching instead of flask-cache
Flask-cache has been unmaintained for over three
years,
flask-caching is the community supported version.

Closes #2944 from bolkedebruin/AIRFLOW-2003
2018-01-15 21:12:03 +01:00
Bolke de Bruin 1abe7f6d54 Merge pull request #2853 from dimberman/Airflow_1517_kubenetes_operator 2018-01-12 19:02:52 +01:00
fenglu-g cc9295fe37 [AIRFLOW-1953] Add labels to dataflow operators
Closes #2913 from fenglu-g/master
2018-01-03 11:16:39 -08:00
Fokko Driesprong b9f4a7437e [AIRFLOW-1967] Update Celery to 4.0.2
Update Celery to 4.0.2 for fixing error
TypeError: '<=' not supported between instances of
'NoneType' and 'int'

Hi all,

I'd like to update Celery to version 4.0.2. While
updating my Docker container to version 1.9, I
caught this error:
```
worker_1     | [2018-01-03 10:34:29,934:
CRITICAL/MainProcess] Unrecoverable error:
TypeError("'<=' not supported between instances of
'NoneType' and 'int'",)
worker_1     | Traceback (most recent call last):
worker_1     |   File "/usr/local/lib/python3.6
/site-packages/celery/worker/worker.py", line 203,
in start
worker_1     |     self.blueprint.start(self)
worker_1     |   File "/usr/local/lib/python3.6
/site-packages/celery/bootsteps.py", line 115, in
start
worker_1     |     self.on_start()
worker_1     |   File "/usr/local/lib/python3.6
/site-packages/celery/apps/worker.py", line 143,
in on_start
worker_1     |     self.emit_banner()
worker_1     |   File "/usr/local/lib/python3.6
/site-packages/celery/apps/worker.py", line 159,
in emit_banner
worker_1     |
string(self.colored.reset(self.extra_info() or
'')),
worker_1     |   File "/usr/local/lib/python3.6
/site-packages/celery/apps/worker.py", line 188,
in extra_info
worker_1     |     if self.loglevel <=
logging.INFO:
worker_1     | TypeError: '<=' not supported
between instances of 'NoneType' and 'int'
```

This is because I've been running Python 2 in my
local environments, and the Docker image is Python
3:
https://github.com/puckel/docker-
airflow/pull/143/files

This is the issue in Celery:
https://github.com/celery/celery/blob/0dde9df9d8dd
5dbbb97ef75a81757bc2d9a4b33e/Changelog#L145

Make sure you have checked _all_ steps below.

### JIRA
- [x] My PR addresses the following [Airflow JIRA]
(https://issues.apache.org/jira/browse/AIRFLOW/)
issues and references them in the PR title. For
example, "[AIRFLOW-XXX] My Airflow PR"
    - https://issues.apache.org/jira/browse/AIRFLOW-
XXX

### Description
- [x] Here are some details about my PR, including
screenshots of any UI changes:

### Tests
- [x] My PR adds the following unit tests __OR__
does not need testing for this extremely good
reason:

### Commits
- [x] My commits all reference JIRA issues in
their subject lines, and I have squashed multiple
commits if they address the same issue. In
addition, my commits follow the guidelines from
"[How to write a good git commit
message](http://chris.beams.io/posts/git-
commit/)":
    1. Subject is separated from body by a blank line
    2. Subject is limited to 50 characters
    3. Subject does not end with a period
    4. Subject uses the imperative mood ("add", not
"adding")
    5. Body wraps at 72 characters
    6. Body explains "what" and "why", not "how"

- [x] Passes `git diff upstream/master -u --
"*.py" | flake8 --diff`

Closes #2914 from Fokko/AIRFLOW-1967-update-celery
2018-01-03 09:20:00 -08:00
Bolke de Bruin 5e4d7d8d7d [AIRFLOW-XXX] Pin sqlalchemy dependency 2017-12-29 12:28:51 +01:00
Daniel Imberman 78ff2fc180 [AIRFLOW-1517] Kubernetes Operator 2017-12-26 08:45:31 -08:00
William Pursell 355135bcb8 [AIRFLOW-1915] Relax flask-wtf dependency specification
Closes #2876 from wrp/flask-wtf
2017-12-22 13:57:47 +01:00
William Pursell a5ca8cdce9 [AIRFLOW-1938] Clean up unused exception
There is no longer the possibility of a
GitCommandError (since cc4404b5f7)

Closes #2898 from wrp/setup
2017-12-22 13:49:37 +01:00
Chris Riccomini cc4404b5f7 [AIRFLOW-1938] Remove tag version check in setup.py
Closes #2889 from criccomini/AIRFLOW-1938
2017-12-19 12:29:39 -08:00
Kamil Chmielewski 2dbd81fa81 [AIRFLOW-1896] FIX bleach <> html5lib incompatibility
Running airflow with bleach 2.0.0 can cause:
`ImportError: No module named base`
https://github.com/mozilla/bleach/issues/267

This was resolved in https://github.com/mozilla/bl
each/releases/tag/v2.1.2

Closes #2858 from kamilchm/patch-1
2017-12-09 09:34:34 +01:00
Bolke de Bruin 518a41acf3 [AIRFLOW-1826] Update views to use timezone aware objects 2017-11-27 15:54:27 +01:00
Bolke de Bruin 2f168634aa [AIRFLOW-1807] Force use of time zone aware db fields
This change will check if all date times being stored are
indeed timezone aware.
2017-11-27 15:54:27 +01:00
Bolke de Bruin c857436b75 [AIRFLOW-1808] Convert all utcnow() to time zone aware
datetime.utcnow() does not set time zone information.
2017-11-27 15:54:20 +01:00
Bolke de Bruin a47255fb2d [AIRFLOW-1804] Add time zone configuration options
Time zone defaults to UTC as is the default now in order
to maintain backwards compatibility.
2017-11-27 15:53:03 +01:00
Cedrik Neumann 5157b5a763 [AIRFLOW-1817] use boto3 for s3 dependency
Since S3Hook is reimplemented based on the AwsHook
using boto3, its package dependencies need to be
updated as well.

Closes #2790 from m1racoli/fix-setup-s3
2017-11-15 11:55:13 +01:00
Brian Charous cbb00d4055 [AIRFLOW-1102] Upgrade Gunicorn >=19.4.0
Closes #2775 from briancharous/upgrade-gunicorn
2017-11-10 08:49:41 +01:00
William Pursell 9425d359b0 [AIRFLOW-646] Add docutils to setup_requires
python-daemon declares its docutils dependency in a setup_requires
clause, and 'python setup.py install' fails since it misses
that dependency.

Closes #2765 from wrp/docutils
2017-11-09 09:15:12 -08:00
Stefanie Grunwald a61d9444cd
[AIRFLOW-1669] Fix Docker and pin Moto to 1.1.19
https://github.com/spulec/moto/pull/1048 introduced `docker` as a
dependency in Moto, causing a conflict as Airflow uses `docker-py`. As
both packages don't work together, Moto is pinned to the version
prior to that change.
2017-11-02 14:23:32 +01:00
Sumit Maheshwari c5776375fd [AIRFLOW-1315] Add Qubole File & Partition Sensors
Closes #2401 from msumit/AIRFLOW-1315
2017-10-31 19:32:07 +05:30
r-richmond 21257e8f06 [AIRFLOW-926] Fix JDBC Hook
JayDeBeApi made a backwards incompatible change
This updates the JDBC Hook's implementation
and changes the required JayDeBeApi to >= 1.1.1

Closes #2651 from r-richmond/AIRFLOW-926
2017-10-22 20:04:30 +02:00
fenglu-g 7cb818bbac [AIRFLOW-1723] Support sendgrid in email backend
Closes #2695 from fenglu-g/master
2017-10-18 12:27:14 -07:00
Bolke de Bruin 65f3b468a2 [AIRFLOW-1527] Refactor celery config
The celery config is currently part of the celery executor definition.
This is really inflexible for users wanting to change it. In addition
Celery 4 is moving to lowercase.

Closes #2542 from bolkedebruin/upgrade_celery
2017-09-25 11:19:16 -07:00
Bolke de Bruin fa1dc1eb20 Revert "[AIRFLOW-1368] Automatically remove Docker container on exit"
This reverts commit 46c86a5cd2.
2017-09-24 19:35:28 +02:00
Nathaniel Varona 46c86a5cd2 [AIRFLOW-1368] Automatically remove Docker container on exit
Closes #2411 from nathanielvarona/docker-operator
2017-09-22 10:15:23 -07:00
Ash Berlin-Taylor a6b23a36e0 [AIRFLOW-1594] Don't install test packages into python root.[]
By default `find_packages()` will find _any_ valid
python package,
including things under tests. We don't want to
install the tests
packages into the python path, so exclude those.

Closes #2597 from ashb/AIRFLOW-1594-dont-install-
tests
2017-09-13 10:11:39 +02:00
Fokko Driesprong a7a518902d [AIRFLOW-1582] Improve logging within Airflow
Clean the way of logging within Airflow. Remove
the old logging.py and
move to the airflow.utils.log.* interface. Remove
setting the logging
outside of the settings/configuration code. Move
away from the string
format to logging_function(msg, *args).

Closes #2592 from Fokko/AIRFLOW-1582-Improve-
logging-structure
2017-09-13 09:36:58 +02:00
Dan Fuller aa95f25796 [AIRFLOW-1573] Remove `thrift < 0.10.0` requirement
Closes #2574 from dan-disqus/Thrift
2017-09-11 13:14:47 +02:00
Fokko Driesprong de99aa20f4 [AIRFLOW-1324] Generalize Druid operator and hook
Make the druid operator and hook more specific.
This allows us to
have a more flexible configuration, for example
ingest parquet.
Also get rid of the PyDruid extension since it is
more focussed on
querying druid, rather than ingesting data. Just
requests is
sufficient to submit an indexing job. Add a test
to the hive_to_druid
operator to make sure it behaves as we expect.
Furthermore cleaned
up the docstring a bit

Closes #2378 from Fokko/AIRFLOW-1324-make-more-
general-druid-hook-and-operator
2017-08-18 21:34:03 +02:00
Jay fe0edeaab5 [AIRFLOW-756][AIRFLOW-751] Replace ssh hook, operator & sftp operator with paramiko based
Closes #1999 from jhsenjaliya/AIRFLOW-756
2017-07-20 22:07:45 +02:00
Feng Lu cf2605d3e5 [AIRFLOW-1338] Fix incompatible GCP dataflow hook
Closes #2388 from fenglu-g/master
2017-06-23 13:26:16 -07:00
Kengo Seki 0f55477ccb [AIRFLOW-1172] Support nth weekday of the month cron expression
Closes #2321 from sekikn/AIRFLOW-1172
2017-06-14 17:59:02 -07:00
Sumit Maheshwari 6be02475f8 [AIRFLOW-1192] Some enhancements to qubole_operator
1. Upgrade qds_sdk version to latest
2. Add support to run Zeppelin Notebooks
3. Move out initialization of QuboleHook from
init()

Closes #2322 from msumit/AIRFLOW-1192
2017-06-07 09:09:50 +02:00
Stanislav Kudriashev d2d3e49ca0 [AIRFLOW-1201] Update deprecated 'nose-parameterized'
The 'parameterized' package should be used now,

Closes #2298 from skudriashev/airflow-1201
2017-05-16 11:34:52 +02:00
Kengo Seki ae61987945 [AIRFLOW-1180] Fix flask-wtf version for test_csrf_rejection
For now, SecurityTests.test_csrf_rejection fails
because flask-wtf version specified in setup.py is
too old.
This PR fixes it.

Closes #2280 from sekikn/AIRFLOW-1180
2017-05-13 21:03:51 +02:00
Niels Zeilemaker ac9ccb1518 [AIRFLOW-1179] Fix Pandas 0.2x breaking Google BigQuery change
Closes #2279 from NielsZeilemaker/AIRFLOW-1179
2017-05-09 09:42:32 -07:00
Richard Lee f5bfda0d64 [AIRFLOW-945][AIRFLOW-941] Remove psycopg2 connection workaround
Closes #2272 from dlackty/AIRFLOW-945
2017-05-04 21:34:38 +02:00
Bolke de Bruin 4fb05d8cc7 [AIRFLOW-1000] Rebrand distribution to Apache Airflow
Per Apache requirements Airflow should be branded
Apache Airflow.
It is impossible to provide a forward compatible
automatic update
path and users will be required to manually
upgrade.

Closes #2172 from bolkedebruin/AIRFLOW-1000
2017-04-17 10:09:47 +02:00
Andrew Chen 53ca508456 [AIRFLOW-1028] Databricks Operator for Airflow
Add DatabricksSubmitRun Operator

In this PR, we contribute a DatabricksSubmitRun operator and a
Databricks hook. This operator enables easy integration of Airflow
with Databricks. In addition to the operator, we have created a
databricks_default connection, an example_dag using this
DatabricksSubmitRunOperator, and matching documentation.

Closes #2202 from andrewmchen/databricks-operator-
squashed
2017-04-06 08:30:33 -07:00
Henk Griffioen f1bc5f38ac [AIRFLOW-1065] Add functionality for Azure Blob Storage over wasb://
This PR implements a hook to interface with Azure
storage over wasb://
via azure-storage; adds sensors to check for blobs
or prefixes; and
adds an operator to transfer a local file to the
Blob Storage.

Design is similar to that of the S3Hook in
airflow.operators.S3_hook.

Closes #2216 from hgrif/AIRFLOW-1065
2017-04-05 09:56:23 +02:00
Alex Guziel fe9ebe3ccf [AIRFLOW-1047] Sanitize strings passed to Markup
We add the Apache-licensed bleach library and use
it to sanitize html
passed to Markup (which is supposed to be already
escaped). This avoids
some XSS issues with unsanitized user input being
displayed.

Closes #2193 from saguziel/aguziel-xss
2017-03-28 16:40:32 -07:00
MSempere 8de8501626 [AIRFLOW-999] Add support for Redis database
This PR includes a redis_hook and a redis_key_sensor to enable
checking for key existence in redis. It also updates the
documentation and add the relevant unit tests.

- [x] Opened a PR on Github

- [x] My PR addresses the following Airflow JIRA
issues:
    -
https://issues.apache.org/jira/browse/AIRFLOW-999
- [x] The PR title references the JIRA issues. For
example, "[AIRFLOW-1] My Airflow PR"

- [x] My PR adds unit tests
- [ ] __OR__ my PR does not need testing for this
extremely good reason:

- [x] Here are some details about my PR:
- [ ] Here are screenshots of any UI changes, if
appropriate:

- [x] Each commit subject references a JIRA issue.
For example, "[AIRFLOW-1] Add new feature"
- [x] Multiple commits addressing the same JIRA
issue have been squashed
- [x] My commits follow the guidelines from "[How
to write a good git commit
message](http://chris.beams.io/posts/git-
commit/)":
  1. Subject is separated from body by a blank line
  2. Subject is limited to 50 characters
  3. Subject does not end with a period
  4. Subject uses the imperative mood ("add", not
"adding")
  5. Body wraps at 72 characters
  6. Body explains "what" and "why", not "how"

Closes #2165 from msempere/AIRFLOW-999/support-
for-redis-database
2017-03-20 11:11:31 -07:00
Sean Cronin f3924696ff [AIRFLOW-954] Fix configparser ImportError
Fixes support for Python 2.7 since
https://github.com/apache/incubator-airflow/pull/2091 was merged
2017-03-07 20:52:48 -05:00
Bolke de Bruin 4f52db317f [AIRFLOW-911] Add coloring and timing to tests
Closes #2106 from bolkedebruin/profile_tests
2017-02-25 22:10:14 +01:00
Bolke de Bruin 784b3638c5 [AIRFLOW-895] Address Apache release incompliancies
* Fixes missing licenses in NOTICE
* Corrects license header
* Removes HighCharts left overs.

Closes #2098 from bolkedebruin/AIRFLOW-895
2017-02-23 23:48:03 +01:00
Jeremiah Lowin 50902d0736 [AIRFLOW-887] Support future v0.16 2017-02-18 18:39:01 -05:00