* [AIRFLOW-3752] Add/remove user from role via the CLI
Update the `users` subcommand to enable two new actions:
- `--add-role`: Make the user a member of the given role
- `--remove-role`: Remove the user's membership in the given role
For installations that use an external identity provider (e.g., Google
OAuth) the username is typically a long ID string. For the sake of
convenience, we allow the CLI operator to reference the target user
via either their `username` or their `email` (but not both).
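As an illustration, invocations might look like the following (flag spellings other than `--add-role`/`--remove-role` are assumptions, not taken from this change):

```
airflow users --add-role --username jdoe --role Admin
airflow users --remove-role --email jdoe@example.com --role Admin
```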
* Update argparse spec
Accidentally left off this update to the argparse spec in the last
commit.
* Add unit tests
* Fix lint failures
Local users were always superusers; this adds a column to the DB (defaulting to false,
which is going to cause a bit of upgrade pain for people, but defaulting to not being an
admin is the only secure default.)
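A minimal sketch of what such a column could look like on the user model (the attribute name is an assumption, not taken from this change):

```python
from sqlalchemy import Boolean, Column

# Defaults to False: existing local users must be promoted explicitly after upgrade.
superuser = Column(Boolean(), nullable=False, default=False)
```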
Started with "habe", "serever" and "certificiate" needing to be:
"have", "server", and "certificate".
Ran a check, ignoring British and US accepted spellings.
Kept jargon, e.g. admin, async, auth, backend, config, dag, s3, utils, etc.
Took exception to: "num of dag run" meaning "number of dag runs",
"upness" normally being a property of quarks,
"url" being lower-case, and
the sftp example having an excess file ending.
Python documentation hyphenates "built-in" and upper-cases "PYTHONPATH".
Gave up on mixed use of "dag" and "DAG" as well as long line lengths.
This adds ASF license headers to all the .rst and .md files, with the
exception of the Pull Request template (as that is included verbatim
when opening a Pull Request on GitHub, which would be messy).
This updates the scheduler_heartbeat metric from a gauge to a counter to
better support the statsd_exporter for use with Prometheus. A counter
allows users to track the rate of the heartbeat and integrates better
with the exporter. With a gauge, a crashed or stopped scheduler no longer
emits the metric, yet the statsd_exporter continues to report 1 as its
value. A counter fixes that issue: it changes continually while the
scheduler is healthy, so a lack of change indicates a problem with the
scheduler.
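A minimal sketch of the difference using the `statsd` Python client (the metric name is from this change; the client setup is illustrative):

```python
from statsd import StatsClient

statsd = StatsClient(host="localhost", port=8125, prefix="airflow")

# Before: a gauge, which statsd_exporter keeps reporting at its last
# value even after the scheduler dies.
statsd.gauge("scheduler_heartbeat", 1)

# After: a counter, so a healthy scheduler produces a steadily increasing
# value and a flat rate signals a problem.
statsd.incr("scheduler_heartbeat", 1)
```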
Add statsd change notice in UPDATING.md
Fix artifacts in default_airflow.cfg
- fixed incorrect instructions in UPDATING.md regarding core.log_filename_template and elasticsearch.elasticsearch_log_id_template
- removed comments referencing "additional curly braces" from
default_airflow.cfg since they're irrelevant to the rendered airflow.cfg
By default one of Apache Airflow's dependencies pulls in a GPL
library. Airflow should not install (or upgrade) without an explicit choice.
This is part of the Apache requirements, as we cannot depend on Category X
software.
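For illustration, the opt-in at install time could look like the following (the environment variable names here are an assumption, not taken from this commit):

```
# choose the non-GPL unidecode implementation
SLUGIFY_USES_TEXT_UNIDECODE=yes pip install apache-airflow

# or explicitly accept the GPL dependency
AIRFLOW_GPL_UNIDECODE=yes pip install apache-airflow
```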
Make sure you have checked _all_ steps below.
### JIRA
- [x] My PR addresses the following [Airflow JIRA](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
  - https://issues.apache.org/jira/browse/AIRFLOW-2267
- In case you are fixing a typo in the documentation you can prepend your commit with \[AIRFLOW-XXX\]; code changes always need a JIRA issue.
### Description
- [x] Here are some details about my PR, including
screenshots of any UI changes:
Provide DAG-level access for Airflow. The detailed design can be found at https://docs.google.com/document/d/1qs26lE9kAuCY0Qa0ga-80EQ7d7m4s-590lhjtMBjmxw/edit#
### Tests
- [x] My PR adds the following unit tests __OR__
does not need testing for this extremely good
reason:
Unit tests are added.
### Commits
- [x] My commits all reference JIRA issues in their subject lines, and I have squashed multiple commits if they address the same issue. In addition, my commits follow the guidelines from "[How to write a good git commit message](http://chris.beams.io/posts/git-commit/)":
1. Subject is separated from body by a blank line
2. Subject is limited to 50 characters
3. Subject does not end with a period
4. Subject uses the imperative mood ("add", not
"adding")
5. Body wraps at 72 characters
6. Body explains "what" and "why", not "how"
- [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
Closes#3197 from feng-tao/airflow-2267
The new names are in line with Celery 4, but if anyone upgrades Airflow
without following the UPDATING.md instructions (which we can probably
assume most people won't do until something stops working) their workers
would suddenly just start failing. That's bad.
This will issue a warning but carry on working as expected. We can
remove the deprecation settings (but leave the code in config) after
this release has been made.
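A rough sketch of the kind of back-compat shim described (the option names and helper are illustrative, not the actual Airflow code):

```python
import warnings

# Hypothetical mapping from Celery 4 option names to their deprecated
# Celery 3 spellings.
DEPRECATED_ALIASES = {"worker_concurrency": "celeryd_concurrency"}

def get_option(config, section, key):
    """Resolve a config option, honouring deprecated spellings with a warning."""
    old_key = DEPRECATED_ALIASES.get(key)
    if old_key and config.has_option(section, old_key):
        warnings.warn(
            "Option [{}] {} is deprecated; use {} instead".format(section, old_key, key),
            DeprecationWarning,
        )
        return config.get(section, old_key)
    return config.get(section, key)
```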
Closes#3549 from ashb/AIRFLOW-1840-back-compat
- Improved the retry timing for jobs below 60s
- Renamed the `queue` property to `job_queue` to prevent a conflict between the AWS Batch queue and the CeleryExecutor queue (see the sketch below)
- Added a breaking-change note to UPDATING.md on master
- Fixed an infinite loop in the operator
- Added a documentation warning about the breaking change
- Fixed the commit to keep it in line with Airflow guidelines
- Fixed a logging typo
- Rebased onto master
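A hedged illustration of the rename's effect on operator usage (parameter names other than `job_queue` are assumptions):

```python
from airflow.contrib.operators.awsbatch_operator import AWSBatchOperator

task = AWSBatchOperator(
    task_id="batch_job",
    job_name="my-job",
    job_definition="my-job-def",
    job_queue="my-batch-queue",  # was `queue`, which clashed with the Celery arg
    queue="celery-queue",        # BaseOperator's CeleryExecutor queue stays usable
)
```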
Closes#3436 from hprudent/master
Closes#3279 from feng-tao/reduce-tree-view
This introduces a new configuration variable to set the default
number of dag runs displayed in the tree view. For large DAGs,
rendering every dag run could otherwise cause timeouts in the webserver.
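For illustration, the knob would be set in airflow.cfg roughly like this (the option name here is an assumption, not taken from this change):

```
[webserver]
default_dag_run_display_number = 25
```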
Currently, data is ordered by the first column in descending order,
and the header row comes first only if the first column is an integer.
This fix puts the header as the first row regardless of the first
column's data type.
Closes#3180 from sathyaprakashg/AIRFLOW-2254
The Google Cloud operators use both google_cloud_storage_default and
google_cloud_default as a default conn_id. This is confusing, and the
google_cloud_storage_default conn_id isn't initialized by default in db.py.
Therefore we rename google_cloud_storage_default to google_cloud_default
for simplicity and convenience.
Closes#3141 from Fokko/airflow-2226
sla_miss and task_instances cannot have NULL execution_dates. The
timezone migration scripts forgot to set this properly. In addition,
to make sure MySQL does not set "ON UPDATE CURRENT_TIMESTAMP" or
MariaDB "DEFAULT 0000-00-00 00:00:00", we now check whether
explicit_defaults_for_timestamp is turned on and otherwise fail the
database upgrade.
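A minimal sketch of such a guard using SQLAlchemy (illustrative, not the actual migration code; the DSN is hypothetical):

```python
from sqlalchemy import create_engine

engine = create_engine("mysql://user:pass@localhost/airflow")  # hypothetical DSN
with engine.connect() as conn:
    value = conn.execute("SELECT @@explicit_defaults_for_timestamp").scalar()
    if not value:
        raise Exception(
            "Global variable explicit_defaults_for_timestamp needs to be "
            "on (1) for MySQL; aborting the database upgrade."
        )
```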
Closes#2969, #2857. Closes#2979 from bolkedebruin/AIRFLOW-1895
Explicitly set the celery backend from the config and align the config
with the celery config, as the mismatch might be confusing.
Closes#2806 from Fokko/AIRFLOW-1840-Fix-celery-config
In the migration of S3Hook to boto3 the connection ID parameter changed
to `aws_conn_id`. This fixes the uses of `s3_conn_id` in the code base
and adds a note to UPDATING.md about the change.
In correcting the tests for S3ToHiveTransfer I noticed that
S3Hook.get_key was returning a dictionary, rather than the S3.Object
mentioned in its docstring. The important thing that was missing was
the ability to get the key name from the return value of a call
to get_wildcard_key.
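A hedged sketch of the fixed behaviour (the import path and signatures reflect the 1.x contrib layout and may differ by release; bucket and key names are hypothetical):

```python
from airflow.hooks.S3_hook import S3Hook

hook = S3Hook(aws_conn_id="aws_default")  # `s3_conn_id` is the old, removed name

# get_key now returns a boto3 S3.Object rather than a dict...
obj = hook.get_key("data/file.csv", bucket_name="my-bucket")

# ...so the matched key name is available from a wildcard lookup.
match = hook.get_wildcard_key("data/*.csv", bucket_name="my-bucket")
print(match.key)
```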
Closes#2795 from ashb/AIRFLOW-1795-s3hook_boto3_fixes
Before initializing the logging framework, we want to set the Python
path so the logging config can be found.
Closes#2721 from Fokko/AIRFLOW-1731-import-pythonpath
Change the configuration of the logging to make use of the Python
logging module and make it easily configurable. Some settings are no
longer needed since they can easily be implemented in the config file.
Closes#2631 from Fokko/AIRFLOW-1611-customize-logging-in-airflow
Clean up the way of logging within Airflow. Remove the old logging.py
and move to the airflow.utils.log.* interface. Remove setting up
logging outside of the settings/configuration code. Move away from
string formatting to logging_function(msg, *args).
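The last point refers to lazy %-style interpolation, where the logging framework only formats the message if the record is actually emitted; a small illustration:

```python
import logging

log = logging.getLogger(__name__)
dag_id = "example_dag"

# Before: the string is built eagerly, even if INFO is disabled.
log.info("Processing {}".format(dag_id))

# After: arguments are passed through, and formatting is deferred
# until a handler actually emits the record.
log.info("Processing %s", dag_id)
```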
Closes#2592 from Fokko/AIRFLOW-1582-Improve-logging-structure
PickleType in XCom allows remote code execution. In order to deprecate
it without changing the MySQL table schema, change PickleType to
LargeBinary, because they both map to the blob type in MySQL. Add
"enable_pickling" to the function signature to control using either
pickle or JSON; "enable_pickling" should also be added to the core
section of airflow.cfg.
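A rough sketch of the serialization switch described here (the function and flag handling are illustrative, not the actual XCom code):

```python
import json
import pickle

def serialize_value(value, enable_pickling):
    """Serialize an XCom value to bytes for the LargeBinary column."""
    if enable_pickling:
        # Legacy behaviour: flexible, but allows remote code execution on load.
        return pickle.dumps(value)
    # Safer default: JSON can only reconstruct plain data structures.
    return json.dumps(value).encode("utf-8")
```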
Picked up where https://github.com/apache/incubator-airflow/pull/2132
left off. Took this PR, fixed merge conflicts, added
documentation/tests, fixed broken tests/operators, and fixed the
Python 3 issues.
Closes#2518 from aoen/disable-pickle-type
This PR updates the Airflow configuration documentation to include a
recent change that splits task logs by try number (#2383).
Closes#2467 from AllisonWang/allison--update-doc
subprocess.Popen forks before doing execv. This makes it difficult
for some manager daemons (like supervisord) to send kill signals.
This patch uses os.execve directly; os.execve takes over the current
process and thus responds correctly to signals.
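A small illustration of the difference (the command path and arguments are hypothetical):

```python
import os

# Forking approach (previous behaviour): subprocess.Popen forks first,
# so the supervisor ends up signalling the wrapper, not the command:
#     subprocess.Popen(["airflow", "scheduler"])

# Exec approach (this patch): replace the current process image in
# place, so signals from a manager daemon reach the command directly.
os.execve("/usr/local/bin/airflow", ["airflow", "scheduler"], os.environ)
```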
* Resolves residue in ISSUE-852
Airflow spawns children in the form of a webserver, scheduler, and executors.
If the parent gets terminated (SIGTERM) it needs to properly propagate the
signals to the children, otherwise they will get orphaned and end up as
zombie processes. This patch resolves that issue.
In addition, Airflow does not store the PID of its services, which prevents
them from being managed by traditional Unix system services like rc.d /
upstart / systemd and the like. This patch adds the "--pid" flag. By default
it stores the PID in ~/airflow/airflow-<service>.pid
Lastly, the patch adds support for different log file locations: log,
stdout, and stderr (respectively: --log-file, --stdout, --stderr). By
default these are stored in ~/airflow/airflow-<service>.log/out/err.
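A condensed sketch of the signal-propagation idea (illustrative, not the actual daemon code):

```python
import os
import signal

def propagate_sigterm(signum, frame):
    """Forward SIGTERM to the whole process group so children exit too."""
    # Restore the default handler first so the re-delivered signal
    # terminates this process instead of re-entering the handler.
    signal.signal(signum, signal.SIG_DFL)
    os.killpg(os.getpgid(0), signum)

signal.signal(signal.SIGTERM, propagate_sigterm)
```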
* Resolves ISSUE-852
BaseOperator silently accepts any arguments. This deprecates the
behavior with a warning that says it will be forbidden in Airflow 2.0.
This PR also turns on DeprecationWarnings by default, which in turn
revealed that inspect.getargspec is deprecated. Here it is replaced by
`inspect.signature` (Python 3) or `funcsigs.signature` (Python 2).
Lastly, this brought to light that example_http_operator was
passing an illegal argument.
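A hedged sketch of the deprecation pattern described (the names and warning text are illustrative, not the actual BaseOperator code):

```python
import warnings

class BaseOperator(object):
    def __init__(self, task_id, **kwargs):
        if kwargs:
            # Previously these extras were silently ignored; now they warn.
            warnings.warn(
                "Invalid arguments were passed to BaseOperator: {}. "
                "Passing unrecognized arguments will be forbidden in "
                "Airflow 2.0.".format(list(kwargs)),
                category=PendingDeprecationWarning,
                stacklevel=2,
            )
        self.task_id = task_id
```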