Adds some misconceptions to FAQ

This commit is contained in:
Ask Solem 2012-06-08 15:53:57 +01:00
Родитель ff5fc5e877
Коммит 51a35ec446
12 изменённых файлов: 1026 добавлений и 1009 удалений

140
AUTHORS
Просмотреть файл

@ -1,140 +0,0 @@
=========
AUTHORS
=========
:order: sorted
Aaron Ross
Adam Endicott
Adriano Petrich
Akira Matsuzaki
Alec Clowes
Ales Zoulek
Allan Caffee
Andrew McFague
Andrew Watts
Armin Ronacher
Ask Solem
Augusto Becciu
Balachandran C
Bartosz Ptaszynski
Ben Firshman
Brad Jasper
Branko Čibej
Brendon Crawford
Brian Rosner
Bryan Berg
Chase Seibert
Chris Adams
Chris Angove
Chris Chamberlin
Chris Rose
Chris St. Pierre
Chris Streeter
Christoph Burgmer
Christopher Peplin
Clay Gerrard
Dan McGee
Daniel Hepper
Daniel Lundin
Daniel Watkins
David Arthur
David Cramer
David Miller
David Strauss
David White
Eran Rundstein
Felix Berger
Flavio Percoco Premoli
Florian Apolloner
Frédéric Junod
Gert Van Gool
Greg Haskins
Greg Taylor
Grégoire Cachet
Gunnlaugur Thor Briem
Hari
Harm Verhagen
Honza Kral
Ignas Mikalajūnas
Ionel Maries Cristian
Ionut Turturica
Iurii Kriachko
Ivan Metzlar
Jannis Leidel
Jason Baker
Jeff Balogh
Jeff Terrace
Jerzy Kozera
Jesper Noehr
John Watson
Jonas Haag
Jonas Obrist
Jonatan Heyman
Joshua Ginsberg
Juan Ignacio Catalano
Juarez Bochi
Jude Nagurney
Julien Poissonnier
Keith Perkins
Kevin Tran
Kornelijus Survila
Leo Dirac
Luis Clara Gomez
Lukas Linhart
Luke Zapart
Marcin Kuźmiński
Marcin Lulek
Mark Hellewell
Mark Lavin
Mark Stover
Mark Thurman
Martin Galpin
Martin Melin
Matt Williamson
Matthew J Morrison
Matthew Miller
Mauro Rocco
Maxim Bodyansky
Mher Movsisyan
Michael Elsdoerfer
Michael Fladischer
Miguel Hernandez
Mikhail Gusarov
Mikhail Korobov
Mitar
Neil Chintomby
Noah Kantrowitz
Norman Richards
Patrick Altman
Piotr Sikora
Remy Noel
Reza Lotun
Roberto Gaiser
Roger Hu
Rune Halvorsen
Ryan P. Kelly
Ryan Petrello
Sam Cooke
Sean Creeley
Sean O'Connor
Seong Won Mun
Simon Josi
Steeve Morin
Stefan Kjartansson
Steven Skoczen
Tayfun Sen
Thomas Johansson
Timo Sugliani
Travis Swicegood
Vincent Driessen
Vitaly Babiy
Vladimir Kryachko
Wes Turner
Wes Winham
Yury V. Zaytsev
jpellerin
kuno
lookfwd
sdcooke
Łukasz Langa
Łukasz Oleś

803
FAQ
Просмотреть файл

@ -1,803 +0,0 @@
.. _faq:
============================
Frequently Asked Questions
============================
.. contents::
:local:
.. _faq-general:
General
=======
.. _faq-when-to-use:
What kinds of things should I use Celery for?
---------------------------------------------
**Answer:** `Queue everything and delight everyone`_ is a good article
describing why you would use a queue in a web context.
.. _`Queue everything and delight everyone`:
http://decafbad.com/blog/2008/07/04/queue-everything-and-delight-everyone
These are some common use cases:
* Running something in the background. For example, to finish the web request
as soon as possible, then update the users page incrementally.
This gives the user the impression of good performance and "snappiness", even
though the real work might actually take some time.
* Running something after the web request has finished.
* Making sure something is done, by executing it asynchronously and using
retries.
* Scheduling periodic work.
And to some degree:
* Distributed computing.
* Parallel execution.
.. _faq-misconceptions:
Misconceptions
==============
.. _faq-serializion-is-a-choice:
Is Celery dependent on pickle?
------------------------------
**Answer:** No.
Celery can support any serialization scheme and has built-in support for
JSON, YAML, Pickle and msgpack. Also, as every task is associated with a
content type, you can even send one task using pickle, and another using JSON.
The default serialization format is pickle simply because it is
convenient (it supports sending complex Python objects as task arguments).
If you need to communicate with other languages you should change
to a serialization format that is suitable for that.
You can set a global default serializer, the default serializer for a
particular Task, or even what serializer to use when sending a single task
instance.
.. _faq-is-celery-for-django-only:
Is Celery for Django only?
--------------------------
**Answer:** No.
Celery does not depend on Django anymore. To use Celery with Django you have
to use the `django-celery`_ package.
.. _`django-celery`: http://pypi.python.org/pypi/django-celery
.. _faq-is-celery-for-rabbitmq-only:
Do I have to use AMQP/RabbitMQ?
-------------------------------
**Answer**: No.
You can also use Redis, Beanstalk, CouchDB, MongoDB or an SQL database,
see :ref:`brokers`.
These "virtual transports" may have limited broadcast and event functionality.
For example remote control commands only works with AMQP and Redis.
Redis or a database won't perform as well as
an AMQP broker. If you have strict reliability requirements you are
encouraged to use RabbitMQ or another AMQP broker. Redis/database also use
polling, so they are likely to consume more resources. However, if you for
some reason are not able to use AMQP, feel free to use these alternatives.
They will probably work fine for most use cases, and note that the above
points are not specific to Celery; If using Redis/database as a queue worked
fine for you before, it probably will now. You can always upgrade later
if you need to.
.. _faq-is-celery-multilingual:
Is Celery multilingual?
------------------------
**Answer:** Yes.
:mod:`~celery.bin.celeryd` is an implementation of Celery in Python. If the
language has an AMQP client, there shouldn't be much work to create a worker
in your language. A Celery worker is just a program connecting to the broker
to process messages.
Also, there's another way to be language independent, and that is to use REST
tasks, instead of your tasks being functions, they're URLs. With this
information you can even create simple web servers that enable preloading of
code. See: `User Guide: Remote Tasks`_.
.. _`User Guide: Remote Tasks`:
http://celery.github.com/celery/userguide/remote-tasks.html
.. _faq-troubleshooting:
Troubleshooting
===============
.. _faq-mysql-deadlocks:
MySQL is throwing deadlock errors, what can I do?
-------------------------------------------------
**Answer:** MySQL has default isolation level set to `REPEATABLE-READ`,
if you don't really need that, set it to `READ-COMMITTED`.
You can do that by adding the following to your :file:`my.cnf`::
[mysqld]
transaction-isolation = READ-COMMITTED
For more information about InnoDB`s transaction model see `MySQL - The InnoDB
Transaction Model and Locking`_ in the MySQL user manual.
(Thanks to Honza Kral and Anton Tsigularov for this solution)
.. _`MySQL - The InnoDB Transaction Model and Locking`: http://dev.mysql.com/doc/refman/5.1/en/innodb-transaction-model.html
.. _faq-worker-hanging:
celeryd is not doing anything, just hanging
--------------------------------------------
**Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
or `Why is Task.delay/apply\* just hanging?`.
.. _faq-results-unreliable:
Task results aren't reliably returning
--------------------------------------
**Answer:** If you're using the database backend for results, and in particular
using MySQL, see `MySQL is throwing deadlock errors, what can I do?`_.
.. _faq-publish-hanging:
Why is Task.delay/apply\*/celeryd just hanging?
-----------------------------------------------
**Answer:** There is a bug in some AMQP clients that will make it hang if
it's not able to authenticate the current user, the password doesn't match or
the user does not have access to the virtual host specified. Be sure to check
your broker logs (for RabbitMQ that is :file:`/var/log/rabbitmq/rabbit.log` on
most systems), it usually contains a message describing the reason.
.. _faq-celeryd-on-freebsd:
Does it work on FreeBSD?
------------------------
**Answer:** The multiprocessing pool requires a working POSIX semaphore
implementation which isn't enabled in FreeBSD by default. You have to enable
POSIX semaphores in the kernel and manually recompile multiprocessing.
Luckily, Viktor Petersson has written a tutorial to get you started with
Celery on FreeBSD here:
http://www.playingwithwire.com/2009/10/how-to-get-celeryd-to-work-on-freebsd/
.. _faq-duplicate-key-errors:
I'm having `IntegrityError: Duplicate Key` errors. Why?
---------------------------------------------------------
**Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
Thanks to howsthedotcom.
.. _faq-worker-stops-processing:
Why aren't my tasks processed?
------------------------------
**Answer:** With RabbitMQ you can see how many consumers are currently
receiving tasks by running the following command::
$ rabbitmqctl list_queues -p <myvhost> name messages consumers
Listing queues ...
celery 2891 2
This shows that there's 2891 messages waiting to be processed in the task
queue, and there are two consumers processing them.
One reason that the queue is never emptied could be that you have a stale
worker process taking the messages hostage. This could happen if celeryd
wasn't properly shut down.
When a message is received by a worker the broker waits for it to be
acknowledged before marking the message as processed. The broker will not
re-send that message to another consumer until the consumer is shut down
properly.
If you hit this problem you have to kill all workers manually and restart
them::
ps auxww | grep celeryd | awk '{print $2}' | xargs kill
You might have to wait a while until all workers have finished the work they're
doing. If it's still hanging after a long time you can kill them by force
with::
ps auxww | grep celeryd | awk '{print $2}' | xargs kill -9
.. _faq-task-does-not-run:
Why won't my Task run?
----------------------
**Answer:** There might be syntax errors preventing the tasks module being imported.
You can find out if Celery is able to run the task by executing the
task manually:
>>> from myapp.tasks import MyPeriodicTask
>>> MyPeriodicTask.delay()
Watch celeryd`s log file to see if it's able to find the task, or if some
other error is happening.
.. _faq-periodic-task-does-not-run:
Why won't my periodic task run?
-------------------------------
**Answer:** See `Why won't my Task run?`_.
.. _faq-purge-the-queue:
How do I purge all waiting tasks?
---------------------------------
**Answer:** You can use celeryctl to purge all configured task queues::
$ celeryctl purge
or programatically::
>>> from celery import current_app as celery
>>> celery.control.purge()
1753
If you only want to purge messages from a specific queue
you have to use the AMQP API or the :program:`camqadm` utility::
$ camqadm queue.purge <queue name>
The number 1753 is the number of messages deleted.
You can also start :mod:`~celery.bin.celeryd` with the
:option:`--purge` argument, to purge messages when the worker starts.
.. _faq-messages-left-after-purge:
I've purged messages, but there are still messages left in the queue?
---------------------------------------------------------------------
**Answer:** Tasks are acknowledged (removed from the queue) as soon
as they are actually executed. After the worker has received a task, it will
take some time until it is actually executed, especially if there are a lot
of tasks already waiting for execution. Messages that are not acknowledged are
held on to by the worker until it closes the connection to the broker (AMQP
server). When that connection is closed (e.g. because the worker was stopped)
the tasks will be re-sent by the broker to the next available worker (or the
same worker when it has been restarted), so to properly purge the queue of
waiting tasks you have to stop all the workers, and then purge the tasks
using :func:`celery.control.purge`.
.. _faq-results:
Results
=======
.. _faq-get-result-by-task-id:
How do I get the result of a task if I have the ID that points there?
----------------------------------------------------------------------
**Answer**: Use `Task.AsyncResult`::
>>> result = MyTask.AsyncResult(task_id)
>>> result.get()
This will give you a :class:`~celery.result.BaseAsyncResult` instance
using the tasks current result backend.
If you need to specify a custom result backend you should use
:class:`celery.result.BaseAsyncResult` directly::
>>> from celery.result import BaseAsyncResult
>>> result = BaseAsyncResult(task_id, backend=...)
>>> result.get()
.. _faq-security:
Security
========
Isn't using `pickle` a security concern?
----------------------------------------
**Answer**: Yes, indeed it is.
You are right to have a security concern, as this can indeed be a real issue.
It is essential that you protect against unauthorized
access to your broker, databases and other services transmitting pickled
data.
For the task messages you can set the :setting:`CELERY_TASK_SERIALIZER`
setting to "json" or "yaml" instead of pickle. There is
currently no alternative solution for task results (but writing a
custom result backend using JSON is a simple task)
Note that this is not just something you should be aware of with Celery, for
example also Django uses pickle for its cache client.
Can messages be encrypted?
--------------------------
**Answer**: Some AMQP brokers supports using SSL (including RabbitMQ).
You can enable this using the :setting:`BROKER_USE_SSL` setting.
It is also possible to add additional encryption and security to messages,
if you have a need for this then you should contact the :ref:`mailing-list`.
Is it safe to run :program:`celeryd` as root?
---------------------------------------------
**Answer**: No!
We're not currently aware of any security issues, but it would
be incredibly naive to assume that they don't exist, so running
the Celery services (:program:`celeryd`, :program:`celerybeat`,
:program:`celeryev`, etc) as an unprivileged user is recommended.
.. _faq-brokers:
Brokers
=======
Why is RabbitMQ crashing?
-------------------------
**Answer:** RabbitMQ will crash if it runs out of memory. This will be fixed in a
future release of RabbitMQ. please refer to the RabbitMQ FAQ:
http://www.rabbitmq.com/faq.html#node-runs-out-of-memory
.. note::
This is no longer the case, RabbitMQ versions 2.0 and above
includes a new persister, that is tolerant to out of memory
errors. RabbitMQ 2.1 or higher is recommended for Celery.
If you're still running an older version of RabbitMQ and experience
crashes, then please upgrade!
Misconfiguration of Celery can eventually lead to a crash
on older version of RabbitMQ. Even if it doesn't crash, this
can still consume a lot of resources, so it is very
important that you are aware of the common pitfalls.
* Events.
Running :mod:`~celery.bin.celeryd` with the :option:`-E`/:option:`--events`
option will send messages for events happening inside of the worker.
Events should only be enabled if you have an active monitor consuming them,
or if you purge the event queue periodically.
* AMQP backend results.
When running with the AMQP result backend, every task result will be sent
as a message. If you don't collect these results, they will build up and
RabbitMQ will eventually run out of memory.
Results expire after 1 day by default. It may be a good idea
to lower this value by configuring the :setting:`CELERY_TASK_RESULT_EXPIRES`
setting.
If you don't use the results for a task, make sure you set the
`ignore_result` option:
.. code-block python
@celery.task(ignore_result=True)
def mytask():
...
class MyTask(Task):
ignore_result = True
.. _faq-use-celery-with-stomp:
Can I use Celery with ActiveMQ/STOMP?
-------------------------------------
**Answer**: No. It used to be supported by Carrot,
but is not currently supported in Kombu.
.. _faq-non-amqp-missing-features:
What features are not supported when not using an AMQP broker?
--------------------------------------------------------------
This is an incomplete list of features not available when
using the virtual transports:
* Remote control commands (supported only by Redis).
* Monitoring with events may not work in all virtual transports.
* The `header` and `fanout` exchange types
(`fanout` is supported by Redis).
.. _faq-tasks:
Tasks
=====
.. _faq-tasks-connection-reuse:
How can I reuse the same connection when applying tasks?
--------------------------------------------------------
**Answer**: See the :setting:`BROKER_POOL_LIMIT` setting.
The connection pool is enabled by default since version 2.5.
.. _faq-sudo-subprocess:
Sudo in a :mod:`subprocess` returns :const:`None`
-------------------------------------------------
There is a sudo configuration option that makes it illegal for process
without a tty to run sudo::
Defaults requiretty
If you have this configuration in your :file:`/etc/sudoers` file then
tasks will not be able to call sudo when celeryd is running as a daemon.
If you want to enable that, then you need to remove the line from sudoers.
See: http://timelordz.com/wiki/Apache_Sudo_Commands
.. _faq-deletes-unknown-tasks:
Why do workers delete tasks from the queue if they are unable to process them?
------------------------------------------------------------------------------
**Answer**:
The worker rejects unknown tasks, messages with encoding errors and messages
that doesn't contain the proper fields (as per the task message protocol).
If it did not reject them they could be redelivered again and again,
causing a loop.
Recent versions of RabbitMQ has the ability to configure a dead-letter
queue for exchange, so that rejected messages is moved there.
.. _faq-execute-task-by-name:
Can I execute a task by name?
-----------------------------
**Answer**: Yes. Use :func:`celery.execute.send_task`.
You can also execute a task by name from any language
that has an AMQP client.
>>> from celery.execute import send_task
>>> send_task("tasks.add", args=[2, 2], kwargs={})
<AsyncResult: 373550e8-b9a0-4666-bc61-ace01fa4f91d>
.. _faq-get-current-task-id:
How can I get the task id of the current task?
----------------------------------------------
**Answer**: The current id and more is available in the task request::
@celery.task()
def mytask():
cache.set(mytask.request.id, "Running")
For more information see :ref:`task-request-info`.
.. _faq-custom-task-ids:
Can I specify a custom task_id?
-------------------------------
**Answer**: Yes. Use the `task_id` argument to :meth:`Task.apply_async`::
>>> task.apply_async(args, kwargs, task_id="...")
Can I use decorators with tasks?
--------------------------------
**Answer**: Yes. But please see note at :ref:`tasks-decorating`.
.. _faq-natural-task-ids:
Can I use natural task ids?
---------------------------
**Answer**: Yes, but make sure it is unique, as the behavior
for two tasks existing with the same id is undefined.
The world will probably not explode, but at the worst
they can overwrite each others results.
.. _faq-task-callbacks:
How can I run a task once another task has finished?
----------------------------------------------------
**Answer**: You can safely launch a task inside a task.
Also, a common pattern is to use callback tasks:
.. code-block:: python
@celery.task()
def add(x, y, callback=None):
result = x + y
if callback:
subtask(callback).delay(result)
return result
@celery.task(ignore_result=True)
def log_result(result, **kwargs):
logger = log_result.get_logger(**kwargs)
logger.info("log_result got: %s" % (result, ))
Invocation::
>>> add.delay(2, 2, callback=log_result.subtask())
See :doc:`userguide/tasksets` for more information.
.. _faq-cancel-task:
Can I cancel the execution of a task?
-------------------------------------
**Answer**: Yes. Use `result.revoke`::
>>> result = add.apply_async(args=[2, 2], countdown=120)
>>> result.revoke()
or if you only have the task id::
>>> from celery import current_app as celery
>>> celery.control.revoke(task_id)
.. _faq-node-not-receiving-broadcast-commands:
Why aren't my remote control commands received by all workers?
--------------------------------------------------------------
**Answer**: To receive broadcast remote control commands, every worker node
uses its host name to create a unique queue name to listen to,
so if you have more than one worker with the same host name, the
control commands will be received in round-robin between them.
To work around this you can explicitly set the host name for every worker
using the :option:`--hostname` argument to :mod:`~celery.bin.celeryd`::
$ celeryd --hostname=$(hostname).1
$ celeryd --hostname=$(hostname).2
etc., etc...
.. _faq-task-routing:
Can I send some tasks to only some servers?
--------------------------------------------
**Answer:** Yes. You can route tasks to an arbitrary server using AMQP,
and a worker can bind to as many queues as it wants.
See :doc:`userguide/routing` for more information.
.. _faq-change-periodic-task-interval-at-runtime:
Can I change the interval of a periodic task at runtime?
--------------------------------------------------------
**Answer**: Yes. You can use the Django database scheduler, or you can
override `PeriodicTask.is_due` or turn `PeriodicTask.run_every` into a
property:
.. code-block:: python
class MyPeriodic(PeriodicTask):
def run(self):
# ...
@property
def run_every(self):
return get_interval_from_database(...)
.. _faq-task-priorities:
Does celery support task priorities?
------------------------------------
**Answer**: No. In theory, yes, as AMQP supports priorities. However
RabbitMQ doesn't implement them yet.
The usual way to prioritize work in Celery, is to route high priority tasks
to different servers. In the real world this may actually work better than per message
priorities. You can use this in combination with rate limiting to achieve a
highly responsive system.
.. _faq-acks_late-vs-retry:
Should I use retry or acks_late?
--------------------------------
**Answer**: Depends. It's not necessarily one or the other, you may want
to use both.
`Task.retry` is used to retry tasks, notably for expected errors that
is catchable with the `try:` block. The AMQP transaction is not used
for these errors: **if the task raises an exception it is still acknowledged!**.
The `acks_late` setting would be used when you need the task to be
executed again if the worker (for some reason) crashes mid-execution.
It's important to note that the worker is not known to crash, and if
it does it is usually an unrecoverable error that requires human
intervention (bug in the worker, or task code).
In an ideal world you could safely retry any task that has failed, but
this is rarely the case. Imagine the following task:
.. code-block:: python
@celery.task()
def process_upload(filename, tmpfile):
# Increment a file count stored in a database
increment_file_counter()
add_file_metadata_to_db(filename, tmpfile)
copy_file_to_destination(filename, tmpfile)
If this crashed in the middle of copying the file to its destination
the world would contain incomplete state. This is not a critical
scenario of course, but you can probably imagine something far more
sinister. So for ease of programming we have less reliability;
It's a good default, users who require it and know what they
are doing can still enable acks_late (and in the future hopefully
use manual acknowledgement)
In addition `Task.retry` has features not available in AMQP
transactions: delay between retries, max retries, etc.
So use retry for Python errors, and if your task is idempotent
combine that with `acks_late` if that level of reliability
is required.
.. _faq-schedule-at-specific-time:
Can I schedule tasks to execute at a specific time?
---------------------------------------------------
.. module:: celery.task.base
**Answer**: Yes. You can use the `eta` argument of :meth:`Task.apply_async`.
Or to schedule a periodic task at a specific time, use the
:class:`celery.schedules.crontab` schedule behavior:
.. code-block:: python
from celery.schedules import crontab
from celery.task import periodic_task
@periodic_task(run_every=crontab(hour=7, minute=30, day_of_week="mon"))
def every_monday_morning():
print("This is run every Monday morning at 7:30")
.. _faq-safe-worker-shutdown:
How do I shut down `celeryd` safely?
--------------------------------------
**Answer**: Use the :sig:`TERM` signal, and the worker will finish all currently
executing jobs and shut down as soon as possible. No tasks should be lost.
You should never stop :mod:`~celery.bin.celeryd` with the :sig:`KILL` signal
(:option:`-9`), unless you've tried :sig:`TERM` a few times and waited a few
minutes to let it get a chance to shut down. As if you do tasks may be
terminated mid-execution, and they will not be re-run unless you have the
`acks_late` option set (`Task.acks_late` / :setting:`CELERY_ACKS_LATE`).
.. seealso::
:ref:`worker-stopping`
.. _faq-daemonizing:
How do I run celeryd in the background on [platform]?
-----------------------------------------------------
**Answer**: Please see :ref:`daemonizing`.
.. _faq-django:
Django
======
.. _faq-django-database-tables:
What purpose does the database tables created by django-celery have?
--------------------------------------------------------------------
Several database tables are created by default, these relate to
* Monitoring
When you use the django-admin monitor, the cluster state is written
to the ``TaskState`` and ``WorkerState`` models.
* Periodic tasks
When the database-backed schedule is used the periodic task
schedule is taken from the ``PeriodicTask`` model, there are
also several other helper tables (``IntervalSchedule``,
``CrontabSchedule``, ``PeriodicTasks``).
* Task results
The database result backend is enabled by default when using django-celery
(this is for historical reasons, and thus for backward compatibility).
The results are stored in the ``TaskMeta`` and ``TaskSetMeta`` models.
*these tables are not created if another result backend is configured*.
.. _faq-windows:
Windows
=======
.. _faq-windows-worker-spawn-loop:
celeryd keeps spawning processes at startup
-------------------------------------------
**Answer**: This is a known issue on Windows.
You have to start celeryd with the command::
$ python -m celery.bin.celeryd
Any additional arguments can be appended to this command.
See http://bit.ly/bo9RSw
.. _faq-windows-worker-embedded-beat:
The `-B` / `--beat` option to celeryd doesn't work?
----------------------------------------------------------------
**Answer**: That's right. Run `celerybeat` and `celeryd` as separate
services instead.
.. _faq-windows-django-settings:
`django-celery` can't find settings?
--------------------------------------
**Answer**: You need to specify the :option:`--settings` argument to
:program:`manage.py`::
$ python manage.py celeryd start --settings=settings
See http://bit.ly/bo9RSw

19
INSTALL
Просмотреть файл

@ -1,19 +0,0 @@
Installing Celery
=================
You can install Celery either via the Python Package Index (PyPI)
or from source.
To install using `pip`::
$ pip install Celery
To install using `easy_install`::
$ easy_install Celery
If you have downloaded a source tarball you can install it
by doing the following::
$ python setup.py build
# python setup.py install # as root

Просмотреть файл

@ -1,8 +1,5 @@
include CONTRIBUTORS.txt
include Changelog
include FAQ
include INSTALL
include README
include README.rst
include MANIFEST.in
include LICENSE

1
README
Просмотреть файл

@ -1 +0,0 @@
README.rst

Просмотреть файл

@ -1,40 +1,58 @@
celery/__init__.py
celery/app/__init__.py
celery/app/base.py
celery/app/defaults.py
celery/app/amqp.py
celery/apps/__init__.py
celery/apps/beat.py
celery/apps/worker.py
celery/backends/__init__.py
celery/backends/base.py
celery/backends/amqp.py
celery/beat.py
celery/concurrency/__init__.py
celery/concurrency/base.py
celery/datastructures.py
celery/decorators.py
celery/events/__init__.py
celery/events/state.py
celery/exceptions.py
celery/execute/trace.py
celery/loaders/app.py
celery/loaders/base.py
celery/registry.py
celery/result.py
celery/app.routes.py
celery/schedules.py
celery/signals.py
celery/states.py
celery/task/__init__.py
celery/task/base.py
celery/task/control.py
celery/task/sets.py
celery/worker/__init__.py
celery/worker/buckets.py
celery/worker/consumer.py
celery/worker/controllers.py
celery/worker/heartbeat.py
celery/worker/job.py
celery/worker/state.py
celery/worker/control/builtins.py
celery//__init__.py
celery//app/__init__.py
celery//app/abstract.py
celery//app/amqp.py
celery//app/annotations.py
celery//app/base.py
celery//app/builtins.py
celery//app/control.py
celery//app/defaults.py
celery//app/log.py
celery//app/registry.py
celery//app/routes.py
celery//app/task.py
celery//app/utils.py
celery//apps/__init__.py
celery//apps/worker.py
celery//backends/__init__.py
celery//backends/base.py
celery//bin/__init__.py
celery//bin/base.py
celery//bin/celeryd.py
celery//canvas.py
celery//concurrency/__init__.py
celery//concurrency/base.py
celery//concurrency/processes/__init__.py
celery//concurrency/processes/_win.py
celery//concurrency/solo.py
celery//datastructures.py
celery//events/__init__.py
celery//events/state.py
celery//exceptions.py
celery//loaders/__init__.py
celery//loaders/app.py
celery//loaders/base.py
celery//loaders/default.py
celery//result.py
celery//security/__init__.py
celery//security/certificate.py
celery//security/key.py
celery//security/serialization.py
celery//security/utils.py
celery//signals.py
celery//state.py
celery//states.py
celery//task/trace.py
celery//worker/__init__.py
celery//worker/abstract.py
celery//worker/autoreload.py
celery//worker/autoscale.py
celery//worker/buckets.py
celery//worker/consumer.py
celery//worker/control.py
celery//worker/heartbeat.py
celery//worker/hub.py
celery//worker/job.py
celery//worker/mediator.py
celery//worker/state.py
celery//worker/strategy.py

Просмотреть файл

Просмотреть файл

@ -103,7 +103,44 @@ html_sidebars = {
### Issuetracker
if False:
if True:
issuetracker = "github"
issuetracker_project = "celery/celery"
issuetracker_issue_pattern = r'[Ii]ssue #(\d+)'
# -- Options for Epub output ---------------------------------------------------
# Bibliographic Dublin Core info.
epub_title = 'Celery Manual, Version 2.6'
epub_author = 'Ask Solem'
epub_publisher = 'Celery Project'
epub_copyright = '2009-2012'
# The language of the text. It defaults to the language option
# or en if the language is not set.
epub_language = 'en'
# The scheme of the identifier. Typical schemes are ISBN or URL.
epub_scheme = 'ISBN'
# The unique identifier of the text. This can be a ISBN number
# or the project homepage.
epub_identifier = 'celeryproject.org'
# A unique identification for the text.
epub_uid = 'Celery Manual, Version 2.6'
# HTML files that should be inserted before the pages created by sphinx.
# The format is a list of tuples containing the path and title.
#epub_pre_files = []
# HTML files shat should be inserted after the pages created by sphinx.
# The format is a list of tuples containing the path and title.
#epub_post_files = []
# A list of files that should not be packed into the epub file.
epub_exclude_files = ['search.html']
# The depth of the table of contents in toc.ncx.
epub_tocdepth = 3

Просмотреть файл

@ -1 +0,0 @@
../FAQ

916
docs/faq.rst Normal file
Просмотреть файл

@ -0,0 +1,916 @@
.. _faq:
============================
Frequently Asked Questions
============================
.. contents::
:local:
.. _faq-general:
General
=======
.. _faq-when-to-use:
What kinds of things should I use Celery for?
---------------------------------------------
**Answer:** `Queue everything and delight everyone`_ is a good article
describing why you would use a queue in a web context.
.. _`Queue everything and delight everyone`:
http://decafbad.com/blog/2008/07/04/queue-everything-and-delight-everyone
These are some common use cases:
* Running something in the background. For example, to finish the web request
as soon as possible, then update the users page incrementally.
This gives the user the impression of good performance and "snappiness", even
though the real work might actually take some time.
* Running something after the web request has finished.
* Making sure something is done, by executing it asynchronously and using
retries.
* Scheduling periodic work.
And to some degree:
* Distributed computing.
* Parallel execution.
.. _faq-misconceptions:
Misconceptions
==============
.. _faq-loc:
Does Celery really consist of 50.000 lines of code?
---------------------------------------------------
**Answer:**: No, this and similarly large number have
been reported in various locations.
This a list of the actual numbers:
- core: 7,141 lines of code.
- tests: 14,209 lines.
- backends, contrib, compat utilities: 9,032 lines.
Lines of code is not a useful metric to prove anything, so
even if Celery did consist of 50k lines of code you would not
be able to draw any conclusions from such a number.
Does Celery have many dependencies?
-----------------------------------
A common criticism is that Celery uses too many dependencies.
The rationale behind such a fear is hard to imagine, especially considering
code reuse as the established way to combat complexity in modern software
development, and that the cost of adding dependencies is very low now
that package managers like pip and PyPI makes the hassle of installing
and maintaining dependencies a thing of the past.
Celery has replaced several dependencies along the way, and
the current list of dependencies are:
celery
~~~~~~
- `kombu`_
Kombu is part of the Celery ecosystem and is the library used
to send and receive messages. It is also the library that enables
us to support many different message brokers. It is also used by the
OpenStack project, and many others, validating the choice to separate
it from the Celery codebase.
.. _`kombu`: http://pypi.python.org/pypi/kombu
- `billiard`_
Billiard is a fork of the Python multiprocessing module containing
many performance and stability improvements. It is an eventual goal
that these improvements will be merged back into Python one day.
It is also used for compatibility with older Python versions.
.. _`billiard`: http://pypi.python.org/pypi/billiard
- `python-dateutil`_
The dateutil module is used by Celery to parse ISO-8601 formatted time strings,
as well as its ``relativedelta`` class which is used in the implementation
of crontab style periodic tasks.
.. _`python-dateutil`: http://pypi.python.org/pypi/python-dateutil
django-celery
~~~~~~~~~~~~~
If you use django-celery then you don't have to install celery separately,
as it will make sure that the required version is installed.
django-celery does not have any other dependencies.
kombu
~~~~~
Kombu depends on the following packages:
- `amqplib`_
The underlying pure-Python amqp client implementation. AMQP being the default
broker it is a natural dependency.
.. _`amqplib`: http://pypi.python.org/pypi/amqplib
- `anyjson`_
anyjson is an utility library to select the best possible
JSON implementation.
.. _`anyjson`: http://pypi.python.org/pypi/anyjson
.. note::
For compatibility reasons additional packages may be installed
if you are running on older Python versions,
for example Python 2.6 depends on the ``importlib``,
and ``ordereddict`` libraries.
Also, to handle the dependencies for popular configuration
choices Celery defines a number of "bundle" packages,
see :ref:`bundles`.
.. _faq-heavyweight
Is Celery heavy-weight?
-----------------------
Celery poses very little overhead both in memory footprint and
performance.
But not that the default configuration is not optimized for either,
see the :ref:`guide-optimizing` guide for more information.
.. _faq-serializion-is-a-choice:
Is Celery dependent on pickle?
------------------------------
**Answer:** No.
Celery can support any serialization scheme and has built-in support for
JSON, YAML, Pickle and msgpack. Also, as every task is associated with a
content type, you can even send one task using pickle, and another using JSON.
The default serialization format is pickle simply because it is
convenient (it supports sending complex Python objects as task arguments).
If you need to communicate with other languages you should change
to a serialization format that is suitable for that.
You can set a global default serializer, the default serializer for a
particular Task, or even what serializer to use when sending a single task
instance.
.. _faq-is-celery-for-django-only:
Is Celery for Django only?
--------------------------
**Answer:** No.
Celery does not depend on Django anymore. To use Celery with Django you have
to use the `django-celery`_ package.
.. _`django-celery`: http://pypi.python.org/pypi/django-celery
.. _faq-is-celery-for-rabbitmq-only:
Do I have to use AMQP/RabbitMQ?
-------------------------------
**Answer**: No.
You can also use Redis, Beanstalk, CouchDB, MongoDB or an SQL database,
see :ref:`brokers`.
These "virtual transports" may have limited broadcast and event functionality.
For example remote control commands only works with AMQP and Redis.
Redis or a database won't perform as well as
an AMQP broker. If you have strict reliability requirements you are
encouraged to use RabbitMQ or another AMQP broker. Redis/database also use
polling, so they are likely to consume more resources. However, if you for
some reason are not able to use AMQP, feel free to use these alternatives.
They will probably work fine for most use cases, and note that the above
points are not specific to Celery; If using Redis/database as a queue worked
fine for you before, it probably will now. You can always upgrade later
if you need to.
.. _faq-is-celery-multilingual:
Is Celery multilingual?
------------------------
**Answer:** Yes.
:mod:`~celery.bin.celeryd` is an implementation of Celery in Python. If the
language has an AMQP client, there shouldn't be much work to create a worker
in your language. A Celery worker is just a program connecting to the broker
to process messages.
Also, there's another way to be language independent, and that is to use REST
tasks, instead of your tasks being functions, they're URLs. With this
information you can even create simple web servers that enable preloading of
code. See: `User Guide: Remote Tasks`_.
.. _`User Guide: Remote Tasks`:
http://celery.github.com/celery/userguide/remote-tasks.html
.. _faq-troubleshooting:
Troubleshooting
===============
.. _faq-mysql-deadlocks:
MySQL is throwing deadlock errors, what can I do?
-------------------------------------------------
**Answer:** MySQL has default isolation level set to `REPEATABLE-READ`,
if you don't really need that, set it to `READ-COMMITTED`.
You can do that by adding the following to your :file:`my.cnf`::
[mysqld]
transaction-isolation = READ-COMMITTED
For more information about InnoDB`s transaction model see `MySQL - The InnoDB
Transaction Model and Locking`_ in the MySQL user manual.
(Thanks to Honza Kral and Anton Tsigularov for this solution)
.. _`MySQL - The InnoDB Transaction Model and Locking`: http://dev.mysql.com/doc/refman/5.1/en/innodb-transaction-model.html
.. _faq-worker-hanging:
celeryd is not doing anything, just hanging
--------------------------------------------
**Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
or `Why is Task.delay/apply\* just hanging?`.
.. _faq-results-unreliable:
Task results aren't reliably returning
--------------------------------------
**Answer:** If you're using the database backend for results, and in particular
using MySQL, see `MySQL is throwing deadlock errors, what can I do?`_.
.. _faq-publish-hanging:
Why is Task.delay/apply\*/celeryd just hanging?
-----------------------------------------------
**Answer:** There is a bug in some AMQP clients that will make it hang if
it's not able to authenticate the current user, the password doesn't match or
the user does not have access to the virtual host specified. Be sure to check
your broker logs (for RabbitMQ that is :file:`/var/log/rabbitmq/rabbit.log` on
most systems), it usually contains a message describing the reason.
.. _faq-celeryd-on-freebsd:
Does it work on FreeBSD?
------------------------
**Answer:** The multiprocessing pool requires a working POSIX semaphore
implementation which isn't enabled in FreeBSD by default. You have to enable
POSIX semaphores in the kernel and manually recompile multiprocessing.
Luckily, Viktor Petersson has written a tutorial to get you started with
Celery on FreeBSD here:
http://www.playingwithwire.com/2009/10/how-to-get-celeryd-to-work-on-freebsd/
.. _faq-duplicate-key-errors:
I'm having `IntegrityError: Duplicate Key` errors. Why?
---------------------------------------------------------
**Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
Thanks to howsthedotcom.
.. _faq-worker-stops-processing:
Why aren't my tasks processed?
------------------------------
**Answer:** With RabbitMQ you can see how many consumers are currently
receiving tasks by running the following command::
$ rabbitmqctl list_queues -p <myvhost> name messages consumers
Listing queues ...
celery 2891 2
This shows that there's 2891 messages waiting to be processed in the task
queue, and there are two consumers processing them.
One reason that the queue is never emptied could be that you have a stale
worker process taking the messages hostage. This could happen if celeryd
wasn't properly shut down.
When a message is received by a worker the broker waits for it to be
acknowledged before marking the message as processed. The broker will not
re-send that message to another consumer until the consumer is shut down
properly.
If you hit this problem you have to kill all workers manually and restart
them::
ps auxww | grep celeryd | awk '{print $2}' | xargs kill
You might have to wait a while until all workers have finished the work they're
doing. If it's still hanging after a long time you can kill them by force
with::
ps auxww | grep celeryd | awk '{print $2}' | xargs kill -9
.. _faq-task-does-not-run:
Why won't my Task run?
----------------------
**Answer:** There might be syntax errors preventing the tasks module being imported.
You can find out if Celery is able to run the task by executing the
task manually:
>>> from myapp.tasks import MyPeriodicTask
>>> MyPeriodicTask.delay()
Watch celeryd`s log file to see if it's able to find the task, or if some
other error is happening.
.. _faq-periodic-task-does-not-run:
Why won't my periodic task run?
-------------------------------
**Answer:** See `Why won't my Task run?`_.
.. _faq-purge-the-queue:
How do I purge all waiting tasks?
---------------------------------
**Answer:** You can use celeryctl to purge all configured task queues::
$ celeryctl purge
or programatically::
>>> from celery import current_app as celery
>>> celery.control.purge()
1753
If you only want to purge messages from a specific queue
you have to use the AMQP API or the :program:`camqadm` utility::
$ camqadm queue.purge <queue name>
The number 1753 is the number of messages deleted.
You can also start :mod:`~celery.bin.celeryd` with the
:option:`--purge` argument, to purge messages when the worker starts.
.. _faq-messages-left-after-purge:
I've purged messages, but there are still messages left in the queue?
---------------------------------------------------------------------
**Answer:** Tasks are acknowledged (removed from the queue) as soon
as they are actually executed. After the worker has received a task, it will
take some time until it is actually executed, especially if there are a lot
of tasks already waiting for execution. Messages that are not acknowledged are
held on to by the worker until it closes the connection to the broker (AMQP
server). When that connection is closed (e.g. because the worker was stopped)
the tasks will be re-sent by the broker to the next available worker (or the
same worker when it has been restarted), so to properly purge the queue of
waiting tasks you have to stop all the workers, and then purge the tasks
using :func:`celery.control.purge`.
.. _faq-results:
Results
=======
.. _faq-get-result-by-task-id:
How do I get the result of a task if I have the ID that points there?
----------------------------------------------------------------------
**Answer**: Use `Task.AsyncResult`::
>>> result = MyTask.AsyncResult(task_id)
>>> result.get()
This will give you a :class:`~celery.result.BaseAsyncResult` instance
using the tasks current result backend.
If you need to specify a custom result backend you should use
:class:`celery.result.BaseAsyncResult` directly::
>>> from celery.result import BaseAsyncResult
>>> result = BaseAsyncResult(task_id, backend=...)
>>> result.get()
.. _faq-security:
Security
========
Isn't using `pickle` a security concern?
----------------------------------------
**Answer**: Yes, indeed it is.
You are right to have a security concern, as this can indeed be a real issue.
It is essential that you protect against unauthorized
access to your broker, databases and other services transmitting pickled
data.
For the task messages you can set the :setting:`CELERY_TASK_SERIALIZER`
setting to "json" or "yaml" instead of pickle. There is
currently no alternative solution for task results (but writing a
custom result backend using JSON is a simple task)
Note that this is not just something you should be aware of with Celery, for
example also Django uses pickle for its cache client.
Can messages be encrypted?
--------------------------
**Answer**: Some AMQP brokers supports using SSL (including RabbitMQ).
You can enable this using the :setting:`BROKER_USE_SSL` setting.
It is also possible to add additional encryption and security to messages,
if you have a need for this then you should contact the :ref:`mailing-list`.
Is it safe to run :program:`celeryd` as root?
---------------------------------------------
**Answer**: No!
We're not currently aware of any security issues, but it would
be incredibly naive to assume that they don't exist, so running
the Celery services (:program:`celeryd`, :program:`celerybeat`,
:program:`celeryev`, etc) as an unprivileged user is recommended.
.. _faq-brokers:
Brokers
=======
Why is RabbitMQ crashing?
-------------------------
**Answer:** RabbitMQ will crash if it runs out of memory. This will be fixed in a
future release of RabbitMQ. please refer to the RabbitMQ FAQ:
http://www.rabbitmq.com/faq.html#node-runs-out-of-memory
.. note::
This is no longer the case, RabbitMQ versions 2.0 and above
includes a new persister, that is tolerant to out of memory
errors. RabbitMQ 2.1 or higher is recommended for Celery.
If you're still running an older version of RabbitMQ and experience
crashes, then please upgrade!
Misconfiguration of Celery can eventually lead to a crash
on older version of RabbitMQ. Even if it doesn't crash, this
can still consume a lot of resources, so it is very
important that you are aware of the common pitfalls.
* Events.
Running :mod:`~celery.bin.celeryd` with the :option:`-E`/:option:`--events`
option will send messages for events happening inside of the worker.
Events should only be enabled if you have an active monitor consuming them,
or if you purge the event queue periodically.
* AMQP backend results.
When running with the AMQP result backend, every task result will be sent
as a message. If you don't collect these results, they will build up and
RabbitMQ will eventually run out of memory.
Results expire after 1 day by default. It may be a good idea
to lower this value by configuring the :setting:`CELERY_TASK_RESULT_EXPIRES`
setting.
If you don't use the results for a task, make sure you set the
`ignore_result` option:
.. code-block python
@celery.task(ignore_result=True)
def mytask():
...
class MyTask(Task):
ignore_result = True
.. _faq-use-celery-with-stomp:
Can I use Celery with ActiveMQ/STOMP?
-------------------------------------
**Answer**: No. It used to be supported by Carrot,
but is not currently supported in Kombu.
.. _faq-non-amqp-missing-features:
What features are not supported when not using an AMQP broker?
--------------------------------------------------------------
This is an incomplete list of features not available when
using the virtual transports:
* Remote control commands (supported only by Redis).
* Monitoring with events may not work in all virtual transports.
* The `header` and `fanout` exchange types
(`fanout` is supported by Redis).
.. _faq-tasks:
Tasks
=====
.. _faq-tasks-connection-reuse:
How can I reuse the same connection when applying tasks?
--------------------------------------------------------
**Answer**: See the :setting:`BROKER_POOL_LIMIT` setting.
The connection pool is enabled by default since version 2.5.
.. _faq-sudo-subprocess:
Sudo in a :mod:`subprocess` returns :const:`None`
-------------------------------------------------
There is a sudo configuration option that makes it illegal for process
without a tty to run sudo::
Defaults requiretty
If you have this configuration in your :file:`/etc/sudoers` file then
tasks will not be able to call sudo when celeryd is running as a daemon.
If you want to enable that, then you need to remove the line from sudoers.
See: http://timelordz.com/wiki/Apache_Sudo_Commands
.. _faq-deletes-unknown-tasks:
Why do workers delete tasks from the queue if they are unable to process them?
------------------------------------------------------------------------------
**Answer**:
The worker rejects unknown tasks, messages with encoding errors and messages
that doesn't contain the proper fields (as per the task message protocol).
If it did not reject them they could be redelivered again and again,
causing a loop.
Recent versions of RabbitMQ has the ability to configure a dead-letter
queue for exchange, so that rejected messages is moved there.
.. _faq-execute-task-by-name:
Can I execute a task by name?
-----------------------------
**Answer**: Yes. Use :func:`celery.execute.send_task`.
You can also execute a task by name from any language
that has an AMQP client.
>>> from celery.execute import send_task
>>> send_task("tasks.add", args=[2, 2], kwargs={})
<AsyncResult: 373550e8-b9a0-4666-bc61-ace01fa4f91d>
.. _faq-get-current-task-id:
How can I get the task id of the current task?
----------------------------------------------
**Answer**: The current id and more is available in the task request::
@celery.task()
def mytask():
cache.set(mytask.request.id, "Running")
For more information see :ref:`task-request-info`.
.. _faq-custom-task-ids:
Can I specify a custom task_id?
-------------------------------
**Answer**: Yes. Use the `task_id` argument to :meth:`Task.apply_async`::
>>> task.apply_async(args, kwargs, task_id="...")
Can I use decorators with tasks?
--------------------------------
**Answer**: Yes. But please see note at :ref:`tasks-decorating`.
.. _faq-natural-task-ids:
Can I use natural task ids?
---------------------------
**Answer**: Yes, but make sure it is unique, as the behavior
for two tasks existing with the same id is undefined.
The world will probably not explode, but at the worst
they can overwrite each others results.
.. _faq-task-callbacks:
How can I run a task once another task has finished?
----------------------------------------------------
**Answer**: You can safely launch a task inside a task.
Also, a common pattern is to use callback tasks:
.. code-block:: python
@celery.task()
def add(x, y, callback=None):
result = x + y
if callback:
subtask(callback).delay(result)
return result
@celery.task(ignore_result=True)
def log_result(result, **kwargs):
logger = log_result.get_logger(**kwargs)
logger.info("log_result got: %s" % (result, ))
Invocation::
>>> add.delay(2, 2, callback=log_result.subtask())
See :doc:`userguide/tasksets` for more information.
.. _faq-cancel-task:
Can I cancel the execution of a task?
-------------------------------------
**Answer**: Yes. Use `result.revoke`::
>>> result = add.apply_async(args=[2, 2], countdown=120)
>>> result.revoke()
or if you only have the task id::
>>> from celery import current_app as celery
>>> celery.control.revoke(task_id)
.. _faq-node-not-receiving-broadcast-commands:
Why aren't my remote control commands received by all workers?
--------------------------------------------------------------
**Answer**: To receive broadcast remote control commands, every worker node
uses its host name to create a unique queue name to listen to,
so if you have more than one worker with the same host name, the
control commands will be received in round-robin between them.
To work around this you can explicitly set the host name for every worker
using the :option:`--hostname` argument to :mod:`~celery.bin.celeryd`::
$ celeryd --hostname=$(hostname).1
$ celeryd --hostname=$(hostname).2
etc., etc...
.. _faq-task-routing:
Can I send some tasks to only some servers?
--------------------------------------------
**Answer:** Yes. You can route tasks to an arbitrary server using AMQP,
and a worker can bind to as many queues as it wants.
See :doc:`userguide/routing` for more information.
.. _faq-change-periodic-task-interval-at-runtime:
Can I change the interval of a periodic task at runtime?
--------------------------------------------------------
**Answer**: Yes. You can use the Django database scheduler, or you can
override `PeriodicTask.is_due` or turn `PeriodicTask.run_every` into a
property:
.. code-block:: python
class MyPeriodic(PeriodicTask):
def run(self):
# ...
@property
def run_every(self):
return get_interval_from_database(...)
.. _faq-task-priorities:
Does celery support task priorities?
------------------------------------
**Answer**: No. In theory, yes, as AMQP supports priorities. However
RabbitMQ doesn't implement them yet.
The usual way to prioritize work in Celery, is to route high priority tasks
to different servers. In the real world this may actually work better than per message
priorities. You can use this in combination with rate limiting to achieve a
highly responsive system.
.. _faq-acks_late-vs-retry:
Should I use retry or acks_late?
--------------------------------
**Answer**: Depends. It's not necessarily one or the other, you may want
to use both.
`Task.retry` is used to retry tasks, notably for expected errors that
is catchable with the `try:` block. The AMQP transaction is not used
for these errors: **if the task raises an exception it is still acknowledged!**.
The `acks_late` setting would be used when you need the task to be
executed again if the worker (for some reason) crashes mid-execution.
It's important to note that the worker is not known to crash, and if
it does it is usually an unrecoverable error that requires human
intervention (bug in the worker, or task code).
In an ideal world you could safely retry any task that has failed, but
this is rarely the case. Imagine the following task:
.. code-block:: python
@celery.task()
def process_upload(filename, tmpfile):
# Increment a file count stored in a database
increment_file_counter()
add_file_metadata_to_db(filename, tmpfile)
copy_file_to_destination(filename, tmpfile)
If this crashed in the middle of copying the file to its destination
the world would contain incomplete state. This is not a critical
scenario of course, but you can probably imagine something far more
sinister. So for ease of programming we have less reliability;
It's a good default, users who require it and know what they
are doing can still enable acks_late (and in the future hopefully
use manual acknowledgement)
In addition `Task.retry` has features not available in AMQP
transactions: delay between retries, max retries, etc.
So use retry for Python errors, and if your task is idempotent
combine that with `acks_late` if that level of reliability
is required.
.. _faq-schedule-at-specific-time:
Can I schedule tasks to execute at a specific time?
---------------------------------------------------
.. module:: celery.task.base
**Answer**: Yes. You can use the `eta` argument of :meth:`Task.apply_async`.
Or to schedule a periodic task at a specific time, use the
:class:`celery.schedules.crontab` schedule behavior:
.. code-block:: python
from celery.schedules import crontab
from celery.task import periodic_task
@periodic_task(run_every=crontab(hour=7, minute=30, day_of_week="mon"))
def every_monday_morning():
print("This is run every Monday morning at 7:30")
.. _faq-safe-worker-shutdown:
How do I shut down `celeryd` safely?
--------------------------------------
**Answer**: Use the :sig:`TERM` signal, and the worker will finish all currently
executing jobs and shut down as soon as possible. No tasks should be lost.
You should never stop :mod:`~celery.bin.celeryd` with the :sig:`KILL` signal
(:option:`-9`), unless you've tried :sig:`TERM` a few times and waited a few
minutes to let it get a chance to shut down. As if you do tasks may be
terminated mid-execution, and they will not be re-run unless you have the
`acks_late` option set (`Task.acks_late` / :setting:`CELERY_ACKS_LATE`).
.. seealso::
:ref:`worker-stopping`
.. _faq-daemonizing:
How do I run celeryd in the background on [platform]?
-----------------------------------------------------
**Answer**: Please see :ref:`daemonizing`.
.. _faq-django:
Django
======
.. _faq-django-database-tables:
What purpose does the database tables created by django-celery have?
--------------------------------------------------------------------
Several database tables are created by default, these relate to
* Monitoring
When you use the django-admin monitor, the cluster state is written
to the ``TaskState`` and ``WorkerState`` models.
* Periodic tasks
When the database-backed schedule is used the periodic task
schedule is taken from the ``PeriodicTask`` model, there are
also several other helper tables (``IntervalSchedule``,
``CrontabSchedule``, ``PeriodicTasks``).
* Task results
The database result backend is enabled by default when using django-celery
(this is for historical reasons, and thus for backward compatibility).
The results are stored in the ``TaskMeta`` and ``TaskSetMeta`` models.
*these tables are not created if another result backend is configured*.
.. _faq-windows:
Windows
=======
.. _faq-windows-worker-spawn-loop:
celeryd keeps spawning processes at startup
-------------------------------------------
**Answer**: This is a known issue on Windows.
You have to start celeryd with the command::
$ python -m celery.bin.celeryd
Any additional arguments can be appended to this command.
See http://bit.ly/bo9RSw
.. _faq-windows-worker-embedded-beat:
The `-B` / `--beat` option to celeryd doesn't work?
----------------------------------------------------------------
**Answer**: That's right. Run `celerybeat` and `celeryd` as separate
services instead.
.. _faq-windows-django-settings:
`django-celery` can't find settings?
--------------------------------------
**Answer**: You need to specify the :option:`--settings` argument to
:program:`manage.py`::
$ python manage.py celeryd start --settings=settings
See http://bit.ly/bo9RSw

Просмотреть файл

@ -246,6 +246,8 @@ To install using `easy_install`,::
$ easy_install -U Celery
.. _bundles:
Bundles
-------

Просмотреть файл

@ -2,6 +2,18 @@
Celery - Distributed Task Queue
=================================
Celery aims to be a fast, reliable and flexible, best-of-breed solution
to process vast amounts of messages in a distributed fashion, while
providing operations with the tools to maintain such a system.
Celery is Open Source and licensed under the `BSD License`_.
If you're new to Celery you can get started by following
the :ref:`first-steps` tutorial.
.. _`BSD License`: http://www.opensource.org/licenses/BSD-3-Clause
Contents:
.. toctree::

Просмотреть файл

@ -120,7 +120,6 @@ def clean_readme(options):
def readme(options):
sh("%s contrib/release/sphinx-to-rst.py docs/templates/readme.txt \
> README.rst" % (sys.executable, ))
sh("ln -sf README.rst README")
@task