Граф коммитов

22 Коммитов

Автор SHA1 Сообщение Дата
Cameron Dawson 60fa75d4ee
Bug 1646753 - Increase size of job_type.name field (#6539) 2020-06-23 16:34:53 -07:00
Sarah Clements d598ad4544
Bug 1603249 - Remove job artifacts storage and TinderboxPrint parsing (#6349)
* remove artifact storage in the job loader
* remove BuildbotJobArtifactBuilder and TinderboxPrint parser
* update pulse-job schema and tests
2020-05-07 09:54:06 -07:00
Armen Zambrano G 7f04a91af7 Revert "Bug 1395254 - Consume Taskcluster jobs from standard queue exchanges (#5042)"
This reverts commit 75dc375a93.
2019-07-23 16:24:13 -04:00
Armen Zambrano 75dc375a93
Bug 1395254 - Consume Taskcluster jobs from standard queue exchanges (#5042)
* Bug 1395254 - Consume Taskcluster Pulse messages from standard queue exchanges

Currently, Treeherder consumes Pulse messages from an intermediary service called `taskcluster-treeherder`.
Such service needs to be shut down and its functionality imported into Treeherder.

In order to do this we need to switch to the standard Taskcluster exchanges as defined in here:
https://docs.taskcluster.net/docs/reference/platform/queue/exchanges

On a first pass we are only including the code from `taskcluster-treeherder` without changing
much of Treeherder's code. The code is translated from Javascript to Python and only some minor
code changes were done to reduce the difficulty on porting the code without introducing bugs.
Internally, on this first pass, we will still have an intermediary data structure representing
what `taskcluster-treeherder` is emitting, however, we will stop consuming the messages
from it and be able to shut it down.

Instead of consuming from one single exchange we will be consuming multiple ones. Each one representing
a different kind of task (e.g. pending vs running).

In order to test this change you need to open 4 terminal windows and follow these steps:

* On the first two windows `export PULSE_URL="amqp://foo:bar@pulse.mozilla.org:5671/?ssl=1"` and run the following commands:
  * `docker-compose run -e PULSE_URL backend ./manage.py pulse_listener_jobs`
  * `docker-compose run -e PULSE_URL backend ./manage.py pulse_listener_pushes`
* On the third window run `docker-compose run backend celery -A treeherder worker -B --concurrency 5`
* On the last window run `docker-compose up`
* Open on your browser `http://localhost:5000`

This is just a summary from [the docs](https://treeherder.readthedocs.io/pulseload.html).

= ETL management commands =
This change also introduces two ETL management command that can be executed like this:

== Ingest push and tasks ==

This script can ingest into Treeherder all tasks associated to a push.
It uses Python's asyncio to speed up the ingestion of tasks.

```bash
./manage.py ingest_push_and_tasks
```

== Update Pulse test fixtures ==

```bash
./manage.py update_pulse_test_fixtures
```

This command will read 100 Taskcluster Pulse messages, process them and store them as test fixtures
under these two files: `tests/sample_data/pulse_consumer/taskcluster_{jobs,metadata}.json`

Following this work would be to get rid of the intermediary job representation ([bug 1560596](https://bugzilla.mozilla.org/show_bug.cgi?id=1560596) which will
clean up some of the code and some of the old tests.

= Other changes =

Other changes included:

* Import `taskcluster-treeherder`'s validation to ensure we're not fed garbage.
* Change `yaml.load(f)` for `yaml.load(f, Loader=yaml.FullLoader)`. Read [this](https://github.com/yaml/pyyaml/wiki/PyYAML-yaml.load(input)-Deprecation) for details
* Introduce `taskcluster` and `taskcluster-urls` as dependencies
* The test `test_retry_missing_revision_never_succeeds` makes no sense because
we make Json validation on the Pulse message
2019-07-23 13:14:18 -04:00
Ed Morley 96fb922072 Bug 1522101 - Use Prettier to format YAML 2019-01-23 20:19:04 +00:00
Simon Sapin 27eb1af2a6 Fix schema descriptions for GitHub jobs (#4327) 2018-11-30 08:20:00 +00:00
Ed Morley 445766d958
Bug 1443251 - Remove support for buildbot job ingestion (#4087)
The buildapi celerybeat tasks were disabled previously in #4007, so
these tasks are unused.
2018-10-02 11:07:27 +01:00
Dustin J. Mitchell a6411c3538
Use $id instead of id (#3949)
This schema is a version-6 schema, which uses $id instead of id.
2018-08-27 13:06:07 -04:00
Pete Moore 3be9568cba Applied changes from taskcluster-treeherder fork of pulse-job.yml 2018-04-26 11:23:05 -07:00
Pete Moore 07d274dd49 Cleaned up titles in pulse-job.yml to aid downstream client generation (#3464) 2018-04-25 10:10:12 -07:00
Ed Morley f7f38ef4d0
Bug 1257602 - Remove support for revision_hash (#2983)
Now that no submissions are using revision_hash, it can be removed.
This removes everything but the model field, which will be handled
later.

I've removed revision_hash from the Pulse jobs schema without bumping
the version, which wouldn't normally be ok, but no one is still using
it, and I'd rather have explicit failures later than if we left the
schema unchanged.
2017-11-28 00:09:17 +00:00
Cameron Dawson 75e2766e7b Bug 1400069 - Ingest superseded result value from pulse 2017-09-29 17:18:42 -07:00
Dave Hunt ea90972662 Bug 1352403 - Replace invalid "url" format with "uri" (#2307) 2017-04-04 11:56:32 -07:00
camd 4cc00cf784 Bug 1278711 - Set pulse schema jobdetails field lengths higher (#1665)
This sets the field lengths to what they will be in a later PR for
the job_details model.  But these are still within the constraints
of the current field lengths for that table.
2016-07-08 14:33:57 -07:00
camd 8e3e5b7518 Bug 1283865 - remove pattern requirements for pulse job display fields (#1639)
Several existing jobs are already out of compliance with these patterns
and there is no existing way to tell task definition developers how to
comply with our required patterns.

created bug 1283866 in Taskcluster for that tool/workflow

This removes the pattern requirements.  If we ever decide that we DO
need these patterns, we can create that tool and then fix old task
definitions to comply.
2016-07-01 08:23:51 -07:00
camd 128ad99190 Bug 1280306 - Add buildSystem field to pulse-job.yml (#1600) 2016-06-15 14:19:06 -07:00
Cameron Dawson ac30956a56 Bug 1277955 - Add support for revision_hash to pulse jobs
Some repos are longer-lived and do not yet have the Task Cluster
code that allows them to submit tasks with a revision.  They only
have the older code to submit revision_hash.  This prevents the
jobs from being ingested via Pulse.  This commit adds support
for revision_hash until a time when it's no longer needed.
2016-06-07 11:31:38 -07:00
Cameron Dawson f98bf3f8b0 Bug 1266229 - Fix Pulse job ingestion prior to enabling it
This contains several tweaks and fixes that allow us to ingest data from
a real Task Cluster owned exchange.

One of the main fixes is the way I was binding to the exchange and
routing keys with the same Queue.  Before, it was re-creating the Queue,
so would miss some of the bindings.

This will also prune the durable queue if the config has removed some
exchanges and/or routing keys.
2016-05-25 12:33:33 -07:00
Cameron Dawson be37805e08 Bug 1266229 - Pulse ingestion fixes and updates
These changes were discovered to be needed after direct testing
against Task Cluster Pulse exchanges.

-Made some JSON/YML schema changes to be more precise for several fields
-Modified to job_loader to be more resilient to optional data being
  missing
2016-05-02 09:39:44 -07:00
Cameron Dawson 825913ece4 Bug 1244325 - adjust the pulse job YAML schema to accommodate task cluster needs 2016-02-22 13:39:53 -08:00
Cameron Dawson 53bc948f94 Bug 1079796 - store long and short revisions
We store both long and short, but only utilize the short (as before).  We
need to populate all the short and long revision records before we can
start using them.  So after this commit, we will begin backfilling the
old records that don't yet have those values populated.  Once they all
are, we can move to using the long_revision primarily in Bug 1199364.
2015-10-21 13:02:01 -07:00
Cameron Dawson d23435f8ed Bug 1169320, 1199506 - Adds requirements, settings and JSON schema for Pulse Ingestion 2015-10-01 15:50:12 -07:00