Add introduction to the crash_summary dataset

This is part of the `choosing a dataset` section.
This commit is contained in:
Mauro Doglio 2017-06-08 13:44:46 +01:00
Родитель dad23bee06
Коммит 56d2abbc8d
2 изменённых файлов: 30 добавлений и 3 удалений

Просмотреть файл

@ -54,6 +54,10 @@ This section describes the derived datasets we provide to make analyzing this da
{% include "/datasets/batch_view/crash_aggregates/intro.md" %}
## crash_summary
{% include "/datasets/batch_view/crash_summary/intro.md" %}
# Appendix
## Mobile Metrics

Просмотреть файл

@ -1,3 +1,26 @@
Work in progress.
Work is being tracked
[here](https://bugzilla.mozilla.org/show_bug.cgi?id=1364174).
The `crash_summary` table is the most direct representation of a crash ping.
#### Contents
The `crash_summary` table contains one row for each crash ping.
Each column represents one field from the crash ping payload,
though only a subset of all crash ping fields are included.
#### Accessing the Data
The data is stored as a parquet table in S3 at the following address.
See [this cookbook](/cookbooks/parquet.md) to get started working with the data in Spark.
```
s3://telemetry-parquet/crash_summary/v1/
```
`crash_summary` is accessible through re:dash.
Here's an [example query](https://sql.telemetry.mozilla.org/queries/4793/source).
#### Further Reading
The technical documentation for `crash_summary` is located in the
[telemetry-batch-view documentation](https://github.com/mozilla/telemetry-batch-view/blob/master/docs/CrashSummary.md).
The code responsible for generating this dataset is
[here](https://github.com/mozilla/telemetry-batch-view/blob/master/src/main/scala/com/mozilla/telemetry/views/CrashSummaryView.scala)