telemetry-batch-view/docs/CrashSummary.md

739 B

The Crash Summary dataset

The Crash Summary dataset is generated by src/main/scala/com/mozilla/telemetry/views/CrashSummaryView.scala.

Generating the dataset

For distributed execution, we can build a self-contained JAR file, then run it with Spark. For example, to generate the main_summary dataset for April 12, 2016 to April 28, 2016, and storing the resulting data in an s3 bucket called example_bucket:

sbt assembly
spark-submit \
    --master yarn \
    --deploy-mode client \
    --class com.mozilla.telemetry.views.CrashSummaryView \
    target/scala-2.11/telemetry-batch-view-1.1.jar \
    --outputBucket example_bucket \
    --from 20160412 \
    --to 20160428