doAzureParallel/docs
Brian Hoang 96bfc22662
Fix: Upgrading to R Batch SDK to 2018-12-01.8.0 (#354)
* Added resource files

* Added resource files

* Removed comments

* Fixed resource files documentation

* Added check on job state

* Fixed jobState
2019-06-18 21:04:30 -07:00
..
00-azure-introduction.md Clarifying comment on DSVM (#274) 2018-05-17 09:39:45 -07:00
01-getting-started.md Added documentation for containerImage (#339) 2019-02-01 11:23:48 -08:00
02-getting-started-script.md Fix: Documentation typos and fixes (#307) 2018-09-12 11:41:52 -07:00
03-national-clouds.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
04-azure-requirements.md Fix: Documentation typos and fixes (#307) 2018-09-12 11:41:52 -07:00
20-package-management.md Feature/custom package (#272) 2018-05-14 13:47:00 -07:00
22-parallelizing-cores.md README update 2017-03-13 21:54:41 +00:00
30-customize-cluster.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
31-vm-sizes.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
32-autoscale.md Feature: Added Queue/Running Autoscale Formula (#316) 2018-10-16 09:05:55 -07:00
33-building-containers.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
40-clusters.md Fix: Documentation typos and fixes (#307) 2018-09-12 11:41:52 -07:00
51-long-running-job.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
52-azure-foreach-options.md Fix Azure foreach options documentation (#288) 2018-07-13 16:24:32 -07:00
53-error-handling.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
71-distributing-data.md Fix: Upgrading to R Batch SDK to 2018-12-01.8.0 (#354) 2019-06-18 21:04:30 -07:00
72-persistent-storage.md Fix: Updating Persistent Storage Documentation (#283) 2018-06-27 11:10:53 -07:00
73-managing-storage.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
80-performance-tuning.md Fix: Updated MaxTasksPerNode documentation (#279) 2018-06-06 23:04:30 -07:00
90-troubleshooting.md Feature: Batch Insights support (#336) 2019-02-04 18:45:03 -08:00
91-quota-limitations.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
92-faq.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00
README.md Documentation rewrite (#273) 2018-05-15 16:22:11 -07:00

README.md

doAzureParallel Guide

This section will provide information about how Azure works, how best to take advantage of Azure, and best practices when using the doAzureParallel package.

  1. Azure Introduction (link)

    Using Azure Batch

  2. Getting Started (link)

    Using the Getting Started to create credentials

    i. Generate Credentials Script (link)

    • Pre-built bash script for getting Azure credentials without Azure Portal

    ii. National Cloud Support (link)

    • How to run workload in Azure national clouds
  3. Customize Cluster (link)

    Setting up your cluster to user's specific needs

    i. Virtual Machine Sizes (link)

    • How do you choose the best VM type/size for your workload?

    ii. Autoscale (link)

    • Automatically scale up/down your cluster to save time and/or money.

    iii. Building Containers (link)

    • Creating your own Docker containers for reproducibility
  4. Managing Cluster (link)

    Managing your cluster's lifespan

  5. Customize Job

    Setting up your job to user's specific needs

    i. Asynchronous Jobs (link)

    • Best practices for managing long running jobs

    ii. Foreach Azure Options (link)

    • Use Azure package-defined foreach options to improve performance and user experience

    iii. Error Handling (link)

    • How Azure handles errors in your Foreach loop?
  6. Package Management (link)

    Best practices for managing your R packages in code. This includes installation at the cluster or job level as well as how to use different package providers.

  7. Storage Management

    i. Distributing your Data (link)

    • Best practices and limitations for working with distributed data.

    ii. Persistent Storage (link)

    • Taking advantage of persistent storage for long-running jobs

    iii. Accessing Azure Storage through R (link)

    • Manage your Azure Storage files via R
  8. Performance Tuning (link)

    Best practices on optimizing your Foreach loop

  9. Debugging and Troubleshooting (link)

    Best practices on diagnosing common issues

  10. Azure Limitations (link)

    Learn about the limitations around the size of your cluster and the number of foreach jobs you can run in Azure.

Additional Documentation

Read our FAQ for known issues and common questions.