Simplify HPC and Batch workloads on Azure
Перейти к файлу
Fred Park 60b4fc446f Support ARM Images for custom images (#126) 2017-10-03 10:05:17 -07:00
cascade Tag for 2.9.4 release 2017-09-12 08:56:03 -07:00
config_templates Support ARM Images for custom images (#126) 2017-10-03 10:05:17 -07:00
contrib Add deep learning notebooks to contrib (#111) 2017-08-14 08:54:01 -07:00
convoy Support ARM Images for custom images (#126) 2017-10-03 10:05:17 -07:00
docker Fix third party notices typo 2017-09-12 13:42:37 -07:00
docs Support ARM Images for custom images (#126) 2017-10-03 10:05:17 -07:00
recipes Update all recipes to use YAML (#122) 2017-10-03 10:04:03 -07:00
resources Azure file dvd support for all supported hosts 2016-08-29 12:12:50 -07:00
rjm Tag for 2.9.4 release 2017-09-12 08:56:03 -07:00
scripts Native container support 2017-10-03 10:03:20 -07:00
site-extension Tag for 2.9.2 release 2017-08-16 08:59:53 -07:00
tfm Change autogen task id property to complex 2017-08-08 08:45:15 -07:00
.gitattributes Add AppVeyor build 2017-08-10 10:29:22 -07:00
.gitignore Allow CentOS 7.3 on NC/NV 2017-07-06 11:12:05 -07:00
.travis.yml Misc doc updates 2017-08-10 08:40:07 -07:00
CHANGELOG.md Support ARM Images for custom images (#126) 2017-10-03 10:05:17 -07:00
CODE_OF_CONDUCT.md Update docs 2017-08-29 08:04:30 -07:00
CONTRIBUTING.md Update docs 2017-08-29 08:04:30 -07:00
LICENSE Add dummy README 2016-07-18 08:15:56 -07:00
README.md Add Read the Docs build 2017-10-03 09:53:32 -07:00
THIRD_PARTY_NOTICES.txt Fix third party notices typo 2017-09-12 13:42:37 -07:00
appveyor.yml Missing join_by function in blobxfer helper 2017-08-31 10:28:22 -07:00
install.cmd Add conda-forge detection to windows script 2017-06-26 07:58:28 -07:00
install.sh Default install.sh to python3 2017-10-03 10:03:20 -07:00
mkdocs.yml Add Read the Docs build 2017-10-03 09:53:32 -07:00
requirements.txt Add ARM VNet support in Batch service mode (#126) 2017-10-03 10:05:17 -07:00
shipyard.py Add YAML configuration support 2017-10-03 10:04:03 -07:00

README.md

Build Status Build status Docker Pulls Image Layers

Batch Shipyard

Batch Shipyard is a tool to help provision and execute batch processing and HPC Docker workloads on Azure Batch compute pools. No experience with the Azure Batch SDK is needed; run your Dockerized tasks with easy-to-understand configuration files!

Additionally, Batch Shipyard provides the ability to provision and manage entire standalone remote file systems (storage clusters) in Azure, independent of any integrated Azure Batch functionality.

Batch Shipyard is now integrated directly into Azure Cloud Shell and you can execute any Batch Shipyard workload using your web browser or the Microsoft Azure Android and iOS app.

Major Features

  • Automated Docker Host Engine installation tuned for Azure Batch compute nodes
  • Automated deployment of required Docker images to compute nodes
  • Accelerated Docker image deployment at scale to compute pools consisting of a large number of VMs via private peer-to-peer distribution of Docker images among the compute nodes
  • Comprehensive data movement support: move data easily between locally accessible storage systems, remote filesystems, Azure Blob or File Storage, and compute nodes
  • Docker Private Registry support
  • Standalone Remote Filesystem Provisioning with integration to auto-link these filesystems to compute nodes with support for
  • Automatic shared data volume support
  • Seamless integration with Azure Batch job, task and file concepts along with full pass-through of the Azure Batch API to containers executed on compute nodes
  • Support for Low Priority Compute Nodes
  • Support for pool autoscale and autopool to dynamically scale and control computing resources on-demand
  • Support for Task Factories with the ability to generate tasks based on parametric (parameter) sweeps, randomized input, file enumeration, replication, and custom Python code-based generators
  • Support for Azure Batch task dependencies allowing complex processing pipelines and DAGs with Docker containers
  • Transparent support for GPU-accelerated Docker applications on Azure N-Series VM instances
  • Support for multi-instance tasks to accommodate Dockerized MPI and multi-node cluster applications on compute pools with automatic job completion and Docker task termination
  • Transparent assist for running Docker containers utilizing Infiniband/RDMA for MPI on HPC low-latency Azure VM instances:
  • Support for job schedules and recurrences for automatic execution of tasks at set intervals
  • Support for live job and job schedule migration between pools
  • Automatic setup of SSH users to all nodes in the compute pool and optional tunneling to Docker Hosts on compute nodes
  • Support for credential management through Azure KeyVault
  • Support for execution on an Azure Function App environment
  • Support for custom host images

Installation

Azure Cloud Shell

Batch Shipyard is now integrated into Azure Cloud Shell with no installation required. Simply request a Cloud Shell session and type shipyard to invoke the CLI.

Local Installation

Installation is typically an easy two-step process. The CLI is also available as a Docker image: alfpark/batch-shipyard:cli-latest. Please see the installation guide for more information regarding installation and requirements.

Documentation and Recipes

Please refer to the Batch Shipyard Documentation on Read the Docs.

Visit the Batch Shipyard Recipes section for various sample Docker workloads using Azure Batch and Batch Shipyard.

Batch Shipyard Compute Node OS Support

Batch Shipyard is currently compatible with supported Marketplace Linux VMs and Linux custom images supported by Azure Batch.

Change Log

See the CHANGELOG.md file.


Please see this project's Code of Conduct and Contributing guidelines.