AzureStack-QuickStart-Templ.../hadoop-cluster
Jeffrey Robinson 35c70483ae Hadoop Support multiple data disks (#416)
* Remove apiVersion, use mikes RAID code.

* Add numberDataDisks to parameters.json

* resource deployments need api version

* Worker DataDisk names

* Naming bug

* -_-

* Type

* limits were not set correctly.
2019-02-06 15:09:51 -08:00
..
config Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
scripts Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
README.md Hadoop Quickstart Template (#366) 2018-09-06 13:30:59 -07:00
hadoop_cluster.json Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
jumpbox_node.json Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
master_node.json Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
parameters.json Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00
worker_node.json Hadoop Support multiple data disks (#416) 2019-02-06 15:09:51 -08:00

README.md

Hadoop Cluster

ARM Templates

Dependencies

This template requires Ubuntu 16.04LTS and Custom Script 2.0 for Linux to function.

hadoop_cluster.json

This ARM template will setup your Hadoop cluster along with a jump box used to transfer data.

jumpbox_node.json

This ARM template will deploy the jumpbox which is used to manage and copy date to your Hadoop cluster.

master_node.json

This ARM template will deploy the Hadoop master nodes with each given a public IP address.

worker_node.json

This ARM template will deploy the Hadoop worker nodes.

Virtual Machine Extensions

Hadoop Cluster Setup

This script will download and install hadoop on each machine. This includes

  • Install Hadoop pre-requisites
  • Download and extract Hadoop
  • Create user accounts

Jumpbox Setup

Hadoop Node Setup Script

  • Log into each machine and
    • Configure Hadoop for that node type
    • Copy SSH keys to other nodes
    • Add Hadoop to startup
    • Restart VMs to finalize installation