История

Jeffrey Robinson 35c70483ae Hadoop Support multiple data disks (#416 ) * Remove apiVersion, use mikes RAID code. * Add numberDataDisks to parameters.json * resource deployments need api version * Worker DataDisk names * Naming bug * -_- * Type * limits were not set correctly.		2019-02-06 15:09:51 -08:00
..
config	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
scripts	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
README.md	Hadoop Quickstart Template (#366 )	2018-09-06 13:30:59 -07:00
hadoop_cluster.json	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
jumpbox_node.json	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
master_node.json	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
parameters.json	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00
worker_node.json	Hadoop Support multiple data disks (#416 )	2019-02-06 15:09:51 -08:00

Hadoop Cluster

ARM Templates

This template requires Ubuntu 16.04LTS and Custom Script 2.0 for Linux to function.

This ARM template will setup your Hadoop cluster along with a jump box used to transfer data.

This ARM template will deploy the jumpbox which is used to manage and copy date to your Hadoop cluster.

This ARM template will deploy the Hadoop master nodes with each given a public IP address.

This ARM template will deploy the Hadoop worker nodes.

This script will download and install hadoop on each machine. This includes

Hadoop Node Setup Script

Log into each machine and
- Configure Hadoop for that node type
- Copy SSH keys to other nodes
- Add Hadoop to startup
- Restart VMs to finalize installation