Deploy and initialize a Grafana server

This example shows how to deploy a Grafana server and configure Telegraf on the machines to be monitored.

The configuration file requires the following variables to be set:

| Variable       | Description                                  |
|----------------|----------------------------------------------|
| location       | The location of resources                    |
| resource_group | The resource group for the project           |
| vm_type        | Azure GPU VM full name (NC or ND series)     |
| key_vault      | Keyvault to store the GrafanaPassword secret |
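
As a rough sketch, these variables might be set in the configuration file as follows. The `variables` wrapper is assumed from the `{{variables.key_vault}}` reference used later in this example, and every value in angle brackets is a placeholder to replace with your own:

```json
{
    "variables": {
        "location": "<azure region>",
        "resource_group": "<resource group name>",
        "vm_type": "<NC or ND series VM size>",
        "key_vault": "<key vault name>"
    }
}
```

Only the `variables` block is shown here; the rest of the configuration file is left as provided with the example.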

Note: Create an Azure Key Vault and store the Grafana password in a secret named GrafanaPassword.

Once deployed:

  • Open port 3000 in the NSG of the Grafana server
  • Access the portal through the URL: http://[grafana server fqdn]:3000/
  • Authenticate as the admin user with the password stored in your Key Vault
  • Open the dashboard through the left menu Dashboards/Manage, then select "Telegraf : system dashboard"

Note: To monitor other VMs, add the telegraf tag to your resources and include its associated install script in the configuration file, as specified below:

        {
            "script": "install-telegraf.sh",
            "tag": "telegraf",
            "sudo": true,
            "args": [
                "<grafana server or ip address>",
                "azhpc",
                "secret.{{variables.key_vault}}.GrafanaPassword"
            ]
        }
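
For illustration, tagging a resource so that the install step above applies to it might look like the fragment below. This is a sketch under assumptions: the resource name is a placeholder, the `resources` and `tags` keys follow the layout commonly seen in azhpc configuration files, and all other fields of the resource are omitted for brevity:

```json
{
    "resources": {
        "<your resource name>": {
            "tags": [
                "telegraf"
            ]
        }
    }
}
```

The tag value needs to match the `"tag"` field of the install entry, here `telegraf`.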