Scripts for installing HPC libraries and tools to be used in Azure HPC/AI images
Li Tan bfe9d60495
Merge pull request #389 from darkwhite29/master
Onboard Azure Linux to functional tests
2024-10-25 17:05:56 -04:00
alma Bug fixes (#387) 2024-10-17 19:38:34 -07:00
azure-pipelines
common
customizations
tests Onboard Azure Linux to functional tests (updated) 2024-10-25 14:08:21 -04:00
tools/torset-tool
topology
ubuntu Bug fixes (#387) 2024-10-17 19:38:34 -07:00
.gitattributes
.gitignore
CODE_OF_CONDUCT.md
LICENSE
README.md Update Alma version in README.md (#388) 2024-10-23 12:39:45 -07:00
SECURITY.md
azure-pipelines.yml
versions.json Bug fixes (#387) 2024-10-17 19:38:34 -07:00

README.md

Build Status

OS Version      Status Badge
Ubuntu 20.04    Build Status
Ubuntu 22.04    Build Status
AlmaLinux 8.10  Build Status

Azure HPC/AI VM Images

This repository contains scripts for installing High-Performance Computing (HPC) and Artificial Intelligence (AI) libraries and tools. The scripts are used to build Azure HPC/AI VM images, and you can use them to build your own images for compute-intensive or AI workloads.

The following HPC/AI VM images are currently supported and available in Azure Marketplace:

  • Ubuntu-HPC 22.04 (microsoft-dsvm:ubuntu-hpc:2204:latest)
  • Ubuntu-HPC 20.04 (microsoft-dsvm:ubuntu-hpc:2004:latest)
  • AlmaLinux-HPC 8.10 (almalinux:almalinux-hpc:8_10-hpc-gen2:latest)

How to Use

The high-level steps to create your own HPC image using this repository are:

  1. Deploy a VM (tutorial).
  2. Run install.sh (pick the corresponding install.sh in our repository for your OS, e.g., Ubuntu 22.04).
  3. Generate an image from the VM (tutorial).
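
The three steps above can be sketched with the Azure CLI. This is a rough outline, not the procedure from the linked tutorials: the resource group, VM name, image name, base image, and VM size are all placeholder assumptions, and step 2 is left as a manual action because the path to install.sh depends on your OS. The script only prints the commands unless DRY_RUN is set to 0.

```shell
#!/bin/sh
# Sketch: deploy a VM, run install.sh on it, then capture an image.
# Every value below is a hypothetical placeholder; override via environment.
RESOURCE_GROUP="${RESOURCE_GROUP:-my-hpc-rg}"
VM_NAME="${VM_NAME:-hpc-build-vm}"
IMAGE_NAME="${IMAGE_NAME:-my-hpc-image}"
BASE_IMAGE="${BASE_IMAGE:-<base-image-urn>}"   # base OS image to start from
VM_SIZE="${VM_SIZE:-<vm-size>}"                # an HPC/AI VM size of your choice
DRY_RUN="${DRY_RUN:-1}"                        # 1 = print commands only

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

build_image() {
  # Step 1: deploy a VM.
  run az vm create --resource-group "$RESOURCE_GROUP" --name "$VM_NAME" \
    --image "$BASE_IMAGE" --size "$VM_SIZE" --generate-ssh-keys

  # Step 2 (manual): SSH into the VM, clone this repository, and run the
  # install.sh that matches your OS. Before capturing, deprovision the VM
  # (for example with: sudo waagent -deprovision+user).
  echo "# manual step: run install.sh on $VM_NAME, then deprovision it"

  # Step 3: generate an image from the VM.
  run az vm deallocate --resource-group "$RESOURCE_GROUP" --name "$VM_NAME"
  run az vm generalize --resource-group "$RESOURCE_GROUP" --name "$VM_NAME"
  run az image create --resource-group "$RESOURCE_GROUP" --name "$IMAGE_NAME" \
    --source "$VM_NAME"
}

build_image
```

Review the printed commands first, then rerun with DRY_RUN=0 once the placeholders are replaced with real values.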

Kernel Update/Patching

OS kernel updates generally break compatibility with the HPC components we install, e.g., Lustre. For this reason, the kernel is excluded from updates in our HPC images.

We do this because many of the installed components depend on a specific kernel version. Kernel updates are therefore discouraged on our HPC images.
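
As an illustration of what excluding the kernel from updates can look like, the sketch below prints a distro-specific command. It is an assumption about one common approach, not necessarily the mechanism these images use: the helper name kernel_hold_hint is hypothetical, and the Ubuntu package names assume the usual linux-image-<release> and linux-headers-<release> naming.

```shell
#!/bin/sh
# Sketch: print (do not run) a command that avoids upgrading the kernel.
# $1 = distro ID as found in /etc/os-release, $2 = kernel release (uname -r).
kernel_hold_hint() {
  case "$1" in
    ubuntu)
      # apt-mark hold keeps the named packages at their installed version.
      echo "sudo apt-mark hold linux-image-$2 linux-headers-$2"
      ;;
    almalinux)
      # dnf's --exclude option skips matching packages for this transaction.
      echo "sudo dnf upgrade --exclude='kernel*'"
      ;;
    *)
      echo "unsupported distro: $1" >&2
      return 1
      ;;
  esac
}
```

On a running system, the arguments could come from the OS itself, for example `. /etc/os-release; kernel_hold_hint "$ID" "$(uname -r)"`.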

Our primary release cadence for HPC images is quarterly. If a security issue is flagged between releases, we apply the patch and release an ad hoc hotfix, typically within a week or two.

Please keep using our latest HPC images. If you identify any compliance issues (e.g., security bugs), please report them to us, along with patches if you have any. We will apply the fix and release the patched images as a hotfix.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.