AzureDSVM is an R package that offers convenient harness of Azure DSVM, remote execution of scalable and elastic data science work, and monitoring of on-demand resource consumption.
Перейти к файлу
yueguoguo 7d6c858ba8 Merge branch 'master' of https://github.com/Azure/AzureDSVM 2017-08-18 18:38:23 +08:00
R Update Azure VM Extension version 2017-08-18 18:38:19 +08:00
inst/etc Updated Windows DSVM image to Windows Server 2016 2017-08-14 17:04:32 +08:00
man Updated man and NAMESPACE 2017-07-18 11:12:42 +08:00
test Add extension and mrs one-box config 2017-08-14 14:38:51 +08:00
tests Added test items. 2017-06-30 15:34:06 +08:00
vignettes Add extension and mrs one-box config 2017-08-14 14:38:51 +08:00
.Rbuildignore New files to ignore. 2017-02-28 11:22:42 +08:00
.gitignore Remove NAMESPACVE and man from ignore - we should have these 2017-03-22 11:29:53 +08:00
AzureDSVM.Rproj Rewrote vignettes 2017-03-21 15:59:31 +08:00
DESCRIPTION Format 2017-07-04 14:43:54 +08:00
LICENSE Initial commit 2017-02-09 17:18:42 -08:00
Makefile Document new target 2017-06-14 20:49:43 +08:00
NAMESPACE Updated man and NAMESPACE 2017-07-18 11:12:42 +08:00
README.md Update README.md 2017-08-15 14:08:59 +08:00
data.xdf Add extension and mrs one-box config 2017-08-14 14:38:51 +08:00
git.mk Remove info target 2017-03-22 09:39:23 +08:00
r.mk Format R file list. 2017-03-22 09:38:37 +08:00

README.md

AzureDSVM

The AzureDSVM (Azure Data Science Virtual Machine) is an R Package for Data Scientists working with the Azure compute platform as a complement to the underlying AzureSMR for controlling Azure Data Science Virtual Machines.

Azure Data Science Virtual Machine (DSVM) is a powerful data science development environment with pre-installed tools and packages that empower data scientists for convenient data wrangling, model building, and service deployment.

The R package of AzureDSVM aims at offering functions that can be conveniently used by R data scientists for operating and using Azure Data Science Virtual Machine (DSVM) elastically and economically within local R session.

To install the package from github:

devtools::install_github("Azure/AzureDSVM")

Help pages are also provided for all functions within the package. With RStudio for example type AzureDSVM into search when the package is loaded to see a list of functions/help pages or else

library(help=AzureDSVM)

Note: The package will work with any open source R Session or with Microsoft R extensions.

Features

  • Elasiticity

    • Deployment of a DSVM with customized information such as machine name, machine size (with compute/memory optimized general-purpose CPU, Nvidia K80/M60 GPU, etc.), operating system (Windows Server 2016, Ubunbut 16.04, and CentOS), authentication method (public key based or password based), etc.
    • Enjoy all benefits of a Windows/Linux DSVM. E.g., all tools for data science work such as R/Python/Julia programming languages, SQL Server, Visual Studio with RTVS, etc., remote working environment via RStudio Server or Jupyter Notebook interface, and machine learning & artificial intelligence packages such as Microsoft CNTK, MXNet, and XGBoost.
    • Execution of R analytics on DSVM(s) with various Microsoft R Server computing contexts such as "local parallel" and "cluster parallel".
    • Seamless interaction with remote R Server session with mrsdeploy functions.
    • Post-deployment installation of extension for customizing system environment, reinstalling/uninstalling software, etc.
  • Scalability

    • Deployment of a collection of heterogeneous DSVMs for a group of data scientists.
    • Scale up DSVM and form them into a cluster for parallel/distributed computation with Microsoft R Server backend.
  • Usability

    • Deploy, start, stop, and delete DSVM(s) on demand.
    • Monitor data consumption and estimate expense of using DSVM(s) with hourly aggregation granularity.

Tutorials

To get started with this package, see the Vignettes:

Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.