AZTK lets you provision on-demand, GPU-enabled Spark clusters on top of Azure Batch's infrastructure, helping you take your high-performance GPU code and distribute it across your Spark cluster.
Azure Distributed Data Engineering Toolkit - an open-source Python CLI tool that allows you to provision on-demand Spark clusters and submit Spark jobs directly from your CLI.
doAzureParallel's second major release comes with full support for low-priority VMs, letting R users run their R jobs on Azure’s surplus compute capacity at up to an 80% discount. In addition to this capability, we are introducing new features to the package to let users take advantage of Azure Batch’s flexible infrastructure for parameter tuning, data-prep/ETL, and simulation.
We are excited to announce doAzureParallel – a lightweight R package built on top of Azure Batch that allows you to easily use Azure’s flexible compute resources right from your R session.