
R Server for HDInsight

Predictive analytics, machine learning, and statistical modeling for big data.

What is R Server for HDInsight?

Microsoft R Server for HDInsight combines enterprise-scale R analytics software with the power of Apache Hadoop and Apache Spark to give you the scale and performance you need. With multi-threaded math libraries and transparent parallelization, R Server handles up to 1,000x more data and runs up to 50x faster than open-source R, which helps you train more accurate models for better predictions. R Server works with the open-source R language, so all of your R scripts run without changes.

  • Large portable R parallel analytics and machine learning library

  • Enterprise-grade security and support backed by a Microsoft SLA

  • Terabyte-scale machine learning—1,000x larger than in open source R

  • Access Spark data sources through Spark SQL

  • Deliver up to 50x faster performance using R Server for Apache Spark 2.0 and optimized vector/math libraries

  • Easy setup for fast results

Work with the power and familiarity of R

A top choice among data scientists, the R programming language has a worldwide community of more than two million users, and the number of open-source analytics packages grows every year. R Server for HDInsight gives you full compatibility with the R language running at scale on Hadoop and Spark.

R usage is on the rise. From 2007 to 2013, the share of data miners who report using R increased from 20% to 70%. From 2008 to 2013, the share of data miners who use R as their primary tool increased from less than 5% to 24%.

The number of CRAN packages released has increased significantly in the last few years. In 2005, there were very few. The number increased to 1,000 by 2012, to 3,000 by 2014, and to over 8,000 by 2016.

Large portable R parallel analytics and machine learning library

Take advantage of a large parallel analytics and machine learning library that is built to work with the open-source R language and is portable across popular data platforms. The library includes decision trees and ensembles, regression models, clustering, data preparation, visualization, and statistical functions.

Terabyte-scale machine learning handles 1,000x more data

With transparent parallelization on top of Hadoop and Spark, R Server for HDInsight lets you handle terabytes of data—1,000x more than the open source R language alone. Train logistic regression models, trees, and ensembles on any amount of data. You’re only limited by the size of your Spark cluster.

Get up to 50x faster performance

Combine Spark, multithreaded vector and matrix math libraries, and R Server for HDInsight to experience up to 50x faster performance than previously possible with open source R.

Run distributed parameter sweeps and simulations with existing R functions

Run any open source R function over hundreds of nodes for parallel parameter sweeps and simulations. Explore and refine your models for faster, easier, and more accurate predictions.
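As a rough illustration of what a parameter sweep looks like, the sketch below fits the same model with several settings of one parameter and compares the results. It uses only base R and the built-in mtcars data, and it runs sequentially on a single machine. The function name fit_degree, the choice of polynomial degree as the swept parameter, and the use of AIC as the score are all assumptions made for this example. The comment at the end shows how the same function might be distributed with RevoScaleR's rxExec and rxElemArg on a cluster; treat that call as a sketch rather than a tested recipe.

```r
# Hypothetical sweep: fit polynomial regressions of several degrees on the
# built-in mtcars data and score each one with AIC (lower is better).
fit_degree <- function(degree) {
  model <- lm(mpg ~ poly(wt, degree), data = mtcars)
  data.frame(degree = degree, aic = AIC(model))
}

degrees <- 1:4

# Local, sequential run: one call per parameter value, results stacked
# into a single data frame.
results <- do.call(rbind, lapply(degrees, fit_degree))
print(results)

# Pick the parameter value with the lowest AIC.
best <- results$degree[which.min(results$aic)]
print(best)

# Assumption: on an R Server cluster with RevoScaleR loaded and a Spark or
# Hadoop compute context set, the same function could be distributed with
#   rxExec(fit_degree, degree = rxElemArg(degrees))
# which returns a list with one result per parameter value.
```

Because each call to fit_degree is independent of the others, the sweep parallelizes naturally: every parameter value can run on a different node, and the results are combined afterward.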

Access Spark data sources through Spark SQL

Analyze data in Hadoop and Spark, using Apache Spark SQL as a data source for R Server. Load the results of a Spark SQL query against sources such as Apache Hive and Apache Parquet to a Spark Data Frame, and analyze it directly using any R Server distributed computing algorithms.

Choose your development tools

R Server on HDInsight includes RStudio Server Community Edition, which makes it easy for you to get started. Download R Tools for Visual Studio for free to get a convenient local development environment.

Enterprise-grade security and support

Rely on enterprise-grade security and support from Azure, which includes version packages, patching, security updates, and continuous cluster monitoring. A Microsoft Service Level Agreement (SLA) with 99.9% connectivity helps to protect your R Server for HDInsight clusters against catastrophic events.

Easy setup, fast results

There’s no time-consuming installation or setup with R Server for HDInsight. Azure does it for you. You’ll be up and running in minutes, ready to train your statistical and machine learning models, without buying new hardware or paying other up-front costs. You only pay for the compute and storage that you use.

Apache Hadoop® and associated open source project names are trademarks of The Apache Software Foundation.

Comprehensive security and compliance, built in

  • Microsoft invests more than $1 billion annually on cybersecurity research and development.

  • We employ more than 3,500 security experts who are dedicated to data security and privacy.

  • Azure has more certifications than any other cloud provider.

Get started with an Azure free account

1. Start free. Get $200 credit to use within 30 days. While you have your credit, get free amounts of many of our most popular services, plus 55+ other services that are always free.

2. After your credit, move to pay as you go to keep building with the same free services. Pay only if you use more than your free monthly amounts.

3. After 12 months, you'll keep getting 55+ always-free services, and still pay only for what you use beyond your free monthly amounts.
