Spark

Understanding HDInsight Spark jobs and data through visualizations in the Jupyter Notebook

29 апреля 2019 г.

The Jupyter Notebook on HDInsight Spark clusters is useful when you need to quickly explore data sets, perform trend analysis, or try different machine learning models. Not being able to track the status of Spark jobs and intermediate data can make it difficult for data scientists to monitor and optimize what they are doing inside the Jupyter Notebook.

Senior Program Manager, Big Data Team

Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update

10 декабря 2018 г.

Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.

Principal Program Manager, Azure HDInsight

Azure Toolkit for IntelliJ – Spark Interactive Console

15 ноября 2018 г.

We are pleased to reveal the release of Spark Interactive Console in Azure Toolkit for IntelliJ. This new component intends to facilitate your Spark job authoring, and enable you to run code interactively in a shell-like environment within IntelliJ.

Principal Program Manager, Big Data Team

Top 8 reasons to choose Azure HDInsight

18 июня 2018 г.

Household names such as Adobe, Jet, ASOS, Schneider Electric, and Milliman are amongst thousands of enterprises that are powering their Big Data Analytics using Azure HDInsight.

Principal Program Manager, Azure HDInsight