Understanding HDInsight Spark jobs and data through visualizations in the Jupyter Notebook

Monday, April 29, 2019

The Jupyter Notebook on HDInsight Spark clusters is useful when you need to quickly explore data sets, perform trend analysis, or try different machine learning models. Without a way to track the status of Spark jobs and intermediate data, however, data scientists find it difficult to monitor and optimize what they are doing inside the Jupyter Notebook.

Senior Program Manager, Big Data Team

Processing trillions of events per day with Apache Kafka on Azure

Tuesday, February 5, 2019

In the current era, companies generate huge volumes of data every second. Whether it is for business intelligence, user analytics, or operational intelligence, ingesting and analyzing streaming data requires moving that data from its sources to the multiple consumers that are interested in it.

Software Engineer

HDInsight Tools for Visual Studio Code now generally available

Wednesday, January 23, 2019

We are pleased to announce the general availability of Azure HDInsight Tools for Visual Studio Code (VSCode). HDInsight Tools for VSCode gives developers a lightweight, cross-platform code editor for developing HDInsight PySpark and Hive batch jobs and interactive queries.

Principal Program Manager, Big Data Team

HDInsight now supported in Azure CLI as a public preview

Thursday, January 17, 2019

We recently introduced support for HDInsight in Azure CLI as a public preview. With the addition of the new HDInsight command group, you can now utilize all of the features and benefits that come with the familiar cross-platform Azure CLI to manage your HDInsight clusters.

Program Manager, Azure HDInsight

Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update

Monday, December 10, 2018

Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen2. This integration will enable HDInsight customers to securely drive analytics from the data stored in Azure Data Lake Storage Gen2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase.

Principal Program Manager, Azure HDInsight