Skip to main content

 Subscribe
Ruixin Xu

Ruixin Xu

Senior Program Manager, Big Data Team

Latest posts

Showing 1 – 1 of 1 posts found

Published • 2 min read

Understanding HDInsight Spark jobs and data through visualizations in the Jupyter Notebook 

The Jupyter Notebook on HDInsight Spark clusters is useful when you need to quickly explore data sets, perform trend analysis, or try different machine learning models. Not being able to track the status of Spark jobs and intermediate data can make it difficult for data scientists to monitor and optimize what they are doing inside the Jupyter Notebook.