Microsoft Academic Graph PySpark Samples

PySpark examples to analyze sample Microsoft Academic Graph Data on Azure storage.

Getting Started

Prerequisites

Before running these examples, you need to have following setups: - Azure HDInsight cluster with Spark - Access to Microsoft Academic Graph Data

Quickstart

  1. git clone https://github.com/Azure-Samples/microsoft-academic-graph-pyspark-samples.git
  2. cd microsoft-academic-graph-pyspark-samples/src
  3. python path-to-one-source-file

Resources

  • https://docs.microsoft.com/en-us/academic-services/graph/