Microsoft Academic Graph PySpark Samples
PySpark examples to analyze sample Microsoft Academic Graph Data on Azure storage.
Before running these examples, you need the following in place:
- An Azure HDInsight cluster with Spark
- Access to Microsoft Academic Graph data

To run an example:
- git clone https://github.com/Azure-Samples/microsoft-academic-graph-pyspark-samples.git
- cd microsoft-academic-graph-pyspark-samples/src
- python path-to-one-source-file
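
As a rough illustration of what a sample in this repository might do, the sketch below builds a `wasbs://` URL for a file in Azure Blob Storage and reads it into a Spark DataFrame. It is not taken from the repository: the helper names `mag_stream_url` and `load_stream`, the account and container names, and the `mag/Papers.txt` path are all hypothetical, and the reader options assume the file is tab-separated with no header row.

```python
def mag_stream_url(account, container, stream):
    """Build a wasbs:// URL for one file in an Azure Blob Storage container."""
    return "wasbs://{}@{}.blob.core.windows.net/{}".format(
        container, account, stream.lstrip("/")
    )


def load_stream(spark, url, columns):
    """Read a headerless, tab-separated text file and name its columns.

    `spark` is an existing SparkSession. `columns` must list exactly as many
    names as the file has fields, otherwise toDF() raises an error.
    """
    df = spark.read.format("csv").options(header="false", delimiter="\t").load(url)
    return df.toDF(*columns)


# Hypothetical account, container, and path; replace with your own values.
url = mag_stream_url("myaccount", "mag-container", "mag/Papers.txt")
print(url)

# On a cluster with an active SparkSession named `spark`:
# papers = load_stream(spark, url, columns)  # columns: names matching the file's schema
```

Passing the session in as an argument keeps the helpers importable on a machine without Spark installed; only the commented-out call needs a running cluster.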