In this eBook, we expand, augment and curate on concepts initially published on KDnuggets. In addition, we augment the eBook with assets specific to Delta Lake and Apache Spark 2.x, written and presented by leading Spark contributors and members of Spark PMC including:
• Matei Zaharia, the creator of Spark
• Reynold Xin, chief architect
• Michael Armbrust, lead architect behind Spark SQL and Structured Streaming
• Joseph Bradley, one of the drivers behind Spark MLlib and SparkR
• Tathagata Das, lead developer for Structured Streaming