Published • 5 min read
Azure HDInsight now supports Apache Spark 2.3
...serialization in Python UDFs It is worth mentioning that PySpark is already fast and takes advantage of the vectorized data processing in core Spark engine as long as you are using DataFrame APIs. This is good news as it represents majority of the use cases if you follow...