The Internals of Spark SQL (Apache Spark 2.4.1)
Welcome to The Internals of Spark SQL gitbook! I’m very excited to have you here and hope you will enjoy exploring the internals of Spark SQL as much as I have.
I write to discover what I know.
I’m Jacek Laskowski, an independent consultant, software developer and technical instructor specializing in Apache Spark, Apache Kafka and Kafka Streams (with Scala, sbt, Kubernetes, DC/OS, Apache Mesos, and Hadoop YARN).
|I’m also writing Mastering Apache Spark, Mastering Kafka Streams, Mastering Apache Kafka and The Internals of Spark Structured Streaming gitbooks.|
Expect text and code snippets from a variety of public sources. Attribution follows.
Now, let me introduce you to Spark SQL.