Mastering Apache Spark (2.3.0)
Welcome to Mastering Apache Spark gitbook! I’m very excited to have you here and hope you will enjoy exploring the internals of Apache Spark (Core) as much as I have.
I write to discover what I know.
I’m Jacek Laskowski, an independent consultant, software developer and technical instructor specializing in Apache Spark, Apache Kafka and Kafka Streams (with Scala, sbt, Kubernetes, DC/OS, Apache Mesos, and Hadoop YARN).
|I’m also writing Mastering Spark SQL, Mastering Kafka Streams, Apache Kafka Notebook and Spark Structured Streaming Notebook gitbooks.|
Expect text and code snippets from a variety of public sources. Attribution follows.
Now, let me introduce you to Apache Spark.