Important notes and source code of Oreilly book - Learning Spark 2nd Edition
- Introduction to Apache Spark: A Unified Analytics Engine
- Downloading Apache Spark and Getting Started
- Apache Spark’s Structured APIs
- Spark SQL and DataFrames: Introduction to Built-in Data Sources
- Spark SQL and DataFrames: Interacting with External Data Sources
- Spark SQL and Datasets
- Optimizing and Tuning Spark Applications
- Structured Streaming
- Building Reliable Data Lakes with Apache Spark
- Machine Learning with MLlib
- Managing, Deploying, and Scaling Machine Learning Pipelines with Apache Spark
- Epilogue: Apache Spark 3.0