Apache Spark 

Course Description:

This course provides a comprehensive introduction to Apache Spark, a powerful framework for large-scale data processing. Spark offers significant performance improvements over traditional MapReduce approaches. You'll gain hands-on experience writing both batch processing and streaming applications using Spark.

Target Audience:

Developers tasked with building Spark applications. Prior experience with Scala or Python is recommended.

Course Objectives:

Course Length: 3 Days

Course Outline:

Module 1: Introduction to Apache Spark

Module 2: Working with RDDs

Module 3: Parallel Programming with Spark

Module 4: Building Spark Applications

Module 5: Spark Streaming

Module 6: Advanced Spark Topics 


Note: This class leverages the Databricks environment. If your company uses another Spark environment, the workshop can be adapted.