Apache Spark Fundamentals Training Course

Apache Spark is an analytics engine designed to distribute data across a cluster in order to process it in parallel. It contains modules for streaming, SQL, machine learning and graph processing.

This instructor-led, live training (online or onsite) is aimed at engineers who wish to deploy Apache Spark system for processing very large amounts of data.

By the end of this training, participants will be able to:

Install and configure Apache Spark.
Understand the difference between Apache Spark and Hadoop MapReduce and when to use which.
Quickly read in and analyze very large data sets.
Integrate Apache Spark with other machine learning tools.

Format of the Course

Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request a customized training for this course, please contact us to arrange.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Testimonials (5)

A lot of practical examples, different ways to approach the same problem, and sometimes not so obvious tricks how to improve the current solution

Rafal - Nordea

Course - Apache Spark MLlib

Sufficient hands on, trainer is knowledgable

Chris Tan

Course - A Practical Introduction to Stream Processing

practice tasks

Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo

Course - Python and Spark for Big Data (PySpark)

The VM I liked very much The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly I liked the facility in Dubai.

Safar Alqahtani - Elm Information Security

Course - Big Data Analytics in Health

This is one of the best hands-on with exercises programming courses I have ever taken.

Laura Kahn

Course - Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

6317 USD (Classroom)

Apache Spark Fundamentals Training Course

Course Outline

Requirements

Testimonials (5)

Rafal - Nordea

Course - Apache Spark MLlib

Chris Tan

Course - A Practical Introduction to Stream Processing

Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo

Course - Python and Spark for Big Data (PySpark)

Safar Alqahtani - Elm Information Security

Course - Big Data Analytics in Health

Laura Kahn

Course - Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

Upcoming Courses

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Apache Spark Fundamentals Training Course

Course Outline

Requirements

Testimonials (5)

Rafal - Nordea

Course - Apache Spark MLlib

Chris Tan

Course - A Practical Introduction to Stream Processing

Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo

Course - Python and Spark for Big Data (PySpark)

Safar Alqahtani - Elm Information Security

Course - Big Data Analytics in Health

Laura Kahn

Course - Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

Upcoming Courses

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Apache Spark Fundamentals

Related Courses

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

Big Data Analytics with Google Colab and Apache Spark

Big Data Analytics in Health

Introduction to Graph Computing

Hadoop and Spark for Administrators

Hortonworks Data Platform (HDP) for Administrators

A Practical Introduction to Stream Processing

Python and Spark for Big Data (PySpark)

Apache Spark MLlib

Related Categories

Apache Spark

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites