Start the new year with a discount – 25% off until Jan 31!
Write your awesome label here.
0580624f38761426e9da-public/a0b9c5a2eb0feeb3ae788b6287e35e78.jpeg");">
Write your awesome label here.
0580624f38761426e9da-public/a0b9c5a2eb0feeb3ae788b6287e35e78.jpeg");">
– THE PROFESSIONAL WAY TO LEARN PYSPARK FOR WORK

Apache Spark
Certification Training

A premium PySpark course to prepare you for becoming a Databricks Certified Associate Developer
Beginner friendly  Comprehensive  Fun

What Learners Say

“Florian's Spark course was a career game-changer and one of the best I've taken! His expert, engaging teaching is outstanding. Ideal for anyone aiming to master Spark and make a significant career impact!”
ANDRÉ ICHIRO 
Senior Data Engineer at phData | São Paulo, Brazil
(Photo: After the course, André and Florian met up in São Paulo.)
“This course significantly upskilled me, providing immense value in my workplace. The content was comprehensive and well-structured, covering both basics and advanced topics. The emphasis on hands-on experience allowed me to immediately apply what I learned. The practical relevance of the course was exceptional, with industry best practices and real-world use cases. The instructor's engaging teaching style and prompt support made the learning process enjoyable. Overall, this course has had a profound impact on my professional growth and performance.”
ADAM BELKADA 

Cloud Data Engineer at EDF Renewables | San Diego, California, US

Whether you are a data engineer, data scientist, data analyst, or student: Learning Apache Spark with this course could be your big data career game-changer!

Watch the videos. Get hands-on in the labs.
THEORY AND PRACTICE IN ONE.
Meticulous explanations. A fun storyline.
DEEP, ENGAGED LEARNING.
200+ practice questions. Detailed feedback.
QUICK ITERATIONS TO LEARN FAST.

What will you learn?

The goal of this course is to transform you into a PySpark professional. It will teach you fundamental PySpark skills and prepare you to get certified with the Databricks Certified Associate Developer for Apache Spark certification.

The course includes 18 modules to help you understand how Apache Spark works internally and how to use it in practice. You can find all topics covered below, but here is an overview:
  • Become a seasoned expert at coding with Spark DataFrames
  • Get confident with the Databricks certification exam content
  • Discover Spark's distributed, fault-tolerant data processing
  • Master how to work with Spark in Databricks
  • Understand the Spark cluster architecture
  • Learn when and how Spark evaluates code
  • Grasp Spark's efficient memory management mechanisms
  • Analyze typical Spark problems like out-of-memory errors
  • See how Spark executes complex operations like joins
  • Become proficient in navigating through the Spark UI
  • ...and many more topics – check out the full list below!

Who is this for?

Anyone with basic Python skills who wants to develop their big data processing skills! And anyone who would like to pass the popular Databricks Certified Associate Developer for Apache Spark certification using PySpark.

If you want to learn how to use Apache Spark with the Scala programming language, this course isn't a fit. We focus on Python and PySpark exclusively, but the fundamental Spark concepts taught are applicable to both languages.
  • Data analysts and developers who want to add verified big data skills and Databricks experience to their portfolio
  • Data engineers who want or need a proof of their Apache Spark skills via a certification to boost their career
  • Data scientists wanting to work efficiently and frustration-free with large data sets in Apache Spark
  • Companies who want to enable their data staff to use Apache Spark in a professional, time- and cost-efficient way
  • Anyone wanting to brush up their Apache Spark skills with a solid understanding of how it works under the hood

Modules

Hours of Video

Assignments

Practice questions

– New Year, New Goals –

Get 25% off and start learning now.

Limited offer until January 31, 2025.

 Free Previews

Course Introduction
Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.Course Introduction

Course Introduction

Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.
Course Introduction
Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.Course Introduction

Overview of the Spark Cluster Architecture

If you want to know what a typical lesson about the theoretical concepts behind Spark looks and feels like, then check out this video. Taken from the second module in the course, it makes you familiar with the components of an Apache Spark cluster.
Course Introduction
Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.Course Introduction

Lab: Reading from CSV Files

If you want to get an impression of a practical part in this course, then, this video is for you. Taken from the third of in total 6 labs in the course, in this lesson we read CSV files together.
Note: This exercise depends on data which you will get access to once you signed up for the course.
Course Introduction
Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.Course Introduction

Lab Exercise: Reading from CSV Files

After you coded along with me in the lab, you are in the driver's seat. For each new skill, there is an exercise like this one. Enabling you to learn-by-doing, you need to solve exercises on your own. You submit your solutions to a scoring script and get immediate feedback.
There are 20 exercises like this in the course.
Course Introduction
Get an overview of the course in this lesson. It is the very first lesson you will hear in the course and will give you a great idea of what to expect.Course Introduction

Lab Exercise Solution: Reading from CSV Files

After you have solved an exercise, I show you how I would have solved the problem.
Here, I explain to you how I would have dealt with the special challenge of reading CSV files into Apache Spark.

Ready to dig deeper?

When you sign up for the course now, you get immediate access to all 96 videos, 200+ exercises, and the community of learners. Join to learn Apache Spark the fun way!
Any doubts before signing up? Please reach out!

What's included?

Overview of learning content
The focus on Practical Experience, High Quality Content, and Diverse Learning Materials will help you understand Apache Spark in depth and keep you engaged.
COURSE CONTENT
  • 96 Video Lessons (14+ hrs)
  • 20 Coding Assignments
  • Fun storyline + learning badges
  • 3 Full Original Practice Exams
  • 38 Knowledge Check Questions
  • 12 Mastery Map PDFs
  • 10-page Certification Exam Guide
“Let's get you certified!”

Meet Florian Roscheck

“I can't believe I don't know anything about big data processing!” – this is what set me off on an exciting journey to learn Apache Spark. Today, I have taught more than 5,000 students how to pass the Databricks Spark certification with my popular practice exams.

As a Sr. Data Scientist at a major consumer goods company in Germany, I currently apply big data models with my data team in a business context. Sustainability is an important topic for me, not only since working in California as a data scientist at a renewable energy company.

I love that Apache Spark is open-source and volunteer for promoting open practices in research, data, and scientific computing at NumFOCUS.
Write your awesome label here.

Follow me

Apache Spark Corporate Training

Are you interested in purchasing multiple seats in the course for your entire team? Please reach out for a formal offer.

Course Content

 English subtitles are available for all videos.

Take your Apache Spark skills to the next level!

Apache Spark Certification Training

Ready to boost your profile as a data professional with verified, job-ready Apache Spark skills?
  • Watch all 96 HD videos
  • Stream the course from any device
  • Get access to all 12 practice notebooks, incl. solutions
  • Clarify any doubts in exchange with other learners in the course discussions
  • Get super prepared for the Databricks exam with 3 practice tests, incl. 180 questions with explanations
Note: Purchase of this course does not include the certification fee charged by Databricks.

“Is it worth it?”

...a question you or your employer might ask.
Unique Clarity
Based on my experience of teaching over 5,000 students how to pass the Spark certification, I have spent more than 500 hours creating the crystal clear educational materials in this course from scratch.
My laser focus on exceptional quality means that in the 15 to 30 hours you typically need to pass this course, you save yourself the frustration of looking for 150 hours for exam resources online, only to resort to StackOverflow for the intricate questions nobody could sufficiently answer.

Learning by Doing
This course is not (just) a video tutorial. It is a full-fledged educational experience that puts you in the driver's seat. In 20 interactive coding exercises you will submit your code snippets and receive automated feedback about how to improve. The 218 questions will challenge your understanding, empowering you to solve practical Spark situations on your own. This will enable you to excel at your data job, saving the company expensive data worker hours and keeping the cloud bill low.

Validated Skills
By taking the Databricks Certified Associate Developer for Apache Spark certification, you acquire a certification of a leading company in the data space today. If you are looking for a data job, employers will be excited about your experience with Apache Spark and Databricks. If you are in a consultant position, the Databricks Spark certification shows your clients that you can be trusted with taking on Spark workloads on the Databricks platform.

FAQ

What format is the course in?

The course consists of videos, interactive coding exercises, and practice questions, incl. practice exams. This content is distributed over 18 modules, alternating between a focus on theory and practice, helping to keep you interested and engaged.

There are 14.5 hours of video content distributed over 96 HD videos. The videos are used for different purposes. One purpose is to be a guided educational tour of Spark's architecture. Another purpose is to allow you to "learn by doing". In part of the videos, you will be coding along in your browser as you watch Florian write and explain Apache Spark code. This allows you to use and learn Apache Spark in an intuitive, practical way.

The 20 interactive coding exercises in this course each consist of an instructional video, an interactive notebook, an evaluation script, and a solution video. In the instructional video, you will read the instruction for the exercise together with Florian and he will give you tips on how to best solve the exercise. Then, you will solve the exercises on your own in an interactive coding environment in your browser. When you submit code that is not (yet) solving the exercise, you get valuable feedback specifying what you can improve based on an evaluation script that is provided in the course. Finally, independent of if you figured out the solution to the exercise or not, Florian will show you his approach to solving the exercise. All exercises are ungraded and you get the opportunity to repeat them as many times as possible for you to feel fit for the Databricks exam.

At the end of each module, you will find a couple of practice questions that check whether you understood the content of the module. When you provide a wrong answer, you will get an explanation about why your answer was wrong, allowing you to learn from your mistakes.

The 3 original practice tests in this course help to prepare you for the environment and content of the Databricks certification in a targeted way. You will get familiar with the question format of the Databricks certification as well as the breadth of topics that it covers. The greatest opportunity for you here is to take advantage of the many thorough explanations of every answer of the, all in all, 180 exam questions. Once you submit an answer, you will get immediate feedback about if your answer was right and, if it was wrong, what was wrong with your answer. This targeted feedback will give you a chance to rapidly improve your understanding of PySpark, setting you up for an express path to pass the Databricks certification and nurture your big data career.

Is there a discount for students?

If you are a student and are interested in picking up solid Apache Spark skills, then, first of all, I congratulate you on your wise decision. Second, please reach out to florian@headindata.com with proof that you are a student who is currently enrolled in an educational institution (e.g. a photo of your valid student ID card), so I can provide you with an individualized coupon.