Live Online Style Video +Live Instructor + Official Course-Book + Hands-on Labs

Data Processing with PySpark


  1. 6 months access to videos.
  2. Access via Laptop, Tab, Mobile, and Smart TV.
  3. Certificate of Completion.
  4. Official Course-Book

We do not have video for this course. It will take 48 hours to deliver.

You can also purchase an annual plan for USD 999. For one year, all Flexi Videos will be free for you. Buy Annual Plan

Flexi Video

USD 199

Official Course-Book Instructor Hand-outs





USD 199

100% Refund for Flexi Video (Course-Book is non-refundable) if user is not satisfied with the Video and seeks refund within 7 days of purchase.

Download Course Contents
Module 1: Introduction to Apache Spark
  •       Introduction to Big Data
  •       What is Apache Spark?
  •       Evaluation of Apache Spark
  •       Features
  •       Spark Architecture
  •       Spark Vs Hadoop Map Reduce
  • Spark SQL Vs HIVE
  •       Installation on MAC
  •       Installation on Windows
  •       With Scala and Intellij
  •       Creating DataBricks Account
  •       DataBricks Compute, Notebook, tables
  •       Why Pyspark?
  •       Need for Pyspark
  •       Spark Python Vs Scala
  •       Pyspark features
  •       Real-life usage of PySpark
  •       Web/Application
  •       SparkSession
  •       SparkContext
  •       Stage
  •       Executor
  •       RDD
  •       Parallelize
  • Parallelize
  • Read Text File
  • Read CSV
  •       Create RDD
  •       RDD Persistence and Caching Mechanism
  •       RDD Features
  •       RDD Limitations
  •       RDD Lineage
  •       Action
  •       Pair Functions- Paired RDD
  •       Repartition and Coalesce
  •       Shuffle Partitions
  •       Cache vs Persist
  •       Introduction
  •       Making data Structured
  •       Case Classes
  •       ways to extract case class objects
  •       using function
  •       using map with multiple exressions
  •       using map with single expression
  •       Sql Context
  •       Data Frames API
  •       DataSet API
  •       RDD vs DataFrame vs DataSet
  •       Create a DataFrame
  •       Create an empty DataFrame
  •       Convert RDD to DataFrame
  •       Convert DataFrame to Pandas
  •       union() & unionAll()
  •       unionByName()
  •       UDF (User Defined Function)
  •       map()
  •       Aggregate Functions
  •       Window Functions
  •       Date and Timestamp Functions
  •       JSON Functions
  •       Read & Write JSON file
  •       when()
  •       expr()
  •       lit()
  •       split()
  •       concat_ws()
  •       substring()
  •       translate()
  • regexp_replace()
  • overlay()
  •       to_timestamp()
  • to_date
  •       Working with sql statements
  •       Spark and Hive Integration
  •       Spark and mysql Integration
  •       Working with CSV
  •       Working with JSON
  •       Transformations and actions on dataframes
  •       Narrow, wide transformations
  •       Addition of new columns, dropping of columns ,renaming columns
  •       Addition of new rows, dropping rows
  •       Handling nulls
  •       Joins
  •       Local Mode
  •       Cluster Modes(Standalone , YARN

Learn more about Koenig. Download Presentation Buy Other Flexi


Yes, you can pay from this web page.
Yes, the site is secure by utilizing Secure Sockets Layer (SSL) Technology. SSL technology enables the encryption of sensitive information during online transactions. We use the highest assurance SSL/TLS certificate, which ensures that no unauthorized person can get to your sensitive payment data over the web.
We use the best standards in Internet security. Any data retained is not shared with third parties.
You will be provided access to LET ( Learning Enhancement Tool), where you will get the links to access all your purchases.
Flexi video for the new version will be provided free of cost.
6 months from the date of delivery.
Yes, contact us for corporate packages.
Yes, Course-Book and Lab are not included in the annual plan. All Flexi videos are included.
It is only for one user.
Videos can only be streamed and not downloaded.
We do not track the pass rate of Flexi students. However, we trust it will be lower than for Live Online.
Presently, Flexi is only available in English.
It’s a unique subscription plan where customers can avail unlimited Flexi courses within a year.
The subscription plan is valid for 1 year from the date of purchase.
No, this is limited to one user and its non-transferable.

Feedbacks from Clients