Description
Spark is one of the most in-demand Big Data processing frameworks right now.
This course will take you through the core concepts of PySpark. We will work to enable you to do most of the things you’d do in SQL or Python Pandas library, that is:
- Getting hold of data
- Handling missing data and cleaning data up
- Aggregating your data
- Filtering it
- Pivoting it
- And Writing it back
All of these things will enable you to leverage Spark on large datasets and start getting value from your data.
Let’s get started.
Related Courses:
Data Analysis with R by Facebook
Modern Reinforcement-learning using Deep Learning
(Premium) - Learn Machine Learning By Building Projects
Machine Learning For Researchers
Natural Language Processing: NLP In Python with Projects
Machine Learning MASTER, Zero To Mastery
Digishock 2.0: Learn Machine Learning in 2021 (No Coding)
Build Auto Machine Learning (Auto-ML) Projects With Python