Dive into the world of big data processing with PySpark in Microsoft Fabric Notebooks! This course offers hands-on experience with PySpark, the powerful framework for large-scale data processing, integrated within Microsoft Fabric’s seamless environment. Learn how to leverage PySpark’ s robust capabilities to handle and analyze massive datasets, write efficient queries, and build scalable pipelines. You’ll explore how Fabric Notebooks empower you to collaborate, streamline workflows, and utilize Microsoft Fabric’s unique features like built-in security, connectors, and scalability for modern data challenges.
Overview
COURSE DIFFICULTY
COURSE DURATION
4h 38m
Skills Learned
After completing this online training course, students will be able to:
Navigate and utilize Microsoft Fabric workspace
Develop Spark notebooks for data engineering
Build data pipelines and dataflows in Fabric
Create semantic models for reporting
Power Automate Developers, Data Engineers
None
01. Introduction
02. Provisioning a Fabric Notebook
03. What is PySpark?
04. Working with Strings and Numbers
05. Working with Dataframes
06. Querying Data
07. Writing Data
08. Filtering Data
09. Aggregations
10. Working with Nulls
SKILLS LEARNED
Skills Learned
After completing this online training course, students will be able to:
Navigate and utilize Microsoft Fabric workspace
Develop Spark notebooks for data engineering
Build data pipelines and dataflows in Fabric
Create semantic models for reporting
WHO SHOULD ATTEND
Power Automate Developers, Data Engineers
PREREQUISITES
None
COURSE OUTLINE
01. Introduction
02. Provisioning a Fabric Notebook
03. What is PySpark?
04. Working with Strings and Numbers
05. Working with Dataframes
06. Querying Data
07. Writing Data
08. Filtering Data
09. Aggregations
10. Working with Nulls
