PySpark Live-Coding: Building a project from scratch | Dev Setup, Read CSV, Window Functions, Tests

preview_player
Показать описание
#apachespark #pyspark #databricks #dataengineering #datascience #python #livecoding #learnspark

In this video, I will do a live-coding session and build a PySpark project from scratch. I will show you how to set up the development environment, where to get data, how to read csv files in the right format, implement a window function and how to write a test. Along the way, I will show you where to find information from official sources. Enjoy!

00:00 Intro
00:44 Outline
02:32 Development setup
08:05 Hello PySpark
13:14 Getting example data
15:11 Reading csv
24:36 Transforming columns
30:32 Window functions
36:48 Writing a test
50:52 Becoming Pro & Outro
Рекомендации по теме
Комментарии
Автор

Huge respect for your work.
I’m Data Engineer too.
What do you think about Mojo?

terabit
Автор

Man you make great content! I work as a data engineer(Scala + Spark)

anmolmishra
visit shbcf.ru