Apache Spark 2 - Basic Transformations and Actions - 01 - map, flatMap, reduce and more

preview_player
Показать описание
As part of the course Apache Spark 2 using Python 3, let us understand details about basic RDD transformations

itversiry LMS course(CCA 175 Spark and Hadoop Developer – Python – 93 Days Lab):

* Row level transformations – map, flatMap
* Aggregations – reduceByKey and aggregateByKey

We will be seeing other transformations as part of next session.

* Filtering – filter
* Joins – performing inner joins and outer joins - join, leftOuterJoin, rightOuterJoin, fullOuterJoin
* Sorting data - sortByKey, we will also understand composite key sorting.

On our YouTube channel we conduct live sessions regularly. Please do subscribe to get notifications for our live sessions by clicking here.

For quick itversity updates, subscribe to our newsletter or follow us on social platforms.

#Python #PySpark #Spark2 #itversity #Spark #DataEngineering

Join this channel to get access to perks:
Рекомендации по теме
Комментарии
Автор

Great Explanation, Thankyou so much for wonderful series.

khubaibraza
Автор

if subtitle available it is very useful

praveenkumare
Автор

Good explanation about the transformations map, flat map, group by key, reduce by key, aggregate by key and in general. Good practice for spark using pyspark

sumithra