Master Reading Spark Query Plans
Spark Performance Tuning
Dive deep into Apache Spark Query Plans to better understand how Apache Spark operates under the hood. We'll cover how Spark creates logical and physical plans, as well as the role of the Catalyst Optimizer in applying optimization techniques such as filter (predicate) pushdown and projection pushdown.
The video covers intermediate Apache Spark concepts in depth: detailed explanations of how to read the Spark UI and how to interpret Spark's query plans through code snippets of various narrow and wide transformations, including reading files, select, filter, join, group by, repartition, coalesce, hash partitioning, HashAggregate, round-robin partitioning, range partitioning, and sort-merge join. Understanding these will give you a grasp of Spark's step-by-step thought process and help you identify performance issues and possible optimizations.
Chapters:
00:00 Introduction
01:30 How Spark generates logical and physical plans
04:46 Narrow transformations (filter, select, add or update columns) query plan explanation
09:02 Repartition query plan explanation
12:57 Coalesce query plan explanation
17:32 Joins query plan explanation
23:23 Group by count query plan explanation
27:04 Group by sum query plan explanation
28:05 Group by count distinct query plan explanation
33:59 Interesting observations on Spark’s query plans
36:56 When will predicate pushdown not work?
39:07 Thank you
#ApacheSpark #SparkPerformanceTuning #DataEngineering #SparkDAG #SparkOptimization
#dataengineering #interviewquestions #azuredataengineer