Cache vs Persist | Spark Tutorial | Deep Dive

preview_player
Показать описание
#Cache #Persist #Apache #Execution #Model #SparkUI #BigData #Spark #Partitions #Shuffle #Stage #Internals #Performance #optimisation #DeepDive #Join #Shuffle:

Please join as a member in my channel to get additional benefits like materials in BigData , Data Science, live streaming for Members and many more

About us:
We are a technology consulting and training providers, specializes in the technology areas like : Machine Learning,AI,Spark,Big Data,Nosql, graph DB,Cassandra and Hadoop ecosystem.

Visit us :
Twitter :

Thanks for watching
Please Subscribe!!! Like, share and comment!
Рекомендации по теме
Комментарии
Автор

Why spark uses cache() as spark already have persist()? How both are different, persist can work as cache also.

beyond_success
Автор

When we caching a rdd on memory then where it is getting stored, which memory? Is it memory of a particular executor ? Then which executor?

vidushijaiswal
Автор

Hey Hi,
Here why the size of the serialized object is more than the deserialized object. As you said data i.e. deserialized, its size should be more right?
In the above case, I can see deserialized data size is 3.3KB whereas serialized data is 5.0KB. Can you please clarify this.?

sriramrimmalapudi
Автор

i see, for RDD its memory and for Dataframe it is mem + disk

pmdsngh
Автор

Which spark version you are using? I tried with spark 2.1 and 2.3 but the storage tab is not visible. Please could you help me with this issue. Plz suggest

rohitchavan
Автор

Sir will you please make a video that explains the rand() function?

himanshuramekar
Автор

If data size is 16Gb and memory is 20GB what will use cache or persist?

i_ambhosale
Автор

Also isn't there an add on persist option, in which data in replicated "the ones with _2"
MEMORY_ONLY_2, MEMORY_AND_DISK_2

Just a pointer :)

ajiks
Автор

If we don't cache or persist, then where will the rdd be stored ? In heap or ram ?

ashwenkumar
Автор

What if we use persist() without any arguments in it

chakkiralavenkatakrishnach
Автор

Audio quality is not good content is good

panduranga