6.7 Decide Number of Buckets in Hive and Spark | Partition and Bucketing

In this video we cover:
What bucketing is in Hive and Spark
How to create buckets
How to decide the number of buckets in Hive
Factors that determine the number of buckets in Hive
Hive bucketing without partitioning
Bucket sampling in Hive
Bucketing in Hive (DataFlair)
Hive bucket hash function
Creating buckets in Hive
List bucketing in Hive
Custom bucketing in Hive
Bucketing on multiple columns in Hive
For the difference between partitioning and bucketing, see the other video on this channel. The link is here
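
As a rough illustration of the sizing and hashing topics listed above, here is a minimal Python sketch. It is not taken from the video. The function names are hypothetical, the 128 MB target bucket size and the rounding up to a power of two are assumed rules of thumb, and the hash shown is only meant to mirror how classic Hive bucketing is commonly described for integer keys (newer Hive bucketing versions and Spark use different hash functions).

```python
import math

# Assumption: aim for buckets of roughly one HDFS block (128 MB).
DEFAULT_TARGET_BUCKET_BYTES = 128 * 1024**2


def suggest_num_buckets(table_size_bytes: int,
                        target_bucket_bytes: int = DEFAULT_TARGET_BUCKET_BYTES) -> int:
    """Heuristic: table size / target bucket size, rounded up to a power of two."""
    if table_size_bytes <= 0 or target_bucket_bytes <= 0:
        raise ValueError("sizes must be positive")
    raw = math.ceil(table_size_bytes / target_bucket_bytes)
    # Next power of two >= raw (the power-of-two rounding is an assumed convention).
    return 1 << (raw - 1).bit_length()


def bucket_for_int_key(key: int, num_buckets: int) -> int:
    """Illustrative bucket assignment for an integer key: non-negative hash mod bucket count."""
    if num_buckets <= 0:
        raise ValueError("num_buckets must be positive")
    # Mask to a non-negative 31-bit value before taking the modulus.
    return (key & 0x7FFFFFFF) % num_buckets


if __name__ == "__main__":
    print(suggest_num_buckets(10 * 1024**3))  # 10 GiB table -> 128
    print(bucket_for_int_key(10, 4))          # -> 2
```

The chosen count would then go into the table definition (in Hive, the INTO n BUCKETS part of CLUSTERED BY). Because rows are assigned by hash modulo the bucket count, a skewed key can still make one bucket much larger than the others, so the estimate is only a starting point.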

Please subscribe to our channel.
Here is the link to other Spark interview questions

Here is the link to other Hadoop interview questions

#spark #sparkquestions #hive #bucketing
Comments

Awesome information... thank you, Harjeet, for the great

gauravmathur

I have a question:
1) Buckets are created by writing CLUSTERED BY. How can we implicitly give the number of buckets?

lokeshmvs

Can you please explain what HCatalog is and what it is used for?

projjalchakraborty

How can we restrict the bucket size to the block size dynamically? If I specify 4 buckets, what happens if one bucket gradually grows to 1 GB or more? How do I optimize for that?

diptiranjannayak

Can we create buckets on top of partitioning? Can you please explain this?

routhmahesh

How can we decide whether to use partitioning or bucketing?

simplecooking

I got confused: the block size is 128 MB and our memory can be 4 GB. In this case, should a bucket be 128 MB or 4 GB?

rahulsamyal

Please explain Oozie: how to schedule jobs, and the workflow concepts. Thank you.

rajareddy

Is it possible to alter a bucketed table to change the number of buckets?

r.kishorekumar

Can you please explain how we can optimize if the number of buckets gets far too large, around 1 million?

divendughati

Can you please explain how to decide the number of partitions?

AnkitaMishra-diub

Harjeet, what is the default number of buckets and partitions?

gauravpathak