40 Scenario-based PySpark interview question | PySpark interview

#spark #pyspark #pysparkinterview #dataengineer

Scenario-based PySpark interview question

1. Group multiple rows into a single row using PySpark
2. Combine two lists and convert them into a DataFrame using PySpark

Want more videos like this? Hit like, comment, share, and subscribe.

➖➖➖➖➖➖➖➖➖➖➖➖➖
data

1. data = [(1, 'Manish', 'Mobile'), (1, 'Manish', 'Washing Machine'), (2, 'Rahul', 'Car'), (2, 'Rahul', 'mobile'), (2, 'Rahul', 'scooty'), (3, 'Monu', 'Scooty')]
   schema = ["Customer_ID", "Customer_Name", "Purchase"]

2. list1 = ["a", "b", "c", "d"]
   list2 = [1, 2, 3, 4]

➖➖➖➖➖➖➖➖➖➖➖➖➖

Hope you liked this video and learned something new :)
See you in the next video. Until then, bye-bye!

➖➖➖➖➖➖➖➖➖➖➖➖➖
Comments

Brother, please explain the code a bit as you go; some parts are not clear to me.

vaibhavpawar

What is an RDD, why did we use an RDD here, and where else can we use one? What is parallelize, and where and when can we use it?

mohammedak-mk