Most Important PySpark Question in Tech Interviews #pysparkinterview #interview

# Customer data
data=[(1,'Sagar'),(2,'Alex'),(3,'John'),(4,'Kim')]
schema="Customer_ID int, Customer_Name string"

# Order data
data=[(1,4),(3,2)]
schema="Order_ID int, Customer_ID int"
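
The comments on this video answer the question with a left anti join and select `Customer_Name`, so the task appears to be finding customers who have no orders (that reading is an assumption, the description does not state it). As a minimal sketch of what such a join returns on the sample data, here is the same logic in plain Python, without Spark. The names `customers`, `orders`, and `left_anti_join` are hypothetical and used only for illustration.

```python
# Sample data copied from the description.
customers = [(1, 'Sagar'), (2, 'Alex'), (3, 'John'), (4, 'Kim')]  # (Customer_ID, Customer_Name)
orders = [(1, 4), (3, 2)]                                          # (Order_ID, Customer_ID)


def left_anti_join(left_rows, right_keys, key_index=0):
    """Keep only the left rows whose key has no match among right_keys."""
    right_keys = set(right_keys)
    return [row for row in left_rows if row[key_index] not in right_keys]


# Customer_IDs that appear in at least one order.
ordered_ids = [customer_id for _order_id, customer_id in orders]

# Customers that never appear in the orders data.
customers_without_orders = left_anti_join(customers, ordered_ids)
print(customers_without_orders)  # [(1, 'Sagar'), (3, 'John')]
```

Only columns from the left side survive, which matches how a left anti join behaves: it filters the left table and adds nothing from the right.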

Databricks-PySpark RealTime Scenarios Interview Question Series

Project Link:

#hashtags
#tags #pysparkinterview #pysparkforbeginners
Comments

The left_anti join type can easily do this.

aditim

We can also use an anti join to achieve the same result.

final_df=df_customer.join(df_order, on='Customer_ID', how='anti').select('Customer_Name')

saurabh

Bro, is this for experienced candidates or freshers?

muhsinali

joined_df = df_customer.join(df_order, df_customer.Customer_ID == df_order.Customer_ID, "left_anti")
display(joined_df)

caferacerkid