deloitte interview question in pyspark| pyspark interview questions | #interview | #sql |#data

preview_player
Показать описание
deloitte interview question in pyspark| pyspark interview questions | #interview | #sql |#data

In this video we will be solving DELOITTE interview question in PySpark.

Dataframe Create Code :
======================
1)
product_data = [
(1, 'Laptops', 'Electronics'),
(2, 'Jeans', 'Clothing'),
(3, 'Chairs', 'Home Appliances')
]

product_schema = ['product_id', 'product_name', 'category']

2)
sales_data = [
(1, 2019, 1000.00),
(1, 2020, 1200.00),
(1, 2021, 1100.00),
(2, 2019, 500.00),
(2, 2020, 600.00),
(2, 2021, 900.00),
(3, 2019, 300.00),
(3, 2020, 450.00),
(3, 2021, 400.00)
]

sales_schema = ['product_id', 'year', 'total_sales_revenue']

SQL Approach and Step by Step Solution Video :

#deloitte #sql #interview #dataengineers #dataanalytics #datascience #StrataScratch #Facebook #data #dataengineeringinterview #codechallenge #datascientist #pyspark #CodingInterview
#dsafordataguy
Рекомендации по теме
Комментарии
Автор

Please give the Input data of products for our practice, it will be really helpful, thanks

vickywaran
Автор

what is the need to considering -ve values becuase -ve value proves there is no increasement in revenue. We can directely filter the value on the basis of +ve value.

jhonsen
Автор

Nice explanation, if possible, plese provide the data as well

AnandKumar-rhcv
Автор

Hey Dhiraj, Please add Data into description.

biramdevpawar