Tiger Analytics PySpark Interview Question | Very Important Question of PySpark |

Показать описание

data=[
('Rudra','math',79),
('Rudra','eng',60),
('Shivu','math', 68),
('Shivu','eng', 59),
('Anu','math', 65),
('Anu','eng',80)
]
schema="Name string,Sub string,Marks int"

Solution:

I have prepared many courses on Azure Data Engineering

1. Build Azure End to. End Project

2. Build Delta Lake project

3. Master in Azure Data Factory with ETL Project and PowerBi

4. Master in Python

Check out my courses on Azure Data Engineering

hastags
tags

#dataengineer #interviewquestions #pysparkinterview
#hashtags #hastag #tags #tcs

Рекомендации по теме

Комментарии

Use pivot funtion with Subject column to get a new column for each value in that particular column. Can use aggregate function sum on Marks. Order of Eng/Math column may not be same.

abhigyapranshu

All videos in this pyspark interview playlist are highly useful Sagar. Big Thanks for your efforts man!!

sheikirfan

pivoted_df = "first"}).show()

abhishekpathak

Hi Sagar, to master pyspark which of your's course should i buy?

pratik

I tried below df =
df.show(), but throw error as jgd = self._jgd.pivot(pivot_col) Column is not iterable

surenderraja

was this asked in Tiger analytics (Canada)?

Pratik_Tortikar

My Solution :

df.withColumn("math", when(df.Subject=="math", df.Marks).otherwise(0))\
.withColumn("eng", when(df.Subject=="eng", df.Marks).otherwise(0))\
.groupBy("Name").agg(max("math").alias("math"), max("eng").alias("eng")).show()

throughmyglasses

Hi Sir

My Way:

df1 =
df2 = df1.select("Name", "math", "eng").orderBy(col('math').desc(), col('eng').desc())
df2.show()

rawat

Sagar, I had a query.... For using collect_list command, we have to sort the dataset based on subject first, right?

My Solution:

df_1 = spark.createDataFrame(data=data, schema=["Name", "Sub", "Marks"])

df_2 = df_1.groupBy(col("Name")).pivot("Sub", ["math", "eng"]).agg(sum("Marks"))
or,

display(spark.sql("Select Name, SUM(CASE WHEN sub like 'math' THEN Marks ELSE 0 END) as Math, SUM(CASE WHEN sub like 'eng' THEN Marks ELSE 0 END) as Eng from Pivot_Data GROUP BY Name"))

_Sujoy_Das

please english language azure datbricks
required plese

dorwxtk

: "last"})
df1.show()
This code will give you irrspective how many subject you have in Sub col umn as different columns

venkatsubbaiah

df.groupBy("Name").agg(max(when(df.Sub=='math', df.Marks).otherwise(0)).alias("Math"), max(when(df.Sub=='eng', df.Marks).otherwise(0)).alias("eng"))

okouroy

df.groupby(col("Name")).agg(
sum(when(col("Sub")=="math", col("Marks")).otherwise(0)).alias("maths"),
sum(when(col("Sub")=="eng", col("Marks")).otherwise(0)).alias("eng")
).show()

amanmaheshwari

df.groupBy('name').pivot('Sub', ['math', 'eng']).sum('Marks').display()

ayushmangal

df_sub1 =
df_sub1.withColumn('math', df_sub1.Sub_Marks[0]).withColumn('eng', df_sub1.Sub_Marks[1]).select('Name', 'math', 'eng').show()

kunalshinkar

df.groupBy(f.col("Name")).pivot("Sub", [i[0] for i in

balaa

Tiger Analytics PySpark Interview Question | Very Important Question of PySpark |

Tiger Analytics PySpark Interview Question | Very Important Question of PySpark |

tiger analytics interview questions and answers in pyspark | #interview

Tiger Analytics Interview Question | Find out all the DataFrames in Spark Session |

Questions asked in Tiger Analytics TO DE - Part 1|| Pyspark || Data Engineer #pyspark #dataengineer

Tiger Analytics Senior Data Engineer Interview Questions | SDE Interview Experience | 25 LPA

Tiger Analytics PySPark Interview Question - Split and Explode Functions in PySpark

tiger analytics interview questions

Find out the head count of employee in each job | Data Engineering Interview | TigerAnalytics

Most Asked Coding Interview Question (Don't Skip !!😮) #shorts

Solve KPMG Pyspark Interview Questions

tiger analytics python interview questions and answers | dsa for data engineer |dsa for data science

3 most common data modeling interview questions

Pyspark Advanced interview questions part 1 #Databricks #PysparkInterviewQuestions #DeltaLake

How much does a DATA ENGINEER make?

Rahul Data Scientist at Tiger Analytics | Data Scientist Interview | Applied Ai Course Reviews

Data engineer interview question | Process 100 GB of data in Spark Spark | Number of Executors

PySpark Interview questions | Part 12B | #shorts #pyspark #bigdata

This SQL Problem I Could Not Answer in Deloitte Interview | Last Not Null Value | Data Analytics

Find the key from Map with K, V pair where V is greater than 100|Tiger Analytics Interview Question|

Most Important Question of PySpark in LTIMindTree Interview Question | Salary in each department |

How much does an ANALYST from a CONSULTANCY make?

How much does a LEAD ANALYST make?

'Data Science' For 2 to 3 Years Experienced, Most Asked Interview Questions, Can't Af...

data engineer interview questions