Day 5 | Salary Report | PySpark Scenario-Based Interview Questions and Answers

Create DataFrame Code:
======================

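from pyspark.sql import SparkSession

# Reuse the active session (e.g. in a notebook) or start a new one
spark = SparkSession.builder.appName("salary_report").getOrCreate()

# Creating the Salary DataFrame
salary_data = [
    (1, 'Rohan', 5000),
    (2, 'Alex', 6000),
    (3, 'Maryam', 7000)
]

salary_schema = "emp_id int, emp_name string, base_salary int"

salary_df = spark.createDataFrame(salary_data, salary_schema)

# Creating the Income DataFrame (percentage of base salary per income type)
income_data = [
    (1, 'Basic', 100),
    (2, 'Allowance', 4),
    (3, 'Others', 6)
]

income_schema = "id int, income string, percentage int"

income_df = spark.createDataFrame(income_data, income_schema)

# Creating the Deduction DataFrame (percentage per deduction type)
deduction_data = [
    (1, 'Insurance', 5),
    (2, 'Health', 6),
    (3, 'House', 4)
]

deduction_schema = "id int, deduction string, percentage int"

deduction_df = spark.createDataFrame(deduction_data, deduction_schema)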

#interview #spark #pyspark