Cache and Persist DataFrame PySpark Interview Question | Maersk Interview Question |

preview_player
Показать описание
data = [("Raj","Doe",None),
(None,"Samuel","VIZAG"),
("David","Smith", None),
("Samson",None, "HYD"),
("Immi", "Steve", "BNG"),
(None, None, None)]

columns = ["Firstname", "Lastname", "City"]

Databricks-PySpark RealTime Scenarios Interview Question Series

Project Link:

#hashtags #cache #persit #maersk
#tags #pysparkinterview #pysparkforbeginners
Рекомендации по теме
Комментарии
Автор

json files with 10000 records
Everey record has 10 elements, with 2 and 8 elements contain 5 child elements
pls read the json and parse it

satyabratasahoo
Автор

I was asked questions like create table with schema without schema reading file from s3 writing to s3