CCA 175 Preparation Problem 2

preview_player
Показать описание
Рекомендации по теме
Комментарии
Автор

Hi, very nice video, I got a question about the cloudera will give you the password and username for the MYSQL in the exam, right?

chrisy.
Автор

Thanks for your efforts on the preparatory tutorials Arun.. Wanted to check if there was a need to apply countDistinct on productID while doing it the Dataframe way?

vjaiswal
Автор

awesome Arun. really appreciate the efforts you have put for teaching ... keep the momentum going!!!

tejindersinghbedi
Автор

Hello Arun
Do you have the solutions in PySpark as well ? Whats best way to get solutions based on PySpark dataframes in Spark?

Автор

at 7:25 when you are changing the permissions, I understand that rwx means 'read' 'write' and 'execute' but there's an extra rw and group should have read write permissions but the second only has 'r'? and the third needs to have 'rx' since it's read execute but you only have 'x'. It's very confusing can you please explain whenever you get time :) thankyou

hou
Автор

Can we solve all spark problems without using RDDs and just by relying on DF or SQL?

pruthvirajsuresh
Автор

in the exam if they ask us to do aggregate, can we create a dataframe using pyspark or will they us to do using rdd transformation ?. because its easy and fast using python functions in pyspark.

pruthvirajsuresh
Автор

Really appreciate your effort arun, 1 mistake though : you forgot to filter sql result for prices less than 100.

metawerse
Автор

Is it necessary to use round functions while it's not specified in the question ?

kundansakargayen
Автор

How many problem will we get in cca175 like this?

Shaitender