Running Spark applications using Scala and Python on EMR Cluster

preview_player
Показать описание
As we are done with revising programming languages and built Spark based applications, now let us see how we can run these applications on the cluster.

* Run the Spark application using Scala using step execution
* Run the Spark application using Python using step execution
* Run both the applications directly on the cluster
* Validate the data
* Compare and Contrast Running jobs against s3 as well as HDFS
* Understand the relevance of other technologies such as Red Shift, Dynamo DB etc.

Connect with me or follow me at
Рекомендации по теме
Комментарии
Автор

awesome video. can you please share the program that you are showing in the jar file? Do you have any videos on how to set up a Scala development framework for a group of 4/5 developers while working with EMR?

navinsai
welcome to shbcf.ru