AWS Tutorials - Absolute Beginners Tutorial for Amazon EMR

Amazon EMR is a big data platform for processing large-scale data using open-source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. Amazon EMR is easy to set up, operate, and scale for big data workloads because it automates time-consuming tasks such as provisioning capacity and tuning clusters.
In this workshop, you launch an EMR cluster. You then use Jupyter Notebook to do PySpark-based programming against the EMR cluster. Finally, you launch a data processing task on the EMR cluster.
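
The last part of the workshop, launching a data processing task on the cluster, comes down to submitting a step to EMR. Below is a minimal sketch of what such a step definition can look like in Python, assuming the task is a PySpark script stored in S3 (the video may use a different job type). The helper name `build_spark_step`, the step name, and every S3 path are hypothetical placeholders, not values from the video.

```python
def build_spark_step(name, script_uri, script_args=()):
    """Build an EMR step definition that runs a PySpark script via spark-submit."""
    return {
        "Name": name,
        # Keep the cluster running if the step fails.
        "ActionOnFailure": "CONTINUE",
        "HadoopJarStep": {
            # command-runner.jar lets a step run a command such as spark-submit.
            "Jar": "command-runner.jar",
            "Args": [
                "spark-submit",
                "--deploy-mode", "cluster",
                script_uri,
                *script_args,
            ],
        },
    }


# Hypothetical S3 locations, used only for illustration.
step = build_spark_step(
    "process-sales-data",
    "s3://my-example-bucket/scripts/job.py",
    ["s3://my-example-bucket/input/", "s3://my-example-bucket/output/"],
)
print(step["HadoopJarStep"]["Args"])
```

With boto3 installed and AWS credentials configured, a step like this could be submitted with `boto3.client("emr").add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])`, where `cluster_id` stands for the ID of your own running cluster.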
Comments
Author

Very crisp & clear!! Easy to understand :) “If you can’t explain it simply, you don’t understand it well enough.” - Albert Einstein

RaguM-kz
Author

My god, this was fantastic! You explained things at a high level, but then you actually followed through and covered specific, concrete examples <3.

bugfacedog
Author

Many thanks. You are simply superb, one of the best resources available on the internet. The best part of all the workshops you share is that they always have practical content. Truly appreciated. Many thanks.

akkijaiho
Author

Wow, wow, wow... just awesome, Sir. Thank you so much for doing this beautiful, time-consuming job so that beginners can learn from your knowledge. Thank you once again 🙏🙏

saaransh
Author

Great introductory tutorial to AWS EMR. After watching your tutorial I now have some knowledge about EMR. Thanks a lot.

smogjr
Author

Excellent presentation described in simple language. Really appreciate your effort.

HareshRCPatel
Author

It's a really good session. As a beginner, I learned many things. Thank you so much.

sukanyaraja
Author

Great content. It focuses on the basics and gets into the right level of detail. Amazing job!

ankan
Author

Great job. Exactly what I needed. Thanks a ton

ARUNKUMAR-gfzv
Author

Amazing, thanks for the great introduction!

ghay
Author

Very informative video. Please do a tutorial for Glue and Athena as well.

sakinafakhri
Author

Awesome 😊 It really helped a lot. Looking forward to one more session on reading and writing HBase tables from Spark in EMR, along with version compatibility.

arjunaare
Author

Great video!
One small correction: it's Jupyter Notebook

dorinxtg
Author

Once again, this is a great tutorial. Thank you. I was wondering what your view is on running Spark ETL on AWS Glue versus an Amazon EMR Spark cluster. What would be your preference between these two services, assuming AWS cost isn't a concern?

hsz
Author

Good job! I liked it a lot. Keep doing an awesome job!

KS-nivv
Author

Really useful. Thanks for sharing the knowledge 😃

sumitkumarsah
Author

Awesome video. Where can we download the jar file from?

dhanraj
Author

Can you please give a workshop on AWS EMR Hadoop and Presto?

emraanpathan
Author

Great content. But can someone tell me how to fetch input parameters in the notebook when the EMR notebook is triggered through boto3 or any backend language?

scriptbeesdem
Author

What should I choose under the "New" option if I will be writing Scala code in Spark instead of Python?

jovelynobias