AWS Glue Job Import Libraries Explained (And Why We Need Them)

This video explains the six import statements in a boilerplate AWS Glue script, to help data engineers understand what they do and why we need them.

#aws #awsglue #pyspark
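
For readers without the video at hand, here is a minimal sketch of what those six imports usually look like. The description does not list them, so the lines below are an assumption based on the script AWS Glue typically generates for a Spark job; check them against your own script. The sketch only parses the text with `ast` (the helper `list_imports` is hypothetical), so it runs even where `awsglue` is not installed.

```python
import ast

# Assumed boilerplate: the import lines commonly found at the top of a
# generated Glue Spark script. Held as text so nothing is actually imported.
BOILERPLATE = """\
import sys                                    # read raw job arguments (sys.argv)
from awsglue.transforms import *              # Glue's built-in transform classes
from awsglue.utils import getResolvedOptions  # parse named job arguments
from pyspark.context import SparkContext      # entry point to Spark
from awsglue.context import GlueContext       # Glue wrapper around SparkContext
from awsglue.job import Job                   # job init/commit bookkeeping
"""


def list_imports(source):
    """Return (module, imported_names) pairs, one per import statement."""
    found = []
    for node in ast.parse(source).body:
        if isinstance(node, ast.Import):
            for alias in node.names:
                found.append((alias.name, []))
        elif isinstance(node, ast.ImportFrom):
            found.append((node.module, [alias.name for alias in node.names]))
    return found


imports = list_imports(BOILERPLATE)
for module, names in imports:
    print(module, names)
```

Inside a real Glue job these lines are ordinary imports at the top of the script; the parsing step here exists only so the example can run anywhere.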
Comments

Perfect and straight to the point. I got in 5 min what I couldn't get in an hour.

mohammedgt

Just found your channel. Can we have a complete playlist, a course of some kind, or a one-shot video? You explain in depth, and I found your videos better than the other tutorials on YouTube.

mickyman

Cool explanation. I had never paid attention to these boilerplate statements.

sukulmahadik

Liked, subscribed and commented!
Thank you very much for your help!
Greetings from Colombia!

danielchicaiza

Loved this video. Just a question: isn't import * a bad coding practice?
If you have already created a video on the practical implementation of those 24 classes, please share the link; if not, I request you to make a video on that. "Took the one less traveled by, And that has made all the difference".

nikhilgupta
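
On the `import *` question above, a small sketch of the usual concern: a wildcard import can silently replace a name that an earlier wildcard import brought in. This uses only the standard library (`math` and `cmath`, chosen purely for illustration, not taken from the video) and runs the imports through `exec` into a scratch dictionary so the effect can be inspected.

```python
import cmath
import math

# Scratch namespace standing in for a script's global scope.
namespace = {}

# First wildcard import: "sqrt" now refers to math.sqrt.
exec("from math import *", namespace)
first_sqrt = namespace["sqrt"]

# Second wildcard import: "sqrt" is rebound to cmath.sqrt without any warning.
exec("from cmath import *", namespace)
second_sqrt = namespace["sqrt"]

shadowed = first_sqrt is math.sqrt and second_sqrt is cmath.sqrt
print("sqrt was silently replaced:", shadowed)
```

Importing only the names a script actually uses avoids this kind of collision and makes it clear where each name comes from.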

I have files in an S3 bucket whose type is gz. Each gz file consists of JSON records (each line is one record in JSON format). How can I read such a file using a Glue dynamic frame?

abdullahkheruwala

Nice video! I am struggling to find a way to set the script location path in the Jupyter notebook. I can see there is no magic command to do that, and AWS does not allow any manual changes under the "Job details" tab. Can you help me if there is any way?

sanchitgarg

Nice video! What's the point of using jobs in notebooks, since bookmarks aren't supported there? Is there another benefit?

Scott-sf

I'm new to AWS and I'm working on a project, but I can't get it working. I'm getting: Unresolved reference 'awsglue'
Can you help me with this?

AbhishekChauhan-kvds

Can we not create functions (def fn()) in streaming Glue jobs?

saksheegoel

Hello sir, I am getting "no module named awsglue.context" when I write the above imports in AWS Glue Python shell. Can you please help? Thank you.

MuhammadImran-lrtn

Can we update data in a database using Glue jobs?

AmritAgarwal

Hi, I have a question about the interaction between creating a "normal" Spark session and Glue. I needed to import a JAR, and I got it working with
spark = SparkSession.builder\
.appName("my-app") \
.config('spark.jars.packages',
.getOrCreate()
I commented out
sc = SparkContext()
glueContext = GlueContext(sc)
spark = glueContext.spark_session
So the two things I'm missing out on are dynamic frames and saved job state. How do I modify the original arguments so that I can bring GlueContext back in? Thank you

Fight