Mastering AWS Glue Unit Testing for PySpark Jobs with Pytest

preview_player
Показать описание
This video is a step-by-step guide on how to write unit tests to test functions in a pyspark job that works on the AWS Glue Service. This video will cover how to write sample dataset to test our glue job transformations to make sure they are doing what we are expecting.

Tutorial Links:
#aws, #awsglue, #pytest, #pyspark
Рекомендации по теме
Комментарии
Автор

Hi Data eng your knowledge in AWS and way of teaching is excellent could you please share the videos on CICD pipeline to deploy the glue jobs to production

thepravinbtech
Автор

Your functions based job is quite straightforward. Would you like to organize your glue job in a Python class?

renyang
Автор

how do you add third party python libraries ?

Angleito
Автор

Do you use the Factory Design pattern?

harshadk
Автор

How would you recommend local dev and organization -> uploading to AWS for scripts with multiple files ?

kckc
Автор

I've been looking to develop some unit tests with pytest but I would like to mock everything related to the Glue Environment. I've been trying to do it through MonkeyPatch but the problem I have is when I transform the dybamicframe to dataFrame, it's also expecting a full mock of the dataFrame and it's functions. Any experience with that?

joseluisvega