How to build AWS Glue ETL with Python shell | Data pipeline | Read data from S3 and load Redshift

In this video, we develop an AWS Glue ETL script using a Python shell job. AWS Glue can now run Python scripts for small to medium-sized ETL (extract, transform, and load) workflows. Previously, AWS Glue jobs were limited to the Apache Spark environment.
Python shell jobs in AWS Glue support scripts that are compatible with Python 2 and 3 and come pre-loaded with libraries such as Boto3, NumPy, SciPy, and pandas, among others. We can also install other libraries via a .whl file.
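
As a rough illustration of the kind of script a Python shell job can run, here is a minimal sketch, not the script from the video. The function names `transform_rows` and `read_s3_text`, the `id` column, and the sample data are all hypothetical. The S3 read uses Boto3's `get_object`, and the import is deferred so the transform step can be exercised on a local machine without AWS access.

```python
import csv
import io


def transform_rows(csv_text):
    """Parse CSV text, trim whitespace, and drop rows with an empty 'id'.

    The 'id' column is an assumption made for this sketch.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    cleaned = []
    for row in reader:
        # Skip the overflow key (None) that DictReader adds for extra fields.
        row = {k: (v or "").strip() for k, v in row.items() if k is not None}
        if row.get("id"):
            cleaned.append(row)
    return cleaned


def read_s3_text(bucket, key):
    """Fetch an S3 object as UTF-8 text (hypothetical helper)."""
    # Imported lazily so transform_rows can be tested without boto3 installed.
    import boto3

    body = boto3.client("s3").get_object(Bucket=bucket, Key=key)["Body"]
    return body.read().decode("utf-8")


if __name__ == "__main__":
    # Local dry run on inline sample data; no AWS calls are made here.
    sample = "id,name\n1, Alice \n,missing id\n2,Bob\n"
    for row in transform_rows(sample):
        print(row)
```

Keeping the transform separate from the S3 access is one way to try the logic locally before uploading the script to AWS.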

#Python #ETL #AWS

Topics covered in this video:
0:00 - Introduction to ETL with Python shell
0:53 - Prerequisites
1:30 - Create Python .whl file
2:35 - Python ETL script
4:15 - Upload scripts to AWS
5:11 - AWS Glue ETL Job
6:33 - AWS Redshift table
6:49 - Execute Glue ETL Job
7:17 - Review Data & logs
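
The job-creation and load steps listed above could be sketched roughly as follows. This is an assumption-laden outline, not the video's actual code: the helper names, the table name, bucket paths, and role ARNs are hypothetical, and loading via a Redshift `COPY` statement is one common approach that the video may or may not use. The Glue calls (`create_job`, `start_job_run`) and the `--extra-py-files` argument for supplying a .whl file are real Boto3/Glue features.

```python
def build_copy_sql(table, s3_uri, iam_role_arn):
    """Build a Redshift COPY statement for a CSV file with a header row."""
    return (
        f"COPY {table} FROM '{s3_uri}' "
        f"IAM_ROLE '{iam_role_arn}' "
        "FORMAT AS CSV IGNOREHEADER 1;"
    )


def build_job_definition(name, role_arn, script_uri, wheel_uri):
    """Assemble keyword arguments for the Glue create_job call."""
    return {
        "Name": name,
        "Role": role_arn,
        "Command": {
            "Name": "pythonshell",          # Python shell job, not Spark
            "ScriptLocation": script_uri,   # S3 path of the uploaded script
            "PythonVersion": "3.9",
        },
        # Extra libraries packaged as a .whl file are passed this way.
        "DefaultArguments": {"--extra-py-files": wheel_uri},
        "MaxCapacity": 0.0625,              # smallest Python shell capacity
    }


def create_and_run(job_definition):
    """Create the job and start one run; requires AWS credentials."""
    import boto3  # imported lazily so the builders above run without it

    glue = boto3.client("glue")
    glue.create_job(**job_definition)
    run = glue.start_job_run(JobName=job_definition["Name"])
    return run["JobRunId"]
```

For example, `build_copy_sql("public.sales", "s3://my-bucket/data.csv", "arn:aws:iam::123456789012:role/demo")` yields a single `COPY` statement that the ETL script could execute against Redshift once the target table exists.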

Comments

This was extremely helpful! I really like that you are able to compress such valuable information into just 8 mins! I think it would be really useful to see how to build an ETL pipeline in an IaC framework. Haven't seen many on the web! Thanks!

GiovanniDeCillis

You're a hero for the well-explained content and then answering everyone's comments. :)

calvinbutler

Subscribed!!! Thank you so much for the great content!! Can you please make dedicated videos on how to use AWS Glue, Triggers, Lambda functions and Athena for ETL pipeline?

satishmajji

Thanks, great video! Other examples I have seen used a crawler to write the schema of the Redshift table to the Data Catalog before loading with a Glue job.
If I wanted to do this using only a visual Glue job, without a crawler, is it possible?

kofio

Thanks!!! But I have a question: if my data comes from an API, is S3 not necessary?

ArniFuentes

Hi, how can we read the credentials from connections or secrets in an AWS Glue Python shell job? It is not working for me.

koyalmudi

Hi - thanks for such concise content! I noticed that you deployed to S3 without debugging locally. Suppose I wanted to test the ETL script before deploying it. Is there a way to execute the etl.py script on the local host using aws_cli?

joegenshlea

Hi, can we transfer 1 TB of data from S3 to Redshift using Glue or Lambda + Glue?

PawanKumar-glyw

Related videos

AWS Glue Tutorial for Beginners [FULL COURSE in 45 mins]

AWS Glue Tutorial for Beginners| Learn everything about Glue in 30 mins| Glue Data Catalog| Glue ETL

What is AWS Glue? | AWS Glue explained in 4 mins | Glue Catalog | Glue ETL

AWS Glue Tutorial | Getting Started with AWS Glue ETL | AWS Tutorial for Beginners | Edureka

Working with AWS Glue Studio - Part 1

AWS Glue | How to interactively develop Glue ETL Job?

Getting started with AWS Glue | Hands-On | Basic end-to-end transformation | AWS Glue tutorial | p2

AWS Hands-On: ETL with Glue and Athena

How to create table in AWS Glue Catalog using Crawler | AWS Glue Tutorials | Hands-on tutorial

Why Data Engineers Should Develop AWS Glue Jobs Locally

AWS Glue | How to create Glue Catalog Tables | Query your S3 Data | AWS Athena

Learn about AWS Glue workflow in very easy way

What is AWS Glue? | AWS Glue Tutorial | Introduction to AWS Glue | AWS Tutorial | Simplilearn

AWS Data Engineer Project | AWS Glue | ETL in AWS

ETL | AWS Glue | AWS S3 | Data Cleansing | Transforming data with AWS Glue in ETL workflows

AWS Tutorials – Building ETL Pipeline using AWS Glue and Step Functions

AWS Glue Practical | AWS Glue Tutorial | AWS Data Engineer

How to Build Your Own Version of AWS Glue Bookmark to get Only New Incremental Files

AWS Glue Data Catalog | Glue Database, Crawler, Connections, Classifiers explained | Glue tutorial-2

Building a Data Lake on AWS with AWS Glue, Glue Studio, Amazon Athena, and S3

Practical Projects to Learn Data Engineering On AWS

Create a Crawler to populate an AWS Glue data catalog

Simplify and Fast-Track ETL Modernization with AWS Glue - AWS Online Tech Talks