AWS Tutorials - AWS Glue Pipeline to Ingest Multiple SQL Tables

There are scenarios where one has to ingest data from multiple SQL tables into the data lake. This raises the debate about whether to use an individual Glue job and pipeline for each table or a single Glue job and pipeline for all of them. This tutorial discusses the debate in detail and demonstrates the single-pipeline, single-job scenario.
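
As a rough illustration of the single-job approach, the sketch below builds one job-run request per table, all pointing at the same Glue job. It is not taken from the video: the job name, the table names, the S3 prefix, and the argument keys `--source_table` and `--target_path` are hypothetical placeholders. The only Glue convention assumed is that custom job arguments are passed as keys prefixed with `--`.

```python
from typing import Dict, List


def build_job_runs(job_name: str, tables: List[str], target_prefix: str) -> List[Dict]:
    """Return one job-run request per table, all targeting the same Glue job.

    The argument names (--source_table, --target_path) are illustrative
    assumptions, not names used in the tutorial.
    """
    prefix = target_prefix.rstrip("/")
    return [
        {
            "JobName": job_name,
            "Arguments": {
                "--source_table": table,
                "--target_path": f"{prefix}/{table}/",
            },
        }
        for table in tables
    ]


if __name__ == "__main__":
    # Hypothetical job name, tables, and bucket, for demonstration only.
    requests = build_job_runs(
        "ingest-sql-table",
        ["customers", "orderdata"],
        "s3://example-data-lake/raw",
    )
    for request in requests:
        print(request)
```

Each dictionary has the shape expected by the boto3 Glue client call `start_job_run(**request)`, and inside the job script the values could be read with `getResolvedOptions(sys.argv, ["source_table", "target_path"])` from `awsglue.utils`. An orchestrator such as a Step Functions map state could pass the same per-table arguments instead of a Python loop.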

Comments

Thanks for the great content. The videos are relatable to real-world problems, which is great. Looking forward to more like this, and if possible, please put all these steps on your website as well, so they are easy to compare during practice sessions. 😀

terrcan

Hello sir.
Do you have any content about how to ingest from an external DB in a Glue ingestion job, using a VPC (such as using a connection to a MySQL or SQL Server data source instead of an AWS Redshift source)?

cassianocalimansantos

How can I create a parameterized AWS Glue job, but with CDC ingestion? In this case the job will run continuously every 5 minutes to update data (or do an upsert). Is there a way to do upserts in a generic (or parameterized) way?

deveshv

Very nice explanation and implementation... thank you so much!

durgarasane-kolapkar

Hi, if orderdata fails to write to the destination, will the others fail too, or does the flow keep running?

nagarjunau

Can you please also do a concurrent run on a workflow?

veerachegu

Is it more cost-effective to have a single job run multiple times, or multiple jobs each run once?

debaratiaich

Great content from new sub. Please do more big data stuff!

khandoor

Nice content. A video on CDC would be great!

sonynavi

I created a step function just like yours, but my step function runs forever.

IsmaelRDeMelo

AWS Glue Tutorial for Beginners [FULL COURSE in 45 mins]

AWS Glue Tutorial for Beginners [NEW 2024 - FULL COURSE]

What is AWS Glue? | AWS Glue explained in 4 mins | Glue Catalog | Glue ETL

AWS Tutorials - AWS Glue Studio vs. Glue DataBrew

AWS Tutorials - AWS Glue Job Optimization Part-1

AWS Glue Tutorial | Getting Started with AWS Glue ETL | AWS Tutorial for Beginners | Edureka

AWS Tutorials - AWS Glue Studio integration with Code Repository

AWS Glue Tutorial for Beginners| Learn everything about Glue in 30 mins| Glue Data Catalog| Glue ETL

AWS Tutorials - AWS Glue Data Quality - Automated Data Quality Monitoring

AWS Tutorials - Using AWS Glue Workflow

AWS Tutorials - AWS Glue Job Optimization - Flexible Job Execution

AWS Tutorials - Incremental Data Load from JDBC using AWS Glue Jobs

AWS Tutorials – Building Event Based AWS Glue ETL Pipeline

AWS Tutorials – Building ETL Pipeline using AWS Glue and Step Functions

What is AWS Glue?

AWS Glue | AWS Glue Tutorial | AWS Glue ETL | AWS Tutorial for Beginners | Intellipaat

AWS Tutorials - Introduction to AWS Glue DataBrew

AWS Tutorials - Joining Datasets in AWS Glue ETL Job

What is AWS Glue? | AWS Glue Tutorial | Introduction to AWS Glue | AWS Tutorial | Simplilearn

AWS Hands-On: ETL with Glue and Athena

AWS Tutorials - Partition Data in S3 using AWS Glue Job

AWS Tutorials - Access Database using AWS Glue DataBrew

Getting started with AWS Glue | Hands-On | Basic end-to-end transformation | AWS Glue tutorial | p2