🚀 Architecting an AWS Big Data Pipeline with File Tracking, Audit Logging, & Real-Time Monitoring!

In this video, we address a common interview question: 'How would you design an event-driven data pipeline to meet a tight 20-minute SLA for data transformations and loading into Redshift or Snowflake while ensuring comprehensive file tracking, audit logging, and robust monitoring?'
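As a rough illustration of the file-tracking and SLA parts of that question, here is a minimal sketch of what an S3-triggered Lambda might compute before writing to a tracking table. It is not the code from the video: the record fields (`file_id`, `status`, `received_at`, `sla_deadline`), the status values, and both function names are hypothetical, and the actual write to a store such as DynamoDB is left out so the logic stays runnable on its own.

```python
from datetime import datetime, timedelta, timezone
from urllib.parse import unquote_plus

# The 20-minute SLA from the interview question.
SLA = timedelta(minutes=20)


def tracking_records(s3_event, received_at=None):
    """Build one tracking record per object in an S3 event notification.

    The record layout is an assumption for illustration only.
    """
    received_at = received_at or datetime.now(timezone.utc)
    records = []
    for rec in s3_event.get("Records", []):
        bucket = rec["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(rec["s3"]["object"]["key"])
        records.append({
            "file_id": f"s3://{bucket}/{key}",
            "status": "RECEIVED",  # hypothetical status value
            "received_at": received_at.isoformat(),
            "sla_deadline": (received_at + SLA).isoformat(),
        })
    return records


def sla_breached(record, now=None):
    """Return True if the file is not yet loaded and its deadline has passed."""
    now = now or datetime.now(timezone.utc)
    deadline = datetime.fromisoformat(record["sla_deadline"])
    return record["status"] != "LOADED" and now > deadline
```

A monitoring job could run `sla_breached` over the open records on a schedule and raise an alert for any file that returns True.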

Prerequisite:
----------------------
An automated data pipeline using Lambda, S3 and Glue - Big Data with Cloud Computing
Build and automate a Serverless Data Lake using AWS Glue, Lambda, CloudWatch

Code:
---------

Check this playlist for more Data Engineering related videos:

Apache Kafka from scratch

Messaging Made Easy: AWS SQS Playlist

Snowflake Complete Course from scratch with End-to-End Project with in-depth explanation

Explore our vlog channel:


#aws #datapipeline #interviewquestions #eventdrivenarchitecture #etl #dataengineering #snowflakes #s3 #lambda #glue #dynamodb #cloudwatch #monitoring #awsarchitecture
Comments

Finally I found exactly what I was looking for...

kmuralikrishna

Thanks for the great video. I am submitting a Spark job to EMR from Lambda. The Lambda reports success regardless of the Spark job's status. How can I get the status of the EMR job so that it can kick off the remaining Lambda functions?

mirli

Hi,
I get the following error when I try to run the pipeline, and I have been trying to get past it for two days:
'when calling the StartJobRun operation: Failed to start job run due to missing metadata'

Please note: the job name produced in Lambda is correct, and the region is set correctly.

fazaljamadar

Hi, it's a great video. I have a real task that involves a similar data type conversion. With a concurrency setting of 10, do you know how long such a job would take to convert 1 TB of data?

sabrinawang

How can the same use case be implemented with EMR, without using Glue? Is that possible with EMR?

gontlakowshik