Fast and Easy Spark ETL with AWS Glue

preview_player
Показать описание
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. Spark jobs in AWS Glue will now be able to start in under one minute, improving interactivity and reducing over-all job completion times.

Subscribe:

#AWS #AWSGlue
Рекомендации по теме
Комментарии
Автор

I used it day in and day out.
We never used the actual aws glue code but we run pyspark & python over it.
Glue acts as a chasis for our pipelines, its very easy to configure everything right from referencing a jar file to establishing on premises database, fully customizable and very easy to create the CFT for the same via jenkins.


PS : You Guys are awesome :)

SusiEzhil
Автор

This is what I was looking for. Micro-batching ETL soln with low resource allocation time. Thanks.

gaurivankudre