AWS re:Invent 2021 - Building a data lake on Amazon S3

preview_player
Показать описание
Flexibility is key when building and scaling a data lake, and by choosing the right storage architecture, you will have the agility to quickly experiment and migrate to AWS. This session explores best practices for building a data lake on Amazon S3, which allows you to leverage industry-leading AWS, open-source, and third-party analytics and ML tools and gain insights from your data. This session also explores how to optimize your storage on Amazon S3 for data lakes, including information on storage classes, S3 access points, and running HPC workloads with Amazon FSx for Lustre.


Subscribe:

ABOUT AWS
Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts.

AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

#AWS #AmazonWebServices #CloudComputing
Рекомендации по теме
Комментарии
Автор

This is very eye opening. I used AWS for several years and never thought S3 could serve such purpose. this is fantastic!

MrDottyrock
Автор

This was super duper amazingly wonderful to watch !

umairqamar
Автор

This video clearly explains about the storage-s3. Very good video to learn about s3

samuel_william
Автор

This is a very helpful intro into days lake design with S3. Thank you

djohnjimmy
Автор

We had 200gb of data in MS OLAP in 2008 coming out of terabyte ERP system. Not sure where getting his numbers from. 9:46

rifkiamil
Автор

This is very helpful. Thanks very much

mbaapohelviszonepoh
Автор

Thank you for this, this was very helpful

severtone
Автор

Hadoop replicates data in three different nodes, so we need to lose 2 nodes before we start to worry about data loss. He said we need to lose 3 data nodes.:)

yogenderpal
Автор

Not much on data lake, more of a talk about S3 features for what it’s intended for, data storage. Actual data analysis and reporting is done with other AWS services.

lordlee