AWS re:Invent 2018: Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321)

Показать описание

Come to this session to learn how Amazon DynamoDB was built as the hyper-scale database for internet-scale applications. In January 2012, Amazon launched DynamoDB, a cloud-based NoSQL database service designed from the ground up to support extreme scale, with the security, availability, performance, and manageability needed to run mission-critical workloads. This session discloses for the first time the underpinnings of DynamoDB, and how we run a fully managed nonrelational database used by more than 100,000 customers. We cover the underlying technical aspects of how an application works with DynamoDB for authentication, metadata, storage nodes, streams, backup, and global replication.

Рекомендации по теме

Комментарии

Amazing. Just what I needed. Thank you for making this available here.

shawapratim

This one is an awesome talk. Jaso goes through important innards of DynamoDB, including secondary indexes, throttling, burst capacity, accumulating read capacity, adaptive capacity, etc.

dimkir

very nicely described. kept on watching and watching. came back to this video multiple times when i understood more and more.

saumilkapadia

Key takeaways:
- storage nodes acknowledge a WRITE (on a partition) back to the request router if 2 out of the 3 storage nodes for the affected partition successfully completed the WRITE. So when you perform an "eventually consistent READ", the odds to get a strongly consistent result are 2/3.
-The 3 storage nodes of a partition decide between themselves which one will be the partition leader.
- Leader nodes store the partition data into a B-tree index, and they also maintain a "replication log" (a kind of TX log).

galeop

0:00 Intro

2:39 Agenda
- GetItem/PutItem
- Auto Scaling
- Backup Restore
- Streams
- Global Tables

3:04 GetItem/PutItem
- Request Router
- Paxos
- Partition Metadata System
- Tables (hashing, partitioning)
- Eventual Consistency
- Storage Nodes (B-tree, Replication Log)
- System Management (Auto Admin, Partition Repair)
- Secondary Index (Log Propagator)
- Provisioning Table Capacity
- Adaptive Capacity

28:48 Auto Scaling

33:54 Backup Restore
- Point in Time
- On-demand backup

41:43 Streams

44:45 Global Tables

CAMorales

I've read both Dynamo 2007 and DynamoDB 2022 papers and this is a great summary. Thanks.

ahmxtb

Good talk! Seems like a industry guest talking about dynamodb in classroom.

rahulat

Trying to guess the PID controller part, seems it only needs the P(proportion)?

let multiplier = 1.5

Let's say we want to allow roughly 60 seconds of 150% consumed over provisioning.

Then on each iteration(say, a second), do:

multiplier = multiplier + (provisioned - consumed) / provisioned / 60 # 60 is for 60 seconds
multiplier = min(1.5, max(1, multiplier)) # clamp it between 1 & 1.5

This way the consumed can surge to 1.5 times provisioned and will slowly go down to 1, then keep as 1 until consumed is below provisioned and slowly bring multiplier back to 1.5

frankren

At 9:21, could the client be talking to the request router (RR) in an availability zone (AZ) and it isn't necessary that it has the leader storage node, and therefore the RR might have to send the request to the storage node leader to a different AZ. Nothing wrong with that but I wonder if there are any performance savings in having writes go directly to the RR in the AZ with the leader storage node.

Himanshu-mbnl

18:00 I guess we're talking about *global* 2ndary indexes here, and not *local* 2ndary indexes, correct ?
Does this propagation of a new value from the main table to the 2ndary index happen asynchronously?

galeop

I think it's looks like nanoseconds but actually milliseconds? 50:42

Tommy-ddpz

Does the replication log store the entire history of the data or just current state ?

Mike-ciio

the last few sentences are like "we are hiring; come and work with us" lol

kevin

Where can I download the PPT, please?

youran

Dynamodb is awsome for bigdata works but it is really hard to work with CRUD operations

sagara

Might it worth blowing your nose (away from the Mic ! ) & clearing you air waves before start of your session !

arifmalikoracledba

what db does scamazon use for thier own shopping?

grousemoriarty

AWS re:Invent 2018: Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321)

AWS re:Invent 2018: Amazon DynamoDB Deep Dive: Advanced Design Patterns for DynamoDB (DAT401)

AWS re:Invent 2018: Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321)

AWS re:Invent 2018: A Deep Dive into What's New for Amazon DynamoDB (DAT201)

AWS re:Invent 2018: [NEW LAUNCH!] Building modern apps using Amazon DynamoDB transactions (DAT374)

AWS re:Invent 2018: How Oath Built a Multi-Region GDPR Application with Amazon DynamoDB (DAT325)

AWS re:Invent 2018: Becoming a Nimble Giant: How Amazon DynamoDB Serves Nike at Scale (DAT320)

AWS re:Invent 2018: Why GE Aviation Migrated from Cassandra to Amazon DynamoDB (DAT332)

AWS re:Invent 2018: Migrating Your NoSQL Database to Amazon DynamoDB (DAT314)

AWS re:Invent 2018: Protecting Your Greatest Asset: Security Best Practices on DynamoDB (DAT303)

AWS re:Invent 2018: Leadership Session: AWS Database and Analytics (DAT206-L)

AWS re:Invent 2018: [REPEAT] Building IoT Applications for a Smart Home, ft. Vestel (IOT306-R)

AWS re:Invent 2018: Build Business-Ready Blockchains with Intelligence (GPSTEC315)

AWS re:Invent 2018: Scaling a Fantasy Sports Platform with Amazon ElastiCache & Amazon Aurora ST...

AWS re:Invent 2018: [REPEAT 1] Scaling Up to Your First 10 Million Users (ARC205-R1)

AWS re:Invent 2018: [REPEAT 1] Managing Modern Infrastructure in Enterprises (ENT227-R1)

AWS re:Invent 2019: Scale fearlessly with Amazon DynamoDB adaptive capacity (DAT304)

AWS re:Invent 2019: Data modeling with Amazon DynamoDB (CMY304)

AWS re:Invent 2018: [REPEAT 1] Become an IAM Policy Master in 60 Minutes or Less (SEC316-R1)

AWS re:Invent 2018: Amazon EC2 Instances & Performance Optimization Best Practices (CMP307-R1)

AWS re:Invent 2019: [REPEAT 1] Amazon DynamoDB deep dive: Advanced design patterns (DAT403-R1)

AWS re:Invent 2018 - Daily re:Port - Monday Night Live

AWS re:Invent 2018: Amazon EC2 T Instances – Burstable, Cost-Effective Performance (CMP209)

AWS re:Invent 2019: How Verizon Media implemented push notification using Amazon DynamoDB (DAT205)

AWS re:Invent 2018: Augmenting Security & Improving Operational Health w/ AWS CloudTrail (SEC323...