Data Vault vs Traditional Data Warehouse Architectures

preview_player
Показать описание


Let's take a look at an overview of the Data Vault Architecture for data warehousing. What are it's goals as an alternative to Kimball and Inmon's approaches. And how does it compare to dimensional data modeling, and when should we consider using the Data Vault.

⏯RELATED VIDEOS⏯

------------------------------------------------------------------------------

------------------------------------------------------------------------------

🎓Data courses (Not Produced by nullQueries)🎓

------------------------------------------------------------------------------

📷VIDEO GEAR📷

💻VIDEO SOFTWARE💻

------------------------------------------------------------------------------

Some of the links in this description are affiliate links and support the channel. Thanks for the support!

------------------------------------------------------------------------------
00:00 Intro
00:28 Goals of the Data Vault
01:27 Architecture
02:10 Modeling Terms
03:16 Modeling Example
03:39 ETL
04:08 Reporting
04:32 Pros and Cons
Рекомендации по теме
Комментарии
Автор

What do you think of the data vault compared to the dimensional data warehouse? Have you built both?

nullQueries
Автор

This is a wonderful video. Unfortunately for me, I read 450 pages from Dan Lindstedt's book introducing the data vault 2.0 architecture. This is, hands down, the worst book I have ever ready. It is just horrible. However, it does contain about 7 good ideas and this video captures all of them in a nicely presented coherent way. Thank you!

stephanzhechev
Автор

One of the best video's out there regarding Data Vault modelling

srikanthmanduri
Автор

I enjoy your videos quite a bit, just a few pieces of constructive criticism:
I feel like a little bit more space between sentences to let the viewer digest what is being said/shown would help a lot.
I like the clean look of the visuals, but the text labels etc. help make things easier to visually process.
I think the visual example you did with the tables in this one was good, more real examples like that for what these concepts actually look like in the real world, even just as examples helps drive the points hope.

Looking forward to seeing your channel grow, keep up the good work!

CrazySwde
Автор

We can use Data Vault (incremental loading) + Inmon (EDW) + Kimbal (Star schema) + Deta Lake (ELT = Bronze, Silver and Gold data movements) methodologies and use all of them at the same time. The core will be Kimball + Inmon.

Leo-DatabaseConsultant
Автор

My data engineering team have built many data vaults, but could never quite articulate to me as a business leader why? This has been very educational for me in explaining the benefits vs complexity. The pace that business is changing and the number of new data sources that become available makes a data vault seem a more obvious choice. The business still gets its Inmon Kimble model, but the foundational data structures in the Vault provide more capability to make changes to them. That's what this inferred to me. I hope I am on the right mark.

paulheadey
Автор

thanks for make this kind of videos, i really appreciate it, they are so useful for people like me who are learning about it

michaelenriquez_
Автор

I went from the 3NF video to the dimensions one to this one and I feel like the only advantage I see is the dimension/Kimball one. This data vault seems just overkill. The storage will increase exponentially with all the extra keys needed and with very large storage of millions/billions of rows the performance I suppose will be greatly impacted when querying all those keys. Why is this an easier ETL solution? Am I missing something?

timpcutata
Автор

Data vault is the curated layer in a data lake. And they have a very specific design... But really its an inmon/operational design

MrCutlash
Автор

Very well explained with good examples, this is very helpful!

SjeetjeMineetje
Автор

Well explained in pictorial format. But there should be some use case or an example so the newbies can understand more easily.

bytedonor
Автор

Would it happen that you guys have a transcript of this video? maybe posted in a blog post?

pedropradocarvalho
Автор

First time watching your videos and I absolutely love them! Subbed and liked. It'd be even more awesome if you could allow for an extra second to digest what you're saying. It's a lot of useful information. But even if you don't change anything, I'll still be a fan! Thank you for this!

Sam-gjhf
Автор

Great videos .. very informative ...can you do a quick comparison between Redshift & Vertica? an overall evaluation?

ardee
Автор

Hi. Thank you for this overview video. Do you have also a webpage where you can be contacted? Would be happy to get your thoughts about DWH automation (we are the creators of the Datavault Builder tool). Regards

pbpb
Автор

This video is very good but I need to clarify the ETL Process. Supposed I have a few raw files yet to be stored. They are placed inside the data lake unmodified. From there, I insert the data as hubs, link tables and satellites tables into the raw vault, creating surrogate keys along the way. Is that right? And what does 'since objects in each layer never connect to each other' mean? 4:01

treelo
Автор

Really good video! Thank you!
Quick question: what do you mean by "Business logic"? Do you mean that kind of logic that would be used with an MDM, to control whether new attributes about an entity should be added or ignored (eg if we have conflicting phone numbers for a customer)?

galeop
Автор

Nice video, where can we learn about the other data warehouse format?

mosa
Автор

All those fancy pictures make zero sense without real live examples, just think about it

thghtfl
Автор

Can I just say "Dimensional Datamart" is my favorite cyberpunk term

christopherbronson
welcome to shbcf.ru