Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)

preview_player
Показать описание
New course announcement ✨

We're teaching an in-person LLM bootcamp in the SF Bay Area on November 14, 2023. Come join us if you want to see the most up-to-date materials building LLM-powered products and learn in a hands-on environment.

Hope to see some of you there!

--------------------------------------------------------------------------------------------- In this video, you will dive into the data management bucket within the ML infrastructure landscape, exploring the tools and software to ingest, store, process, explore, label, and version datasets.

00:00 - Introduction
00:49 - The Common Data Management Path for Deep Learning
04:22 - Data Sources
11:23 - Data Storage
29:00 - Data Processing
36:02 - Feature Stores
42:00 - Data Exploration
43:42 - Data Labeling
51:28 - Data Versioning
57:03 - Data Privacy
Рекомендации по теме
Комментарии
Автор

Thank you for this detailed lecture. Been following the serious from Lecture 1. The slides are well formatted and the lecture well organized. Enterprise/commercial and opensource offerings are discussed for each of the components and recommended suggestions on what tool to adopt or process to follow.

jeromeeusebius
Автор

Enjoying this playlist but I'm not clear on what you mean by "everything" is in RAM (20:20). I think regular databases can store way more than their RAM

alexbelling
Автор

Thanks guys for really wonderful work.

SalehElm
Автор

Can't know how I bumped onto this. All in all GREAT clip ❤️😄. I also have been watching those rather similar from mStarTutorials and kinda wonder how you guys make these clips. MSTAR TUTORIALS also had amazing information about similiar things on his vids.

knuttlaarsen
join shbcf.ru