Versioning Data with DVC (Hands-On Tutorial!)

preview_player
Показать описание
This tutorial is for total beginners to get started using DVC and Git to version data, models, and more. If you're completely new to DVC and want a gentle walkthrough of the basic features, this is for you!

We'll be mirroring the first "Getting Started" feature from the official DVC docs, in which we learn how to:

- Set up Git and DVC in a project
- Configure remote storage (here, Google Drive!) for your project
- Create a new version of a dataset
- Checkout a previous version of a dataset

This is the first of 4 videos in the DVC Basics series.

⚠️𝗡𝗢𝗧𝗘!
As this video is one of our older ones, please refer to these docs if getting started with setting up your Google Drive is proving challenging:

Рекомендации по теме
Комментарии
Автор

I'm just starting to get acquainted with MLOps at work, and I must say, this entire series of videos is SUPER useful! I'll be recommending this channel to everyone I know in my circles. Thanks a ton for everything you do, Elle! :D

prraoable
Автор

Thank you so much for this tutorial. I was struggling to understand how exactly DVC connected to cloud services, and your Google Drive example was extremely clear and simple.

albertkim
Автор

Best course on DVC on the whole internet

ankitraj_
Автор

Even I do understand DVC after seeing this explanation. Thank you very much.

reiniersuurenbroek
Автор

Great explanation of the basics, thanks a lot Elle!

carl
Автор

Elle, thanks for this tutorial! Easy to follow, clear examples and super video effects! )))

mishanotebook
Автор

Really useful tutorial 💚
waiting for more DVC stuff 😍

arminarlert
Автор

-d flag is not working in 'v3.42.0' at 3:33. --default flag worked perfectly also the authentication is changed by little bit.

This is for those who are watching in 2024.

CRCE__Hardik_Prajapati
Автор

This series was super useful, and very easy to understand. Thanks!

shift_escape
Автор

Can we do versioning for Image and Video datasets using DVC? If so, please point out or make a tutorial video for the same. Thanks

sourabhyadav
Автор

Thanks for the intro! This was super helpful. Now I'm trying to use DVC for a computer vision project but I'm running into issues. My dataset is not too large ~40GB, I ran `dvc add` and it's been running for almost half an hour. Is this normal? I'm trying to find tutorials going through how to manage CV datasets but no luck. Do you know of any tutorials/documentation that could help me?

sabrinapereira
Автор

extremely helpful hands-on, thank you!

siegmeyer
Автор

11:10 "It's very difficult to name these things" 🤣. Great tutorial!

yeraydavidrodriguezdomingu
Автор

This is awesome! I love it! Nice tutorial too!

MattCamp
Автор

Thanks for your informative video! it helped alot keep up the good work

wolfcoin
Автор

This is very useful! Thanks for this video!

surajit.techie
Автор

I enjoyed the video, but I have a question: isn't Git LFS accomplishing the same goal? My understanding is that we can already use Git LFS to store large files outside of our repositories, but still track their versioning. What would be the advantage of using DVC instead of Git LFS in this case?

douglasemsantos
Автор

Hi! Building a DVC on AWS S3, and got an error while dvc push: ERROR: failed to transfer 'md5: - 405, message='Method Not Allowed',
Any ideas why?...

DataScienceGarage
Автор

hi! thank you for the video!
But I have a problem that when I "dvc push", google accounts links are 400: invalid_requests . Do you have any idea?

sonbomme
Автор

Thanks for the tutorial. How is it different from Git LFS?

GunjaPapi