Machine Learning Experimentation with DVC and VS Code

Показать описание

Co-hosted by FourthBrain and Iterative. Learn how to manage and make your machine learning projects reproducible with an open-source tool DVC and its extension for VS Code. We will see how to track datasets and models, run, compare, visualize, and track machine learning experiments right in VS Code.

Link to the repository with code:

Link to Alex Kim's Github:

Iterative builds DVC, CML, and other developer tools for machine learning. They're a well-funded, remote-first team on a mission to solve the complexity of managing datasets, ML infrastructure, and ML models lifecycle management.

——
00:00 About today's talk
03:44 Introduction
04:27 The problem we want to solve
05:31 What happens next: Goals
06:12 Goal #1: Achieve best performance
06:59 Goal #2: Ensure reproducibility
08:19 Goal #3: Minimal setup and dependency of 3rd party services
10:18 Why it's difficult to achieve all three goals (Same experiments/different metrics)
12:44 When in doubt go with Open-Source Software
14:20 Open-Source tools: Git, Visual Studio Code & DVC
16:08 DVC: What is DVC?
17:25 DVC: What are DVC pipelines?
20:52 DEMO: Initial setup
24:25 DEMO: Start experimenting
27:21 DEMO: Automating experiments, Grid Search
29:24 DEMO: Keeping one of the experiments
31:47 DEMO: Some comments on the process
33:11 DEMO: How does DVC handle Model and Data files?
35:47 Summary
37:37 Alex's take on MLOps, DevOps and GitOps
42:47 Alex's take on the difference between Data Scientist, ML Engineer and MLOps Engineer
47:45 Alex's take on data preparation and the overall ML pipeline
51:10 Alex's take on when W&B and MLFlow might be unreliable
53:13 Conclusion

Рекомендации по теме

Комментарии

Alex Kim is a dope instructor! Thanks for sharing.

saliexplore

This is super cool, thanks for sharing! I would love to see something similar but for large computer vision datasets.

sabrinapereira

DVC looks very nice. One important thing, is there support for letting optuna set my hyperparameters?

DrJohnnyStalker

Can we use DVC for chatGPT for prompts to return certain type of data consistently, it is a hit and a miss, would be nice to see what is working and not

aiautoglasscrm

Machine Learning Experimentation with DVC and VS Code

Machine Learning Experiments with DVC (Hands-On Tutorial!)

Machine Learning Experiments with DVC Studio

Machine Learning Experimentation with DVC and VS Code

Managing Machine Learning Experiments with DVC

Machine Learning Experimentation in VS Code with DVC Extension

From Jupyter Notebooks to Reproducible and Automated experiments with DVC, MikhailRozhkov

Machine Learning Experimentation in VS Code: Introducing our DVC Extension for VS Code!

Machine Learning Experiment Tracking with VS Code, Git, and DVC

DVC: data versioning and ML experiments on top of Git, Dmitry Petrov

Optimizing Image Segmentation Projects with DVC

ML Experimentation with DVC and VS Code

Introducing Machine Learning Cloud Computing Experiments with DVC Studio

Workflow & MLOps for batch scoring applications with DVC, MLflow and Airflow, Mikhail Rozhkov

ML Experiment Tracking with DVC and DagsHub Logger (Dagshub Tutorial)

DVC Extension for VS Code: Shareable DVC Studio Machine Learning Experiment Links

Reproducible Machine Learning & Experiment Tracking Pipeline with Python and DVC

Using DVC Experiments in Transfer Learning

Rob de Wit - Becoming a Pokémon Master with DVC: reproducible machine learning experiments

Tracking your Data Experiments with MLflow and DVC | Jeremy Jackson ThoughtWork | GeekNight

7. End to End Machine Learning Portfolio Project | Experiments Tracking DVC | MLOps | DVC Pipeline

ML Experiment Versioning with DVC #shorts

ML Experiment Versioning: Don't just Track your Machine Learning Experiments, Version Them with...

Becoming a Pokémon Master with DVC: Reproducible Machine Learning Experiments | Rob De Wit

Antoine Toubhans: Flexible ML Experiment Tracking System for Python Coders with DVC and Streamlit