Effective Resampling for Machine Learning in Tidymodels {rsample} R package reviews

In this video, we'll learn how to use the three most important resampling techniques: train-test split, cross-validation, and bootstrapping. We'll also cover random sampling and stratified sampling for numeric and categorical variables. Grouped resampling and time-specific resampling are not in the video; they deserve a separate video each.
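
To make the three techniques concrete, here is a minimal, language-neutral sketch in plain Python. It is not the {rsample} API and not the code from the video: every function name below is hypothetical, and binning a numeric variable by rank is only one common way to stratify on it. The terms "analysis" and "assessment" are borrowed from rsample's vocabulary for the two parts of each resample.

```python
import random
from collections import defaultdict

# All names below are hypothetical helpers for illustration only.

def train_test_split(n, prop=0.75, seed=0):
    """Randomly split row indices 0..n-1 into train and test parts."""
    rng = random.Random(seed)
    idx = list(range(n))
    rng.shuffle(idx)
    cut = round(n * prop)
    return sorted(idx[:cut]), sorted(idx[cut:])

def stratified_split(strata, prop=0.75, seed=0):
    """Split within each stratum so both parts keep the strata proportions."""
    rng = random.Random(seed)
    by_stratum = defaultdict(list)
    for i, s in enumerate(strata):
        by_stratum[s].append(i)
    train, test = [], []
    for members in by_stratum.values():
        rng.shuffle(members)
        cut = round(len(members) * prop)
        train += members[:cut]
        test += members[cut:]
    return sorted(train), sorted(test)

def quantile_bins(values, bins=4):
    """Turn a numeric variable into rank-based bins usable as strata."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    strata = [0] * len(values)
    for rank, i in enumerate(order):
        strata[i] = rank * bins // len(values)
    return strata

def k_fold(n, k=5, seed=0):
    """Yield (analysis, assessment) index pairs; each row is assessed once."""
    rng = random.Random(seed)
    idx = list(range(n))
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for held_out in folds:
        held = set(held_out)
        yield [i for i in idx if i not in held], held_out

def bootstrap(n, times=3, seed=0):
    """Yield (in_bag, out_of_bag): n draws with replacement, plus unused rows."""
    rng = random.Random(seed)
    for _ in range(times):
        in_bag = [rng.randrange(n) for _ in range(n)]
        chosen = set(in_bag)
        yield in_bag, [i for i in range(n) if i not in chosen]
```

For a numeric outcome, passing `quantile_bins(values)` as the `strata` argument of `stratified_split` keeps the low, middle, and high ranges represented in both the training and the test part.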

If you only want the code (or want to support me), consider joining the channel (the Join button is below every video), because I provide the code at members' request.

Enjoy! 🥳

Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;)

This channel is dedicated to data analytics, data science, statistics, machine learning, and computational science! Join me as I dive into the world of data analysis, programming & coding. Whether you're interested in business analytics, data mining, data visualization, or pursuing an online degree in data analytics, I've got you covered. If you are curious about Google Data Studio, data centers, or certified data analyst & data scientist programs, you'll find the necessary knowledge right here. You'll greatly increase your odds of earning an online master's degree in data science or data analytics. Boost your knowledge & skills in data science and analytics with my engaging content. Subscribe to stay up to date with the latest & most useful data science programming tools. Let's embark on this data-driven journey together!

Comments

I have been watching R tutorials for 7 years now. Your visual representations and explanations are the *best* I've ever seen so far.

hansmeiser

What I love the most about this YouTube channel is that the quality of the free tutorials is much better than that of many paid ones. I must admit that you have a talent for turning such a complex topic into something very easy to follow. A true professor you are. I really don't know how to thank you.
I would be so grateful if you created a tutorial on machine learning using the tidymodels package.
From the bottom of my heart, thank you.
❤❤❤❤

muhammedhadedy

Thank you! It helped me a lot! Excellent explanation!

aiz_i

Even though I know very little about modeling, it is clear that this package kicks ass! The animations in this video illustrating the concepts being described are superb!

wbdill

Nice lecture on resampling. Please make a video on simulation studies.

SUNILYADAV-tvze

Excellent! Great animations and clear explanations. Thanks!

Incredibly useful channel, love your work.

rodrigonehara

Thanks for the vid, sir. Can you create a video about using tidymodels for time series modeling and analysis?

joshstat

Kudos to our Boss, thanks a million, sir, the data science guru of the universe.
Sir, could you make a tutorial on handling class imbalance in binary classification?

abdulmusa

Good to know about Monte Carlo CV in that package. Where does Monte Carlo fit in the variance|bias continuum vis-a-vis bootstrapping and k-fold cross-validation?

I tend to use 'caret' instead, where I dial in these model performance tests via trainControl.

We need a vid on sub-space clustering. 😉

chacmool

Very well explained 👍 Thanks for sharing 🙏 Just to be clear, for cross-validation, which model is going to be run on the test data created at the very beginning?

AnimeshSharma

Could I use this approach to remove some bias caused by differing sampling effort? Example: I have a dataset of monthly video records of animal interactions with vertebrate latrines. However, some spots were recorded in different months because some latrines weren't there when I installed my equipment, meaning that the latrines were sampled with different effort. I am looking for a way to correct for this, but since stats are new to me, I was wondering whether the approach you use could be applied in my situation.

Please don't stop doing this, we need this kind of informative, didactic video 🙏🙏🙏. Thank you again for this one!

yurisilvadesouza

At 13:25 you used rand_forest() instead of lm() on code line 124. Could you please clarify?

samirhajiyev

Hello. Can you do a demo on how to do the bootstrap in R? Also, the website link is not working. Thank you

mohamedaddani

Regarding this, yuzaR, I was wondering whether there is something like keeping the percentage of classes in the train-test split, but for numerical values, so that the two samples drawn have the most similar means, SDs, and so on. Thank you :D

galan

So how can we use CV-sampled or bootstrapped models for prediction on a predefined test set?

serhatakay
