Solving Real-World Data Science Problems with Python! (Predicting Healthcare Insurance Costs)

preview_player
Показать описание
Try out the Datacamp platform - Assess your skills, learn Python, SQL, R and more... and get certified as a data professional! (Get 25% off annual subscription with my link)

This video teaches how to predict health insurance costs using Python and machine learning. It covers the full process from cleaning data to building and testing a regression model. Viewers will learn to use pandas for data handling, create visualizations, and apply scikit-learn for linear regression. The tutorial provides hands-on experience with real-world data analysis and predictive modeling.

Video timeline!
0:00 - Video overview
0:47 - What is regression?
2:29 - Getting started with the code
4:30 - Initial regression modeling strategy
6:41 - Task #1: Clean our health insurance data
25:47 - Task #2: Create scatterplots of our variables mapped to charges
31:34 - Task #3: Prepare the data for regression model fitting
41:32 - Task #4: Fit a linear regression model to our dataframe with sklearn
52:11 - Task #5: Test our model on validation data & submit project

#datascience #regression

-------------------------
Follow me on social media!

-------------------------
Learn data skills with hands-on exercises & tutorials at Datacamp!

Practice your Python Pandas data science skills with problems on StrataScratch!

Join the Python Army to get access to perks!

*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
Рекомендации по теме
Комментарии
Автор

Keith loved this. Very useful for refreshing on linear regression.

simondechoisy
Автор

Thank you so much for these uploads, hope you continue them in the future

ameenshareef
Автор

Watched till the end.
I am curious, why didn't you use chatgpt? Also do we have to create pipelines all the time?

husan_ismoilov
Автор

Loved it🎉. 13 children moment was awesome 😂

Optimus_Gaming
Автор

@ 20:52 and 21:21 there's a null value in charges. I checked the raw csv and found some '$nan' entries which didn't get dropped coz we first did .dropna, and then .strip(), I think?

rogueknight
Автор

Great video.
I curious to what presenting a model to stakeholders looks like.

I can’t seem to find that

TJ-ptei