filmov
tv
Solving real-world data analysis problems with Python Pandas! (Lego dataset analysis)
Показать описание
In this video we walkthrough a data analysis project on DataCamp. This project has us walk through a Lego dataset and answer a few questions. To do our analysis we use the Pandas library of Python.
Check out DataCamp!
Link to my GitHub:
From the DataCamp website:
The Rebrickable database includes data on every LEGO set that has ever been sold; the names of the sets, what bricks they contain, what color the bricks are, etc. It might be small bricks, but this is big data! In this project, you will get to explore the Rebrickable database and answer a series of questions related to the history of Lego!
Some skills worked on in this video:
- Reading CSV files with Python
- Filtering DataFrame based on conditional parameters
- Grouping data by column values and aggregating it
btw, I apologize at about the 25-minute mark I started having microphone issues, I'll have it solved by my next video.
Thank you to DataCamp for sponsoring this video :)
-------------------------
Follow me on social media!
-------------------------
Song at the end
Creative Commons — Attribution 3.0 Unported — CC BY 3.0
-------------------------
Practice your Python Pandas data science skills with problems on StrataScratch!
Join the Python Army to get access to perks!
*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
-------------------------
Video Timeline!
0:00 - Introduction
1:05 - Getting started w/ Lego analysis project
2:33 - How to follow along if you are not a premium DataCamp subscriber (GitHub)
4:01 - Project tasks overview
5:40 - Basic exploration of the dataset
9:45 - Task #1: What percentage of all licensed sets ever released were Star Wars Themed?
24:23 - Task #2: In which year was Star Wars not the most popular licensed theme?
34:00 - Bonus Task: How many unique sets were released each year (1955-2017)?
42:26 - Conclusion!
Check out DataCamp!
Link to my GitHub:
From the DataCamp website:
The Rebrickable database includes data on every LEGO set that has ever been sold; the names of the sets, what bricks they contain, what color the bricks are, etc. It might be small bricks, but this is big data! In this project, you will get to explore the Rebrickable database and answer a series of questions related to the history of Lego!
Some skills worked on in this video:
- Reading CSV files with Python
- Filtering DataFrame based on conditional parameters
- Grouping data by column values and aggregating it
btw, I apologize at about the 25-minute mark I started having microphone issues, I'll have it solved by my next video.
Thank you to DataCamp for sponsoring this video :)
-------------------------
Follow me on social media!
-------------------------
Song at the end
Creative Commons — Attribution 3.0 Unported — CC BY 3.0
-------------------------
Practice your Python Pandas data science skills with problems on StrataScratch!
Join the Python Army to get access to perks!
*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
-------------------------
Video Timeline!
0:00 - Introduction
1:05 - Getting started w/ Lego analysis project
2:33 - How to follow along if you are not a premium DataCamp subscriber (GitHub)
4:01 - Project tasks overview
5:40 - Basic exploration of the dataset
9:45 - Task #1: What percentage of all licensed sets ever released were Star Wars Themed?
24:23 - Task #2: In which year was Star Wars not the most popular licensed theme?
34:00 - Bonus Task: How many unique sets were released each year (1955-2017)?
42:26 - Conclusion!
Комментарии