Diamonds Dataset visualized using ggplot2 in R and Python Plotnine

Please find code here:

Chapter 3 R for Data Science

In this video we run tidyverse R in Google Colab. R ggplot2 syntax is used to generate the graphs. We also apply the same ggplot syntax in Python using the plotnine library.

Data

# The diamonds data frame from the R tidyverse, with 53940 rows and 10 variables
# price in US dollars ($326--$18,823)
# carat weight of the diamond (0.2--5.01)
# cut quality of the cut (Fair, Good, Very Good, Premium, Ideal)
# color diamond colour, from D (best) to J (worst)
# clarity a measurement of how clear the diamond is
# x length in mm (0--10.74)
# y width in mm (0--58.9)
# z depth in mm (0--31.8)
# depth total depth percentage = z / mean(x, y) = 2 * z / (x + y) (43--79)
# table width of top of diamond relative to widest point (43--95)
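
The depth formula above can be checked by hand. This is a minimal sketch, assuming the ratio is multiplied by 100 to get the percentage (the 43--79 range suggests so). The function name and the sample measurements are illustrative and not taken from the video.

```python
def depth_percentage(x_mm, y_mm, z_mm):
    """Total depth percentage: z divided by the mean of x and y, times 100."""
    return 100 * 2 * z_mm / (x_mm + y_mm)


# Illustrative measurements in mm (assumed values, not quoted from the video)
x, y, z = 3.95, 3.98, 2.43
depth = depth_percentage(x, y, z)
print(round(depth, 2))
```

The value recorded in the depth column may differ slightly from a result computed this way, because x, y and z are stored rounded.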

# nice explanation

# Some Useful links to explain Diamond Qualities and Values

# Price Database

# IDEX Diamond Index

!pip install pandas plotnine
from plotnine import *