Data Manipulation in R with dplyr

preview_player
Показать описание
- Data manipulation is a way of modifying a dataset
- It is the process of making data more organized and useful.
- Common packages used in data manipulation in R are:
- dplyr

- dplyr has 5 main verbs used for the common data manipulation tasks.
- `mutate()` adds new variables that are functions of existing variables
- `select()` picks variables based on their names.
- `filter()` picks cases based on their values.
- `summarise()` reduces multiple values down to a single summary.
- `arrange()` changes the ordering of the rows.
- These all combine naturally with `group_by()` which allows you to perform any operation **by group**.
Рекомендации по теме