Apache Spark for Data Science #2 - How to Work with Spark RDDs

preview_player
Показать описание
Spark is based on Resilient Distributed Datasets (RDD) - Make sure you know how to use them. This video will teach you RDD basics in Spark.

00:00 Introduction
01:25 Virtual environment setup
03:22 How to start a Spark Session
05:48 How to read datasets with Spark
08:19 Outro

FOLLOW BETTER DATA SCIENCE

FREE “LEARN DATA SCIENCE MASTERPLAN” EBOOK

GEAR I USE
Рекомендации по теме
Комментарии
Автор

iris.subtract(iris_header) does'nt seem to work in Jupyter Notebook

dingowhiz