Processing Device JSON structured data with Spark's Datasets and DataFrames

In this Scala notebook, I show how to process structured JSON data from IoT devices using Apache Spark on Databricks Community Edition.

I use both the DataFrame and Dataset APIs to analyze and visualize the data.
Comments
Author

Nice video, Jules. Can't wait to try your example out; I've been on the waitlist for the Community Edition since the day it was announced. Thanks for sharing your knowledge.

couchpotato
Author

Hi Jules,

That's a great video. Just wanted to point out that you haven't mentioned the DeviceToData case class fields and data types.

Thanks

rahulgulati
Author

Hi Jules Damji,

Thanks for the information in the video above.
Spark now comes with the concept of a Dataset, which adds type safety over DataFrames and RDDs.

I am moving my code from DataFrames to Datasets.
I am reading from a database and want to join multiple tables. The moment I join the tables, the result becomes a Dataset of multiple tables, and when I use group by on it, it becomes a GroupedDataset.

How do I use aggregation on this?

What is the right way to use Datasets and DataFrames?

SSWAPU
Author

Thanks, Couch Potato! Can you DM me your details? I can find out what's holding things up.

julesdamji