Getting started with Apache Spark + Apache Cassandra Part 1

preview_player
Показать описание
How many people logged into my website four weeks ago? What's the average temperature on Wednesdays? Cassandra is not designed to handle these types of queries, you model your data for your online operational queries so ad-hoc querying and aggregation does not work.

That's where Apache Spark + the OS Cassandra Spark connector come in! Spark does the distributed computation, Cassandra handles the data distribution. The connector allows you to map Cassandra tables to Spark RDDs, meaning you can achieve ad-hoc querying/aggregation.

In this video, Christopher Batley will give you an overview of how this works and show some examples. There are three parts to it. Have fun watching!
Рекомендации по теме