From Basic to Advanced Aggregate Operators in Apache Spark SQL 2.2 by Examples and their Catalyst Op

"There are many different aggregate operators in Spark SQL. They range from the very basic groupBy and the not-so-basic groupByKey, which shines in Apache Spark Structured Streaming’s stateful aggregations, through the more advanced cube, rollup and pivot, to my beloved windowed aggregations. It’s unbelievable how different their performance characteristics are, even for the same use cases.

What is particularly interesting is the comparison of the simplicity and performance of windowed aggregations vs groupBy. And that’s just Spark SQL alone. Then there is Spark Structured Streaming, which has put the groupByKey operator at the forefront of stateful stream processing (to my surprise, as its performance might not be that satisfactory).

This deep-dive talk is going to show all the different use cases for the aggregate operators and functions as well as their performance differences in Spark SQL 2.2 and beyond. Code and fun included!

Session hashtag: #EUdd5"

About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
