filmov
tv
Funnel Analysis with Apache Spark and Druid
Показать описание
Every day, millions of advertising campaigns are happening around the world.
As campaign owners, measuring the ongoing campaign effectiveness (e.g “how many distinct users saw my online ad VS how many distinct users saw my online ad, clicked it and purchased my product?”) is super important.
However, this task (often referred to as “funnel analysis”) is not an easy task, especially if the chronological order of events matters.
One way to mitigate this challenge is combining Apache Druid and Apache DataSketches, to provide fast analytics on large volumes of data.
However, while that combination can answer some of these questions, it still can’t answer the question “how many distinct users viewed the brand’s homepage FIRST and THEN viewed product X page?”
In this talk, we will discuss how we combine Spark, Druid and DataSketches to answer such questions at scale.
Connect with us:
As campaign owners, measuring the ongoing campaign effectiveness (e.g “how many distinct users saw my online ad VS how many distinct users saw my online ad, clicked it and purchased my product?”) is super important.
However, this task (often referred to as “funnel analysis”) is not an easy task, especially if the chronological order of events matters.
One way to mitigate this challenge is combining Apache Druid and Apache DataSketches, to provide fast analytics on large volumes of data.
However, while that combination can answer some of these questions, it still can’t answer the question “how many distinct users viewed the brand’s homepage FIRST and THEN viewed product X page?”
In this talk, we will discuss how we combine Spark, Druid and DataSketches to answer such questions at scale.
Connect with us:
Funnel Analysis with Apache Spark and Druid
Apache Spark Side of Funnels - Zoran Stipanicev (GetYourGuide)
Funnel Analysis with Spark and Druid - Itai Yaffe @ Nielsen (English)
Apache Spark as a Platform for Powerful Custom Analytics Data Pipeline: Talk by Mikhail Chernetsov
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Deep Dive into Apache Spark | Big Data Course | Board Infinity
Using Apache Spark to Predict Installer Retention from Clickstream Data (Patrick Halina)
OSA Con 2021: Succeeding with Apache Druid and Clickstream Data
Building a Versatile Analytics Pipeline on Top of Apache Spark - Mikhail Chernetsov
'Druid: Powering Interactive Data Applications at Scale' by Fangjin Yang
Funnel Analysis In Mobile Gaming: Leveraging Approximation Algorithms For Low Latency Analytics
DX APM: Getting Started with Funnel Analysis
Visualize Your Customer Journey: Building and Analyzing Funnels
SQL : How can one calculate funnel analysis from a SQL table of raw events?
High Performance Advanced Analytics with Spark Alchemy - Sim Simeonov was (Swoop)
Druid Summit 2022: Why & How We Built An Open Source Spark Druid Connector
Funnel Analysis
Democratizing data science Using spark, hive and druid
Paco Nathan: NLP and text analytics at scale with PySpark and notebooks
Apache Spark Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark - Emily Curtin
Data ingestion, stream processing and sentiment analysis pipeline using Twitter data example
Apache Spark the Hard Way: Challenges with Building an On Prem Spark Analytics Platform and Strategi
Intro to Apache Spark for Java and Scala Developers - Ted Malaska (Cloudera)
Building Modern Analytics Applications with Apache Druid
Комментарии