Apache Druid 101

preview_player
Показать описание
Data Con LA 2020
Description
Apache Druid is a cloud-native open-source database that enables developers to build highly-scalable, low-latency, real-time interactive dashboards and apps to explore huge quantities of data. This column-oriented database provides the microsecond query response times required for ad-hoc queries and programmatic analytics. Druid natively streams data from Apache Kafka (and more) and batch loads just about anything. At ingestion, Druid partitions data based on time so time-based queries run significantly faster than traditional databases, plus Druid offers SQL compatibility. Druid is used in production by AirBnB, Nielsen, Netflix and more for real-time and historical data analytics. This talk provides an introduction to Apache Druid including: Druid's core architecture and its advantages, Working with streaming and batch data in Druid, Querying data and building apps on Druid and Real-world examples of Apache Druid in action
Speaker
Matt Sarrel, Imply Data, Developer Evangelist
Рекомендации по теме
Комментарии
Автор

No Comments??? Man, This is one of the best presentation I have seen. Thanks Matt.

nandkarthik
Автор

Thank you Matt. This is really well explained. I appreciate your effort and time. It is very helpful for me to understand where Druid actually fits. Thanks again @Matt Sarrel

ankurranjan
Автор

Thank you for the great content. It would have been really awesome if the slide was available to view.

shinebayar
Автор

How we can ingest avro data from apache kafka topic,

anantababa
Автор

Any use cases where someone adopted druid and the after a couple of years moved tl a different solution due to limitations he didn't predict fully aware of?

programminginterviewsprepa
Автор

About the timing thing.... it seems to me that there is too much waffling around the history and about yourself, if you stick to the topic you would probably have had enough time.

michalrybinski