Все публикации

Data Driving Yahoo

Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse

WHAT’S POSSIBLE WITH

WHAT’S POSSIBLE WITH AI AND DATA IN 2017?

Dynamic DDL Adding

Dynamic DDL Adding structure to streaming IoT data on the fly

Tuning Apache Ambari

Tuning Apache Ambari performance for Big Data at scale with 3000 agents

Data profiling in

Data profiling in Apache Calcite . Julian Hyde, Hortonworks

Extending Apache Ranger

Extending Apache Ranger Authorization Beyond Hadoop Review of Apache Ranger Extensibility Framework

Partner Ecosystem Showcase

Partner Ecosystem Showcase for Apache Ranger and Apache Atlas

Bringing Real Time

Bringing Real Time to the Enterprise with Hortonworks DataFlow

APACHE HADOOP YARN

APACHE HADOOP YARN PRESENT AND FUTURE

Beyond unit tests

Beyond unit tests Deployment and testing for Hadoop Spark workflows

Large Scale Graph

Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud PrevenUon

Data Ingest Self

Data Ingest Self Service and Management using Nifi and Kaya

It Takes a

It Takes a Village Organizational Alignment to Deliver Big Data Value in Health Insurance

DANCING ELEPHANTS –

DANCING ELEPHANTS – EFFICIENTLY WORKING WITH OBJECT STORES FROM APACHE SPARK AND APACHE HIVE

Dockerize and Kerberize

Dockerize and Kerberize Notebook for Yarn and HDFS

HADOOP JOURNEY AT

HADOOP JOURNEY AT WALGREENS

HANDLING KERNEL UPGRADES

HANDLING KERNEL UPGRADES AT SCALE - THE DIRTY COW STORY

Kafka to the

Kafka to the Maxka Kafka Performance Tuning

WHOOPS, THE NUMBERS

WHOOPS, THE NUMBERS ARE WRONG! SCALING DATA QUALITY @ NETFLIX

The Unbearable Lightness

The Unbearable Lightness of Ephemeral Processing

Its Finally Here!

Its Finally Here! Building Complex Streaming Analytics Apps in under 10 mins without writing any

APACHE KUDU: 1.0

APACHE KUDU: 1.0 AND BEYOND

Ingest Process Analyze

Ingest Process Analyze – Automation and Integration through the Big Data Journey

Scalable Data Science

Scalable Data Science with SparkR