Все публикации

Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse

WHAT’S POSSIBLE WITH AI AND DATA IN 2017?

Dynamic DDL Adding structure to streaming IoT data on the fly

Tuning Apache Ambari performance for Big Data at scale with 3000 agents

Data profiling in Apache Calcite . Julian Hyde, Hortonworks

Extending Apache Ranger Authorization Beyond Hadoop Review of Apache Ranger Extensibility Framework

Partner Ecosystem Showcase for Apache Ranger and Apache Atlas

Bringing Real Time to the Enterprise with Hortonworks DataFlow

APACHE HADOOP YARN PRESENT AND FUTURE

Beyond unit tests Deployment and testing for Hadoop Spark workflows

Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud PrevenUon

Data Ingest Self Service and Management using Nifi and Kaya

It Takes a Village Organizational Alignment to Deliver Big Data Value in Health Insurance

DANCING ELEPHANTS – EFFICIENTLY WORKING WITH OBJECT STORES FROM APACHE SPARK AND APACHE HIVE

Dockerize and Kerberize Notebook for Yarn and HDFS

HADOOP JOURNEY AT WALGREENS

HANDLING KERNEL UPGRADES AT SCALE - THE DIRTY COW STORY

Kafka to the Maxka Kafka Performance Tuning

WHOOPS, THE NUMBERS ARE WRONG! SCALING DATA QUALITY @ NETFLIX

The Unbearable Lightness of Ephemeral Processing

Its Finally Here! Building Complex Streaming Analytics Apps in under 10 mins without writing any

APACHE KUDU: 1.0 AND BEYOND

Ingest Process Analyze – Automation and Integration through the Big Data Journey

Scalable Data Science with SparkR