Все публикации

An Interdisciplinary Approach to Research within the Educational Institution

Data Highway Rainbow Petabyte Scale Event Collection, Transport & Delivery at Yahoo

End to end Data Governance with Apache Avro and Atlas

AN APACHE HIVE BASED DATA WAREHOUSE

Solving cyber at scale

Lessons learned from scaling YARN to 40k machines in a mulU tenancy environment

Continuous Data Ingestion pipeline for the Enterprise

Open Source in the Energy Industry

Hadoop Query Performance Smackdown

Accelerating HBase with NVMe and BucketCache

Integrating and Analyzing Data from Multiple Manufacturing Sites

DEEP LEARNING WITH SPARK AND GPUS

GeoWave Open Source Geospatial Temporal N dimensional Indexing for Accumulo, HBase and Cassandra

Governance Bots Metadata Driven Compliance Through AI, Atlas and NiFi

REAL TIME STREAMING ARCHITECTURE AT FORD

Semi Supervised Learning In An Adversarial Environment

Entity Resolution Service Bringing Petabytes of Data Online for Instant Access

Hadoop Infrastructure @Uber Past , Present and Future

Creating real time, data centric applications with Impala and Kudu

Multitenancy At Bloomberg HBase and Oozie

Deep Learning in Security – Examples, Infrastructure, Challenges, and Suggestions

Optimizing, profiling and deploying high performance Spark ML and TensorFlow AI models

MLeap Scaling Machine Learning

Worldwide Scalable and Resilient Messaging Services by CQRS and Event Sourcing using Akka, Kaya Stre