Все публикации

Using Spark and Riak for IoT Apps—Patterns and Anti Patterns: Spark Summit East talk by Pavel Hardak

The Fast Path to Building Operational Applications with Spark: talk by Nikita Shamgunov

Optimizing Spark Deployments for Containers: Isolation, Safety & Performance by William Benton

Secured Kerberos based Spark Notebook for Data Science: Spark Summit East talk by Joy Chakraborty

Apache Carbondata: An Indexed Columnar File Format for Interactive Query by Jacky Li/Jihong Ma

New Directions in pySpark for Time Series Analysis: Spark Summit East talk by David Palaitis

Spark: Data Science as a Service: Spark Summit East talk by Shekhar Agrawal and Sridhar Alla

Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach by Eric Kaczmarek and Lucy Lu

Spark Autotuning: Spark Summit East talk by: Lawrence Spracklen

Fault Tolerance in Spark: Lessons Learned from Production: Spark Summit East talk by Jose Soltren

Sparking Up Data Engineering: Spark Summit East talk by Rohan Sharma

Spark Streaming as a Service with Kafka and YARN: Spark Summit East talk by Jim Dowling

Fighting Cybercrime: A Joint Task Force of Real Time Data and Human Analytics by William Callaghan

Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky Yin

Experiences with Spark's RDD APIs for Complex, Custom Applications: talk by Tejas Patil

Monitoring the Dynamic Resource Usage of Scala & Python Jobs in Yarn by Ed Barnes/Ruslan Vaulin

Exploring Spark for Scalable Metagenomics Analysis: Spark Summit East talk by Zhong Wang

Spark as the Gateway Drug to Typed Functional Programming: talk by Jeffrey Smith and Rohan Aletty

Powering Predictive Mapping at Scale with Spark, Kafka, and Elastic Search: talk by Jörg Schad

Delivering Insights from 5PB of Product Logs at Pure Storage: Spark Summit East talk by Brian Gold

Building the Ideal Stack for Real-Time Analytics: Spark Summit East talk by Steven Camina

A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East Talk by Myles Collins

Parallelizing Existing R Packages with SparkR: Spark Summit East talk by Hossein Falaki

Apache Spark for Machine Learning with High Dimensional Labels: by Michael Zargham/Stefan Panayotov