Kubernetizing Big Data and ML Workloads at Uber - Mayank Bansal & Min Cai, Uber

Uber relies on Big Data and ML to make business-critical decisions such as pricing and trip ETA. Today, workloads such as Hive and Spark run on YARN. To save millions of dollars through more efficient use of cluster resources, Uber is planning to use Kubernetes to co-locate Big Data/ML and microservice workloads. Kubernetes is the de facto standard for running microservices. However, compared with YARN, it still lacks features such as hierarchical resource pools, elastic resource sharing, and gang scheduling. To bridge this gap, we have re-architected Peloton as a set of Kubernetes scheduler and controller plugins so that we can provide feature parity with YARN.

This talk will cover:
- Learnings from running large-scale Big Data/ML on Kubernetes with Peloton
- Colocation of mixed workloads
- Federation across zones
- Feature and API parity with YARN

CNCF [Cloud Native Computing Foundation]
