Ray: Enterprise-Grade, Distributed Python

Показать описание

1. Low-latency scheduling and execution of small-to-large ‘tasks’ that perform a wide variety of computation chores, with logical sequencing of dependent tasks.
2. Management of ‘arbitrary’, distributed state, with thread-safe updates and access from other Ray tasks across a cluster.
3. Near-linear scaling.
4. An intuitive API that hides complexity from the user.

Ray has been used for reinforcement learning, hyper parameter tuning, model serving, and other applications in clusters up to thousands of nodes. I’ll discuss examples that illustrate how Ray can be used with Spark to build robust, scalable data applications for enterprises, when to use Ray versus alternative choices, and how to adopt it in your projects.

About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Connect with us:

Рекомендации по теме

Комментарии

How to initialize ray on Databricks? It's throwing error. Can you share any example notebook for ray usage on Databricks?

madhuful

Thanks for a nice introduction, really helped me as a Spark practitioner to understand the differennce

GeorgeVorobiov

Ray: Enterprise-Grade, Distributed Python

Ray: Enterprise-Grade, Distributed Python

Ray: Faster Python through parallel and distributed computing

Introduction to Distributed Computing with the Ray Framework

Dean Wampler - Ray: A System for High-performance, Distributed Python Applications

Ray: A System for High-performance, Distributed Python Applications - Dean Wampler

Ray: A System for Scalable Python and ML |SciPy 2020| Robert Nishihara

Stateful Distributed Computing in Python with Ray Actors

Tutorial - Jules S. Damji: Distributed Python with Ray Hands on with the Ray Core APIs

Talk: Dean Wampler - Ray: A System for High-performance, Distributed Python Applications

Distributed Python with Ray-Hands on with the Ray 2.0 APIs for scaling Python Workloads | PDNYC 2022

Ray: A Framework for Scaling and Distributing Python & ML Applications

How does Ray compare to Apache Spark??

Distributed Computing In Python Made Easy With Ray

Robert Nishihara — The State of Distributed Computing in ML

PyTorch Community Voices | Distributed PyTorch with Ray | Michael & Richard

RayDP: Build Large-scale End-to-end Data Analytics and AI Pipelines Using Spark and Ray

Unifying Large Scale Data Preprocessing and ML Pipelines with Ray Datasets | PyData Global 2021

Collective-on-Ray: High-performance Collective Communication for Distributed Machine Learning on Ray

Ray and Its Growing Ecosystem

Build Large-Scale Data Analytics and AI Pipeline Using RayDP

TALK / SangBin Cho / Data Processing on Ray

Introduction to Ray AIR for Scaling AI/ML and Python Workloads

Ray: A Cluster Computing Engine for Reinforcement Learning Applications

Distributed Computing is the Future of Computing with Robert Nishihara