Distributed Processing for Machine Learning Production Pipelines

Production ML workloads often require very large compute and system resources, which leads to the application of distributed processing on clusters. Whether on-premises or cloud-based, infrastructure cost demands maximally efficient use of resources. This makes distributed processing frameworks such as Apache Flink ideal for ML workloads.
In addition, production ML must address issues of modern software methodology, as well as issues unique to ML. Different types of ML have different requirements, often driven by the different data lifecycles and sources of ground truth. Implementations often suffer from limitations in modularity, scalability, and extensibility.
In this talk, we discuss production ML applications and review TensorFlow Extended (TFX), Flink, Apache Beam, and Google's experience with ML in production.
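As a toy illustration of the data-parallel idea behind these frameworks (this is a stdlib-only sketch, not Flink or Beam code; the record schema and transform are hypothetical):

```python
# Toy sketch: data-parallel feature extraction, the core idea behind
# distributed ML preprocessing. Real pipelines use Apache Beam or Flink,
# which shard records across cluster workers; a thread pool stands in here.
from concurrent.futures import ThreadPoolExecutor

def extract_features(record: dict) -> dict:
    # Hypothetical per-record transform (an assumption, not from the talk).
    return {"id": record["id"], "norm": record["value"] / 100.0}

def run_pipeline(records, workers=4):
    # Each worker processes records independently, mirroring how a
    # distributed runner parallelizes a per-element transform.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(extract_features, records))

records = [{"id": i, "value": i * 10} for i in range(8)]
results = run_pipeline(records)
print(results[0])  # {'id': 0, 'norm': 0.0}
```

Because each record is transformed independently, the same logic scales from a local thread pool to a cluster runner without changing the transform itself.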

Q&A:
Q: When should we use TFX for input processing and when should we use tf.data? They seem to serve a similar purpose.
Q: Can you paste quickstart link?

AICamp