4.8 TensorFlow Extended (TFX): Building End-to-End ML Pipelines with TFX

Показать описание

Building end-to-end machine learning (ML) pipelines with TensorFlow Extended (TFX) entails orchestrating the various components and stages of the ML workflow, from data ingestion and preprocessing to model training, evaluation, and deployment. TFX offers a comprehensive suite of tools and libraries for efficiently building, validating, and deploying production ML pipelines.

The process begins with defining the structure and stages of the ML pipeline, specifying tasks, data sources, input features, and output targets. TFX provides components like ExampleGen for data ingestion and Transform for feature engineering and preprocessing using TensorFlow Transform.

Data validation and schema inference are facilitated by components such as StatisticsGen for computing descriptive statistics, SchemaGen for inferring schema, and ExampleValidator for detecting anomalies in training data.

Model training and evaluation are handled by components like Trainer for training models using TensorFlow and Evaluator for computing evaluation metrics. Custom model architectures and training configurations can be defined using TensorFlow's APIs.

Once trained and evaluated, TFX includes components like Pusher for deploying models to production serving infrastructure. Monitoring and governance tools enable tracking pipeline execution, monitoring model performance, and ensuring compliance.

Orchestration and execution of TFX pipelines are managed by frameworks like Apache Airflow, Apache Beam, and Kubeflow Pipelines, which handle task dependencies and provide visibility into pipeline execution.

Understanding these key steps and components is crucial for effectively deploying and managing ML workflows in production environments, unlocking the full potential of machine learning for real-world applications.

Рекомендации по теме

4.8 TensorFlow Extended (TFX): Building End-to-End ML Pipelines with TFX

Apache Beam for Production Machine Learning: TensorFlow Extended (Beam Summit Europe 2019)

TensorFlow Extended (TF Dev Summit '20)

Model Understanding and Business Reality (TensorFlow Extended)

Using TensorFlow Extended (TFX) on AI Platform Pipelines

TFX SchemaGen & ExampleValidator - Schema Generation and data validation

Deep Learning Trading Strategy from the beginning to the production using TensorFlow 2.0 and TFX

How to Build & Use TensorFlow Data Pipeline for Image Processing

Lecture 12A: Building Data Pipelines for Tensorflow - Part 1

Hands-on with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + Kubernetes + PyTorch + XGBoost

PyTorch vs Tensorflow: What's the difference? #python #AI #ml #pythonprogramming #git #github #...

End to End Machine Learning Pipeline using Tensorflow

KubeFlow +Keras/TensorFlow2 +TF Extended (TFX) +Kubernetes +PyTorch +XGBoost +Airflow +MLflow +Spark

Machine Learning - TensorFlow Extended com Apache Airflow

Hands-on KubeFlow + Keras/TensorFlow 2.0 + TFX + K8s + PyTorch + XGBoost + Airflow + MLflow + Spark

Migrating Apache Spark ML Jobs to Spark + Tensorflow on Kubeflow - Holden Karau (Independent)

SBTB 2019: Chris Fregly, End-to-End ML Pipelines with KubeFlow and TensorFlow Extended (TFX)

Jörg Schad - Workshop: Building and Operating an Open Source Data Science Platform

Complete Tutorial on TensorFlow 2 0 using Keras API [Part 1]

100 Days of ML | Day 26: ML Project pt4 - TFX Pipeline Setup

Tensorflow: Build a data pipeline Part-I

Build an image classifier (ML Zero to Hero - Part 4)

Productionizing ML with ML Ops and Cloud AI - Kaz Sato, Google

Building Machine Learning Products with TensorFlow 2.0 - Ekaba Bisong #DevFestSGF

Building a Reproducible Machine Learning Pipeline With Kubernetes, Tensorflow, and Kuberflow