Named Tensors, Model Quantization, and the Latest PyTorch Features - Part 1

PyTorch, the popular open-source ML framework, has continued to evolve rapidly since the introduction of PyTorch 1.0, which brought an accelerated workflow from research to production. We'll take a deep dive into some of the most important new advances, including the ability to name tensors, support for quantization-aware training and post-training quantization, improved distributed training on GPUs, and streamlined mobile deployment. We'll also cover new developer tools and domain-specific frameworks, including Captum for model interpretability, Detectron2 for computer vision, and speech extensions for Fairseq.
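
As a rough illustration of two of the features described above, the sketch below names the dimensions of a tensor and applies post-training dynamic quantization to a small model. It assumes PyTorch 1.3 or later with the torch.quantization module of that era; the code is not taken from the talk, and the model and dimension names are made up for the example.

import torch
import torch.nn as nn

# Named tensors (experimental in PyTorch 1.3): dimension names travel with
# the tensor and can be used in place of integer dims.
imgs = torch.randn(8, 3, 32, 32, names=('N', 'C', 'H', 'W'))
pooled = imgs.mean('C')        # reduce over the channel dimension by name
print(pooled.names)            # ('N', 'H', 'W')

# Post-training dynamic quantization: nn.Linear weights are converted to
# int8, and activations are quantized on the fly at inference time.
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)
print(quantized)  # Linear layers replaced by dynamically quantized variants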
Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)
Deep Dive on PyTorch Quantization - Chris Gottbrath
Quantization in Deep Learning (LLMs)
Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training
Tensors for Neural Networks, Clearly Explained!!!
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference
What’s new in PyTorch 1.3 - Lin Qiao
GTC 2021: Systematic Neural Network Quantization
Tutorial (TVMCon 2021) - Neural Network Quantization with Brevitas
Understanding 4bit Quantization: QLoRA explained (w/ Colab)
Hessian AWare Quantization V3: Dyadic Neural Network Quantization
Understanding Quantization for Deep Learning
Model Quantization for Edge Devices with AIMET
Quantization in PyTorch 2.0 Export at PyTorch Conference 2022
INT8 Inference of Quantization-Aware trained models using ONNX-TensorRT
Tensor Processing Units: History and hardware
Neural Network Compression – Dmitri Puzyrev
Quantization - Dmytro Dzhulgakov
What’s new in TensorFlow 2.11
Inside TensorFlow: TF Model Optimization Toolkit (Quantization and Pruning)
TensorFlow model optimization: Quantization and pruning (TF World '19)
How to Change Data types of Tensors - Tensorflow Basics
Multi-Dimensional Pruning: A Unified Framework for Model Compression