tinyML EMEA 2022 - Andrew Reusch: Whole-model optimization with Apache TVM

tinyML EMEA 2022
Algorithms, Software & Tools session
Whole-model optimization with Apache TVM
Andrew REUSCH, Software Engineer, OctoML

Optimized deep learning kernels are crucial for achieving good performance in deployed ML models. Increasingly, developers are turning to deployment tools to assemble these optimized kernels into full models. However, per-kernel optimizations have limited impact on a full model’s throughput, especially when heterogeneous compute platforms are in use or when the underlying hardware is designed to execute operators concurrently.
In this talk, I’ll describe how Relax, a new model-level language in Apache TVM, enables hardware vendors to easily apply common hardware optimization techniques such as striping and global memory planning across the full program. With Relax, these techniques can be tailored to both individual accelerators and heterogeneous compute environments. Finally, I’ll discuss our plans to integrate Relax with Apache TVM’s Ahead-of-Time compilation flow, making it available in a low-overhead runtime targeted at bare-metal environments.

tinyML Foundation
