2023 EuroLLVM - ML-on-CPU: should vectorization happen in the LLVM backend or higher up the stack?

2023 European LLVM Developers' Meeting
------
ML-on-CPU: should vectorization happen in the LLVM backend or higher up the stack?
Speaker: Elen Kalda
------
This talk is about how TVM, one of the most mature machine learning compilation stacks, interacts with LLVM. TVM is a domain-specific compiler that consumes a machine learning model expressed in a high-level ML framework such as TensorFlow or PyTorch and compiles it for a chosen target, such as the Arm(R) architecture. For CPU targets, it does this by using LLVM as a backend, directly translating TVM's IR into LLVM IR.
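As a minimal sketch of that flow (not code from the talk), the snippet below uses TVM's Python API to import a model and build it for an AArch64 CPU via the LLVM backend; "model.onnx" is a hypothetical pre-exported model file, and the exact API surface varies between TVM releases.

    import onnx
    import tvm
    from tvm import relay

    # Hypothetical model exported from an ML framework to ONNX.
    onnx_model = onnx.load("model.onnx")
    mod, params = relay.frontend.from_onnx(onnx_model)

    # An LLVM CPU target; the triple selects AArch64 code generation.
    target = "llvm -mtriple=aarch64-linux-gnu"

    # relay.build lowers TVM's IR and hands code generation to LLVM.
    with tvm.transform.PassContext(opt_level=3):
        lib = relay.build(mod, target=target, params=params)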

In TVM, just like in other machine learning stacks that use LLVM as a backend for CPU code generation, one needs to decide where optimizations like vectorization should happen: in the LLVM backend, or higher up in the ML stack. This is further complicated by the emergence of scalable vectors, like the Scalable Vector Extension (SVE). While generating code for fixed-length vectors can mostly be left to LLVM, there is a case to be made for representing variable-length vectors in the TVM stack, to use the capabilities of SVE more effectively. In this talk, we're going to present our experiences and insights on the trade-offs of targeting SVE in the TVM+LLVM stack.
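To illustrate the "vectorize higher up the stack" option, here is a rough sketch using TVM's tensor expression API (as it existed around the time of the talk): the schedule splits a loop by a fixed factor of 4 and marks the inner loop as vectorized. The names (A, B, C, n) and the factor are illustrative only; the alternative the abstract mentions is to emit scalar loops and rely on LLVM's own vectorizer. Note that a fixed split factor like this expresses NEON-style fixed-width vectors but not SVE's vector-length-agnostic types, which is the crux of the trade-off.

    import tvm
    from tvm import te

    # Element-wise add over a symbolic length n.
    n = te.var("n")
    A = te.placeholder((n,), name="A", dtype="float32")
    B = te.placeholder((n,), name="B", dtype="float32")
    C = te.compute((n,), lambda i: A[i] + B[i], name="C")

    s = te.create_schedule(C.op)
    outer, inner = s[C].split(C.op.axis[0], factor=4)  # fixed vector length
    s[C].vectorize(inner)  # inner loop becomes a 4-wide vector operation

    # Inspect the lowered TIR to see the explicit vectorization.
    print(tvm.lower(s, [A, B, C], simple_mode=True))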
-----