Jan-Benedikt Jagusch Christian Bourjau: Making Machine Learning Applications Fast and Simple with...

preview_player
Показать описание
Speaker:: Jan-Benedikt Jagusch Christian Bourjau

Track: General: Production
Taking trained machine learning models from inside a Jupyter notebook and deploying them into a production microservice is painful for two reasons:

- Models are not fully self-contained and need to be packaged together with their environment
- Models are optimized for batch processing but slow down for single-row predictions, which could lead to timeouts in a fast-paced online microservice.

In this talk, you will learn how to use the Open Neural Network Exchange (ONNX) framework to compile models into fully-self contained computational graphs, which can reduce single-row inference time by up to 99%, while also drastically simplifying model management.

You will be introduced to the ONNX ecosystem, such as the `sklearn-onnx` and `onnxmltools` libraries for converting models into ONNX graphs, and will learn how to write converters for custom estimators and transformers.

Recorded at the PyConDE & PyData Berlin 2022 conference, April 11-13 2022.