Synapse Espresso: Library Management in Apache Spark for Synapse

preview_player
Показать описание
Welcome to episode 33 of our Synapse Espresso series! In this video, Estera joins Stijn and talks you through the three different ways of library management in Azure Synapse!

Stijn Wynants - Fasttrack Engineer

Estera Kot - Program Manager
Рекомендации по теме
Комментарии
Автор

Thanks for the video. Is there already a way to use a CICD pipeline for building a python package and uploading that artifact to a Spark pool on a specified environment via yaml tasks or endpoints? Right now, it seems limited to using the web interface to upload the package manually.

sanderklijsen
Автор

@EsteraKot @Stijn, thanks for this video. I have 1 question :
what's the advantage of pre-loading the librairies at the Spark Cluster level vs doing it at session level with PIP Install (for PySpark) ? at least with PIP Install we can be sure that it will use the latest version available, isn't it ?

GuillaumeBerthier