Faster Model Serving with Ray and Anyscale | Ray Summit 2024

Ray Serve is an industry-leading ML platform for distributed model serving and deployment. In this Ray Summit 2024 breakout session, Edward Oakes and Akshay Malik from Anyscale explore how the Anyscale platform extends Ray Serve to solve key problems for serving large-scale AI models.

They also explore how the rise of large-scale generative AI has sharpened the challenges of building AI applications. Larger models are more expensive to initialize and run, and can even require special techniques such as tensor or pipeline parallelism across multiple GPUs. At the same time, they come with the same production-readiness and developer-productivity challenges as hosting any other ML model. The solution: Anyscale's Ray Serve.
