AWS re:Invent 2022 - Deploy ML models for inference at high performance & low cost, ft AT&T (AIM302)

Показать описание

High-performance, cost-effective model deployment is critical to maximize the return on your ML investments. Amazon SageMaker provides the breadth and depth of fully managed deployment features to achieve optimal inference performance and cost, while reducing operational burden. In this session, learn how to use SageMaker inference capabilities to quickly deploy ML models in production at scale. Discover SageMaker deployment options including: infrastructure choices; real-time, serverless, asynchronous, and batch inference; single-model, multi-model, and multi-container endpoints; auto scaling; SageMaker Inference Recommender; model monitoring; and SageMaker MLOps integration. Learn how AT&T used Amazon SageMaker to optimize ML model deployment at scale.

Subscribe:

ABOUT AWS
Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts.

AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

#reInvent2022 #AWSreInvent2022 #AWSEvents

Рекомендации по теме

AWS re:Invent 2022 - Deploy ML models for inference at high performance & low cost, ft AT&T (AIM302)

AWS re:Invent 2022 - Build, test, and deploy your .NET applications on AWS (XNT302)

AWS re:Invent 2022 - Goldman Sachs: Using policy as code to deploy new apps in minutes (COP313)

AWS re:Invent 2022 - A deployment is not a release: Control your launches w/feature flags (BOA305-R)

AWS re:Invent 2022 - Deploy ML models for inference at high performance & low cost, ft AT&T ...

AWS re:Invent 2022 - Reimagining multi-account deployments for security and speed (NFX305)

AWS re:Invent 2022 - Amazon RDS Blue/Green Deployments, Optimized Writes & Optimized Reads (DAT2...

AWS re:Invent 2022 - Deploying egress traffic controls in production environments (SEC312)

AWS re:Invent 2022 - Manage and control your AWS costs (COP203)

AWS re:Invent 2022 - Deep dive into Amazon Aurora and its innovations (DAT326)

AWS re:Invent 2022 - [NEW] Easily build, train, and deploy ML models using geospatial data (AIM218)

AWS re:Invent 2022 - Deep dive on IBM software and SaaS solutions on AWS (PRT236)

AWS re:Invent 2022 - Building real-world serverless applications with AWS SAM (SVS303)

AWS re:Invent 2022 - Building next-gen applications with event-driven architectures (API311-R)

AWS re:Invent 2022 - AWS Well-Architected best practices for DevOps on AWS (DOP207)

AWS re:Invent 2022 - Developing and deploying secure AWS Lambda applications (PRT094)

AWS re:Invent 2022 - Deep learning on AWS with NVIDIA: From training to deployment (PRT219)

AWS re:Invent 2022 - How to deploy a private mobile network in days using AWS Private 5G (HYB204)

AWS re:Invent 2022 - The well-architected way (ARC210)

AWS re:Invent 2022 - All you need to know about architecting VMware Cloud on AWS deployment (PRT252)

AWS re:Invent 2022 - Deploy modern and effective data models with Amazon DynamoDB (DAT320)

AWS re:Invent 2022 - Beyond five 9s: Lessons from our highest available data planes (ARC310)

AWS re:Invent 2022 - SaaS architecture patterns: From concept to implementation (SAS305-R)

AWS re:Invent 2022 - Journey to cell-based microservices architecture on AWS for hyperscale (ARC312)

AWS re:Invent 2022 - How four customers reduced ML inference costs and drove innovation (CMP226)