Site Reliability Engineering (SRE) Fundamentals

preview_player
Показать описание
Join us on September 22nd to learn the Site Reliability Engineering (SRE) principles and practices that you can apply in your organization that enable your systems to be more scalable, reliable, and efficient.

Technical Account Manager, Pamella Canova, will lead the session, including: 

- The core problems SRE solves and organizational structures to facilitate the practice of SRE
- Key principles SREs use to keep systems reliable
- Areas of responsibility and expertise amongst SREs
- How to adopt SRE best practices in your organization

07:06 The SRE approach to operations
09:04 What do SRE teams do?
10:10 SRE and DevOps
11:03 Error budgets: The key principle of SRE

23:57 Practice areas of SRE
24:17 Monitoring and alerting
26:57 Demand forecasting and capacity planning
29:04 Efficiency and performance
30:55 Change management
34:00 Pursuing maximum change velocity
39:55 Provisioning
41:50 Emergency response
44:09 Incident and postmortem thresholds
48:31 Culture of blamelessness
49:55 Toil management / operational work

52:55 Getting started in 4 steps
55:14 Resources and certification information

56:55 Q&A
Рекомендации по теме
Комментарии
Автор

From where we can do SRE certification?

pratikRimps