DevOps Institute/Site Reliability Engineering (SRE) Foundation

Site Reliability Engineering (SRE) Foundation Certification Training Course Overview

The SRE (Site Reliability Engineering) Foundation℠ course introduces the principles and practices that enable an organization to scale critical services reliably and economically. The main objective of the course and certification is to impart, test, and validate knowledge of basic SRE vocabulary, principles, and practices.

SRE Foundation is designed to help participants understand the basic concepts of SRE and how to improve operational activities by applying Site Reliability Engineering principles and engineering practices.

Audience

  • Business Managers and Stakeholders
  • Change Agents and Consultants
  • DevOps Practitioners
  • IT Directors, Managers and Team Leaders
  • Product Owners
  • Scrum Masters
  • Software Engineers
  • Site Reliability Engineers
  • System Integrators
  • Tool Providers
This course prepares you for Exam SRE.

Site Reliability Engineering (SRE) Foundation (16 Hours)

Live Online Training
  • Group Training: 1150 (date on request)
  • 1-on-1 Training (GTR): 1300
  • Session length: 4 Hours or 8 Hours
  • Schedule: Week Days or Weekend
  • Start Time: At any time

GTR = Guaranteed to Run

Classroom Training (Available: London, Dubai, India, Sydney, Vancouver)
  • Duration: On Request
  • Fee: On Request
Buy Flexi For Only $99 (Online Training Re-imagined)

Course Modules

Module 1: Course Introduction
  • Course Goals
  • Course Agenda
Module 2: SRE Principles & Practices
  • What is Site Reliability Engineering?
  • SRE & DevOps: What is the Difference?
  • SRE Principles & Practices
Module 3: Service Level Objectives & Error Budgets
  • Service Level Objectives (SLOs)
  • Error Budgets
  • Error Budget Policies
Module 4: Reducing Toil
  • What is Toil?
  • Why is Toil Bad?
  • Doing Something About Toil
Module 5: Monitoring & Service Level Indicators
  • Service Level Indicators (SLIs)
  • Monitoring
  • Observability
Module 6: SRE Tools & Automation
  • Automation Defined
  • Automation Focus
  • Hierarchy of Automation Types
  • Secure Automation
  • Automation Tools
Module 7: Anti-Fragility & Learning from Failure
  • Why Learn from Failure
  • Benefits of Anti-Fragility
  • Shifting the Organizational Balance
Module 8: Organizational Impact of SRE
  • Why Organizations Embrace SRE
  • Patterns for SRE Adoption
  • On-Call Necessities
  • Blameless Post-Mortems
  • SRE & Scale
Module 9: SRE, Other Frameworks, The Future
  • SRE & Other Frameworks
  • The Future

Course Prerequisites
  • Basic knowledge of IT
  • Working knowledge of DevOps

Upon completion of this course, you will understand:

  • The history of SRE and its emergence at Google
  • The interrelationship of SRE with DevOps and other major frameworks
  • The underlying principles behind SRE
  • The focus of Service Level Objectives (SLOs)
  • Service Level Indicators (SLIs) and the modern monitoring landscape
  • Error budgets and the associated error budget policies
  • The effects of toil on an organization’s productivity
  • Practical steps that can help eliminate toil
  • SRE tools, automation techniques, and the importance of security
  • Anti-fragility and the approach to failure and failure testing
  • The organizational impact of introducing SRE