Site Reliability Engineering (SRE) Foundation & Practitioner

Includes:
Exam voucher
Play Video
Play Video
Watch Video: Find out more about this course
Watch Video: Find out more about this course

Course overview

Syllabus

Exams

FAQ

Student Review

Well presented courses covering all the essential info needed for DevOps certification. Definitely recommended as an online learning provider.

About this course:

Become a certified site reliability engineer with this fully accredited SRE Foundation (SREF)℠ & SRE Practitioner (SREP)℠ suite from Good e-Learning!

We cover the DevOps Institute’s SRE syllabus in its entirety, teaching candidates everything they need to know about site reliability engineering and how it enables businesses to provide and scale market-leading services. Following an introduction to the principles and practices of SRE, the suite covers how to implement them and fully optimize your pipeline. Kickstart your SRE training today!

Who is this course aimed at?

  • Software engineers, Scrum masters, system integrators, tool providers, change agents, consultants, IT directors, and anyone else involved in IT leadership, development, operations, scalability, and reliability
  • DevOps and Site Reliability Engineers who wish to verify their knowledge with widely-recognized qualifications
  • Organizations wishing to comprehensively integrate SRE’s best practices, insight, tools, and vocabulary
  • Leaders and managers focused on modern IT leadership and organizational change
  • DevOps-powered companies that wish to optimize their cultures

What will you learn by taking this course?

  • Everything required to pass the SRE Foundation and SRE Practitioner certification exams
  • The principles, practices, and tools of site reliability engineering and how they enhance development and operations
  • How SRE empowers organizations to scale services reliability and economically
  • How an organization can be realigned to support SRE best practices
  • How SRE is evolving, as well as how site reliability engineers can continue updating their knowledge
  • How to understand, set, and track service level objectives (SLOs)
  • The relationship between SRE and DevOps
  • How to highlight, avoid, and repair antipatterns
  • How to define service level objectives and service level indicators (SLI) in distributed ecosystems
  • The importance of having an error budget and how to perform error budget calculations
  • How to make systems secure and reliable by design
  • The importance of full-stack observability and how to review the health of your systems
  • How to first implement SRE within an organization
  • The advantages of building and operating control platforms as products for technology
  • What kind of role AIOps plays in boosting the efficiency of IT services
  • How to leverage an incident and command framework and OODA loops in incident response management
  • The importance of chaos engineering for building confidence in a system

Why should you take this course?

  • This course suite is fully accredited by the DevOps Institute, the organization behind the Site Reliability Engineering certification
  • This SRE course suite offers a detailed overview of SRE, its benefits, and how to implement it
  • SRE optimizes efficiency by enhancing communication and collaboration between development and operations staff
  • Qualified site reliability engineers are in increasingly high demand across all industries and sectors
  • Good e-Learning is an award-winning online training provider, as well as a Trusted Education Partner for the DevOps Institute
  • This SRE online training course suite comes with several engaging assets, including gamified knowledge checks, mock exams, case studies, and instructor-led videos
  • Good e-Learning regularly provides free SRE training resources, including downloadable posters and webinars
  • The GEL support team is fully qualified to answer questions on the SRE training syllabus
  • Good e-Learning courses can be accessed from any web-enabled device thanks to the FREE Go.Learn app
  • Worried about SRE certification costs? Good e-Learning offers FREE exam vouchers with this course (T&Cs apply)
  • The course is fully accredited by the DevOps Institute

An Introduction to SRE Foundation (SREF)℠

Learning Objectives

This module introduces the SRE Foundation training course. The subject matter and rationale is explained and students are given an overview of the Foundation syllabus.

Candidates also receive a toolkit containing:

Table of contents
Glossary
Further resources
Diagram pack

SRE Foundation (SREF)℠ Module 1: SRE Principles & Practices

Learning Objectives

This module introduces students to site reliability engineering as a discipline, including how it compares to DevOps. The principles and practices of SRE are also explained.

SRE Foundation (SREF)℠ Module 2: Service Level Objectives & Error Budgets

Learning Objectives

This module examines service level objectives (SLOs), service levels, error budgets, and error budgets policies.

SRE Foundation (SREF)℠ Module 3: Reducing Toil

Learning Objectives

This module introduces ‘toil’, why it represents a problem, and how it can be effectively managed.

SRE Foundation (SREF)℠ Module 4: Monitoring & Service Level Indicators

Learning Objectives

This module focuses on service level indicators (SLIs), along with observability and monitoring.

SRE Foundation (SREF)℠ Module 5: SRE Tools & Automation

Learning Objectives

This module looks at ‘automation’ as defined by both SRE and DevOps. It focuses on several distinct types of automation and their hierarchy, as well as popular automation tools.

SRE Foundation (SREF)℠ Module 6: Anti-Fragility & Learning from Failure

Learning Objectives

This module looks at the SRE principle of learning from failure and how it relates to anti-fragility and chaos engineering.

SRE Foundation (SREF)℠ Module 7: Organizational Impact of SRE

Learning Objectives

This module examines how SRE is managed at an organizational level. It also covers how SRE is initially implemented, why so many businesses are embracing SRE, patterns for adopting SRE, sustainable incident responses, and blameless post-mortems. Finally, it covers how to utilize SRE at scale.

SRE Foundation (SREF)℠ Module 8: SRE, Other Frameworks, Trends

Learning Objectives

This module covers how SRE relates to and incorporates other popular frameworks, including IT4IT, Agile, and ITIL 4. It also considers how SRE is evolving and what kind of shape it will take in the future.

SRE Foundation (SREF)℠ Practice Exams

Learning Objectives

This module features two practice exams that can help candidates get used to the conditions of the Site Reliability Engineering (SRE) Foundation exam.

An Introduction to SRE Practitioner (SREP)℠

Learning Objectives

This module introduces students to the main features of the course, including its objectives, aims, learning plan, and structure.

Candidates are taken through the course’s syllabus and provided with a glossary, further reading and links document, diagram pack, and links to download copies of essential SRE publications. It then answers some of the most frequently asked questions regarding SRE Practitioner.

The module concludes with a brief assessment to gauge how much candidates remember from the SRE Foundation syllabus.

SRE Practitioner (SREP)℠ Module 1: SRE Antipatterns

Learning Objectives

This module focuses on antipatterns in SRE and how such unproductive behaviors can negatively impact a pipeline.

SRE Practitioner (SREP)℠ Module 2: Service Levels and Error Budgets

Learning Objectives

This module examines system boundaries and demonstrates how to define system capabilities and appropriate service level indicators (SLIs) and service level objectives (SLOs). It also looks at how to measure the baseline.

The module goes on to examine multi-service architecture, along with how to calculate and utilize error budgets.

SRE Practitioner (SREP)℠ Module 3: Building Secure and Reliable Systems

Learning Objectives

This module defines the role of a site reliability engineer when it comes to systems design, along with the important considerations regarding changing landscape and security requirements. The module then examines contemporary approaches, technology, and tools for system design, along with design patterns that help SRE practitioners build secure, resilient, reliable, and scalable systems.

SRE Practitioner (SREP)℠ Module 4: Full-Stack Observability

Learning Objectives

This module focuses on the key elements of full-stack observability, as well as how instrumentation makes SRE systems more observable.

SRE Practitioner (SREP)℠ Module 5: Review: Modules 1-4

Learning Objectives

This is a reflective module designed to help students test their knowledge on the concepts and terms covered in modules one to four. It features a memory game as well as a concept checker.

SRE Practitioner (SREP)℠ Module 6: Platform SRE and AIOps

Learning Objectives

This module examines the benefits of taking a platform-centric view when building and operating platforms as products. It goes on to look at how artificial intelligence can benefit IT operations and how to implement AI.

SRE Practitioner (SREP)℠ Module 7: SRE and Incident Management

Learning Objectives

This module looks at the key elements of incident management in relation to the incident command framework. It also examines how the Observe, Orient, Decide, Act (OODA) loop is used to integrate technology, processes, and resources for incident responses.

SRE Practitioner (SREP)℠ Module 8: Chaos Engineering

Learning Objectives

This module examines ‘chaos engineering’, the discipline of experimenting with a distributed system to build confidence in its ability to survive and thrive even in turbulent conditions. It also explains how to set up game day exercises for practicing chaos engineering and dispels common myths on the subject.

SRE Practitioner (SREP)℠ Module 9: Implementing SRE Practices

Learning Objectives

This module looks at the role SRE plays in optimizing operations and fully realizing DevOps cultures. It then looks at the steps and models used to implement and execute SRE.

SRE Practitioner (SREP)℠ Module 10: Review: Modules 6-9

Learning Objectives

This is a reflective module designed to help students affirm their understanding of the concepts and terms covered in modules six to nine. It features a memory game and concept checker.

SRE Practitioner (SREP)℠ Practice Exams

Learning Objectives

This module features two practice exams that can help candidates get used to the conditions of the Site Reliability Engineering (SRE) Practitioner exam.

This SRE course suite is designed to fully prepare students to sit the official SRE Foundation and SRE Practitioner examinations. This includes providing official practice exams to help students test themselves and get used to examination conditions.

This course comes with mock exams to help students prepare for the real thing, as well as FREE exam vouchers. (T&Cs apply)

Before booking your exam, it will be a good idea to make sure that your device meets the technical requirements. Please visit the DevOps Institute website for more information and guidance.

When you are ready to use your free exam voucher, simply contact [email protected]. Exam voucher requests are typically processed within 2 working days but please allow up to 5. Students must request their exam voucher within the course access period which starts from the date of purchase. For more information, please visit our Support & FAQs page.

SRE Foundation (SREF) exam:

  • This is a multiple choice exam consisting of 40 questions
  • There is a time limit of 60 minutes to complete the exam
  • The exam is open book, with only the provided materials being permitted for use
  • The pass mark for the exam is 65%: you must score at least 26 out of 40 questions correct
  • Candidates can take the exam online or in person with an invigilator

SRE Practitioner (SREP) exam:

  • This is a multiple choice exam consisting of 40 questions
  • There is a time limit of 90 minutes to complete the exam
  • The exam is open book, with only the provided materials being permitted for use
  • The pass mark for the exam is 65%: you must score at least 26 out of 40 questions correct
  • Candidates can take the exam online or in person with an invigilator

What is SRE?

‘Site Reliability Engineering (SRE)’ is the process of continuously testing the ‘reliability’ of a new product in development. This enables developers to better understand and adapt to the needs of operations teams.

How does SRE work?

There are several elements to SRE, including:

A ‘Service Level Agreement (SLA)’ is outlined to define how reliable a product has to be for end-users
An ‘Error Budget’ is established to show how much can be spent on fixing errors before production must stop
Site reliability engineers make themselves available to help with development team workloads and vice versa
Site reliability engineers actively find and repair problems during the development stage
Developers take on Operations tasks if necessary
Site reliability engineers create automation wherever possible for the sake of efficiency and reliability

What is a site reliability engineer?

A ‘site reliability engineer’ is an automation/ coding specialist whose job it is to find and solve problems within Development and Operations.

How can SRE benefit businesses?

An SRE team can not only make a DevOps pipeline more reliable, but also far more efficient and scalable. It can also free Development and Operations team members to focus on improving services elsewhere, boosting the quality of releases. Incorporating SRE will also further improve existing DevOps cultures by encouraging greater communication, clarity, and understanding between teams.

Finally, site reliability engineers are specialists in considering and conveying concerns in relation to the wider organization and can extract metrics that can prove extremely valuable for other departments.

Does SRE complement DevOps?

DevOps and SRE work extremely well together. This is largely because both are designed with automation, inter-team collaboration, and communication in mind, as well as boosting efficiency and reliability within IT pipelines. The SRE Practitioner qualification even comes from the DevOps Institute.

What do I need to study site reliability engineering?

There are no prerequisites for taking this course. However, it can be helpful to have pre-existing knowledge of SRE, as well as DevOps.

Why is SRE necessary?

SRE was originally developed by Google. Its purpose is to quantify the relationship between Development and Operations teams, ensuring that code is created efficiently, reliably, and with operational factors in mind. This is particularly valuable in organizations where IT departments and teams have become siloed from one another.

Who can benefit from studying SRE?

SRE is ideal for organizations that rely on developing and releasing code. It works particularly well in DevOps environments and is a popular choice with DevOps engineers and DevOps Leaders. Given the growing popularity of SRE, a qualified and experienced practitioner will often find it easier to take the next step in their career.

Purchase Options

Self-Paced eLearning

  • Fully accredited:
  • Exam vouchers included:
  • Course duration: 30+ hours
  • Access period: 12 months
  • Tutor support:
  • Quizzes & practice exams:
  • Mobile compatible:

Virtual Classroom

  • Fully accredited:
  • Exam voucher included:
  • Video conferencing:
  • Breakout rooms:
  • Slide annotations:
  • Live tutor feedback:
  • Quizzes & practice exams:
  • Mobile compatible:

Leading DevOps, SRE and AWS Trainer and International Speaker

Oguzhan is a seasoned expert within the fields of Cloud Architecture, DevOps, SRE and AWS working for some of the biggest organizations in the world including Microsoft, Vodafone, and Shell

Learn how to optimize your IT culture with this fully accredited SRE Practitioner (SREP) certification course from Good e-Learning!

Interested in Corporate Training for This Course?

Supercharge your organization's learning and development with an award-winning training provider. Contact us today for a FREE quote!
A Selection of Our Customers From Fortune's Finest:

Free resources

Looking for more related resources, articles and webinars?