Site Reliability Engineering (SRE) refers to the application of software engineering practices to the management of IT infrastructure and operations. This instructor-led, live training (online or onsite) is aimed at technical persons who wish to apply software engineering tools and techniques
Overview
Site Reliability Engineering (SRE) refers to the application of software engineering practices to the management of IT infrastructure and operations. This instructor-led, live training (online or onsite) is aimed at technical persons who wish to apply software engineering tools and techniques to more efficiently manage an IT system.
By the end of this training, participants will be able to:
Apply a disciplined software engineering approach to solve IT operations problems.
Create software to manage systems and automate IT operations tasks.
Develop systems to increase site reliability and performance.
Bridge the work of development and operations by applying a software engineering mindset to system administration.
Format Of The Course
Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.
Course Customization Options
To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
How SRE marries traditional IT and software development.
The need for automation and observability
The role of a software engineers vs system administrators.
Site Reliability Engineers vs DevOps engineers.
Overview of an IT System
System architecture, on-premise and in the cloud.
Overview of SRE Principles and Practices
Infrastructure as a Code.
The role of containerization and orchestration (Docker, Kubernetes, etc.)
Continuous Integration, Continuous Deployment and Continuous Delivery.
Observability.
Evaluating an IT System
Taking stock of the team and organizational resources.
Maping out the systems and processes.
Estimating the potential impact of SRE.
The role the software engineering team.
The role of the operational team.
The role of management.
Maintaining the Reliability of a System
Describing and measuring the desired reliability of a service.
Understanding Service Level Objectives (SLOs)
Understanding Service Level Indicators (SLIs) and Service Level Agreements (SLAs).
Working with Error Budgets.
Developing an SLO.
Optimizing System Administration
Setting up a development environment
Evaluating SRE tools
Prioritizing tasks for automation.
Writing software.
Deploying "Infrastructure as Code"
Testing and iterating code
Making a system anti-fragile
Learning from failure
Monitoring a System
Observing system performance.
SRE tools and techniques.
Requirements
A general understanding of IT infrastructure.
A general idea of the software development process.
Programming or scripting experience in any language.
Audience
Developers
System administrators
Software Architects
DevOps engneers
IT Managers
NobleProg is an international training and consultancy group, delivering high quality courses to every sector, covering: Cyber Security, Artificial Intelligence, IT, Management, Applied Statistics.
Over the last 17 years, we have trained more than 50,000 people from over 6000 companies and organisations.
Our courses include classroom (both public and closed) and instructor-led online giving you choice and flexibility to suit your time, budget and level of expertise.
We practice what we preach – we use a great deal of the technologies and methods that we teach, and continuously upgrade and improve our courses, keeping up to date with all the latest developments.
Our trainers are hand picked and have been through rigorous checks and interviews, and all courses are evaluated by delegates ensuring continuous feedback and improvement.
This course covers the application of contemporary DevOps tools in a hands-on learning environment. The course is guided by two leading thinkers in this space.
The word DevOps is a contraction of ‘Development’ and ‘Operations’. DevOps is a set of best practices that emphasize the collaboration and communication of IT-professionals (developers, operators, and support staff) in the lifecycle of applications and services, leading to:
Many research fields can benefit from automation and programmatic techniques, ranging from the humanities and social sciences through biomedical sciences and engineering. The tools and techniques taught in this workshop will be of use to anyone who currently uses a computer for their research.
This 3-day certification course, designed by Pink Elephant, provides a new and unique perspective of DevOps through the ‘Full Stack’ of culture, practices and automation.
This comprehensive training program is designed to equip students with the knowledge and skills necessary to excel as DevOps Engineers. Participants will learn how to bridge the gap between development and operations, automate workflows, and manage infrastructure as code.
© 2025 coursetakers.com All Rights Reserved. Terms and Conditions of use | Privacy Policy