Post your CV and find your next job on Indeed!

Pagerduty jobs in England

Sort by: -

Senior Site Reliability Engineer

New

Boston Consulting Group

London

The Senior Site Reliability Engineer is responsible for running the engineering capability behind a defined area of reliability across the organisation.

Senior Site Reliability Engineer

NICE

Southampton

Run the production environment by monitoring availability and taking a holistic view of system health.
Improve reliability, quality, and time-to-market of our…

View all NICE jobs - Southampton jobs - Site Reliability Engineer jobs in Southampton
Salary Search: Senior Site Reliability Engineer salaries in Southampton

IT Support Engineer, EMEA

Okta

London

Annual leave
Company events

In this role, you will be responsible for helping to facilitate our new employee onboarding, quickly resolving employee IT issues and developing a strong…

View all Okta jobs - London jobs - IT Support jobs in London
Salary Search: IT Support Engineer, EMEA salaries in London

IT Support Engineer, EMEA
Okta
London
Annual leave
Company events
In this role, you will be responsible for helping to facilitate our new employee onboarding, quickly resolving employee IT issues and developing a strong…

View all Okta jobs - London jobs - IT Support jobs in London
Salary Search: IT Support Engineer, EMEA salaries in London
Crypto Operations Lead
IFT
London
We are building an institutional-grade on-chain treasury operations function in-house.

Own the day-to-day operation, integrity and security of the group's on-…

View all IFT jobs - London jobs - Operations Lead jobs in London
Salary Search: Crypto Operations Lead salaries in London
Senior Site Reliability Engineer
Tes Global
Sheffield S1 2JE
Annual leave
Employee assistance programme
Company pension
Cycle to work scheme
Contract Type: Full time, permanent.

As a Senior SRE Engineer, you will be pivotal in designing and implementing best SRE practices while fostering a culture of…

View all Tes Global jobs - Sheffield jobs - Site Reliability Engineer jobs in Sheffield
Salary Search: Senior Site Reliability Engineer salaries in Sheffield
See popular questions & answers about Tes Global
Staff / Senior Engineer (IBM Sterling)
Mindera
London
We are seeking an experienced engineer with strong expertise in IBM Sterling Order Management System (OMS) and Sterling Intelligent Promising (SIP) to join our…

View all Mindera jobs - London jobs - Staff Engineer jobs in London
Salary Search: Staff / Senior Engineer (IBM Sterling) salaries in London
See popular questions & answers about Mindera
Engineering Manager, DevOps
iProov
London
Annual leave
Employee discount
Company pension
Cycle to work scheme
Car scheme
Discounted gym membership
IProov provides science-based biometric solutions that enable the world’s most security-conscious organizations to streamline secure remote onboarding and…

View all iProov jobs - London jobs - DevOps Engineer jobs in London
Salary Search: Engineering Manager, DevOps salaries in London
Site Reliability Engineer (SRE)
xAI
London
Employee discount
Life insurance
XAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

£107,000 - £262,000 GBP.

View all xAI jobs - London jobs - Site Reliability Engineer jobs in London
Salary Search: Site Reliability Engineer (SRE) salaries in London
Infrastructure Security Engineer
Blockchain.com
London
You will play a critical role in building and providing tooling that monitors our entire stack, from supporting our consumer-facing products and our…

View all Blockchain.com jobs - London jobs - Infrastructure Engineer jobs in London
Salary Search: Infrastructure Security Engineer salaries in London
See popular questions & answers about Blockchain.com
Senior Incident and Resilience Manager
DNA Payments
London SW1W 0EN
Own, develop, and continuously evolve a robust and effective incident management framework across the organisation, ensuring incidents are managed with speed,…

View all DNA Payments jobs - London jobs - Incident Manager jobs in London
Salary Search: Senior Incident and Resilience Manager salaries in London
See popular questions & answers about DNA Payments

Job Post Details

Senior Site Reliability Engineer - job post

Boston Consulting Group

4.2 out of 5 stars

London•Hybrid work

Full-time

You must create an Indeed account before continuing to the company website to apply

Job details

Job type

Full-time

Location

London•Hybrid work

Full job description

Who We Are

Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today, we help clients with total transformation-inspiring complex change, enabling organizations to grow, building competitive advantage, and driving bottom-line impact.

To succeed, organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and functional expertise and a range of perspectives to spark change. BCG delivers solutions through leading-edge management consulting along with technology and design, corporate and digital ventures—and business purpose. We work in a uniquely collaborative model across the firm and throughout all levels of the client organization, generating results that allow our clients to thrive.

What You'll Do

The Senior Site Reliability Engineer is responsible for running the engineering capability behind a defined area of reliability across the organisation. The role works across multiple SRE disciplines including infrastructure, cloud, observability, automation, identity, security, and network operations, applying engineering thinking to reduce operational toil, improve resilience, and embed reliability and governance into delivery and operational workflows.

The role drives engineering quality and consistency within its scope of responsibility, contributes to wider engineering standards, and helps shape how reliability is delivered across the organisation. It builds reusable patterns, mentors engineers, and provides senior engineering input across a wider set of stakeholders.

The ideal candidate is a senior practitioner who is comfortable operating across multiple domains, balances delivery with mentorship, and can articulate engineering trade-offs clearly to both technical and non-technical audiences.

Core responsibilities

Run and continuously improve the reliability engineering systems within scope, including automation, pipelines, observability, and operational tooling.
Design and implement engineering solutions that eliminate operational toil at scale and embed reliability into delivery workflows.
Help shape engineering standards, patterns, and reusable frameworks across the SRE practice.
Lead the engineering response to complex incidents within scope, drive systemic remediation, and contribute to post-incident learning.
Mentor and coach less senior engineers across reliability engineering, automation, observability, and SRE principles.
Drive cross-team collaboration with engineering, platform, and operations functions to embed reliability and governance through engineering controls.
Communicate engineering status, risks, and recommendations clearly to senior stakeholders and leadership forums.
Contribute to monthly operational reviews with structured metrics on service health, ingestion or pipeline performance, automation coverage, and improvement progress.

What You'll Bring

5–8 years of experience in Site Reliability Engineering, Platform Engineering, or related operational engineering disciplines.
Strong hands-on experience across multiple SRE domains, including cloud, automation, observability, and CI/CD.
Demonstrated experience designing and implementing automation and reliability solutions at scale.
Deep knowledge of at least one cloud platform (AWS or Azure), including networking, identity, and observability primitives.
Experience with Infrastructure-as-Code (e.g. Terraform) and CI/CD pipelines.
Strong scripting experience (e.g. Python).
Experience leading incident response and driving systemic improvement.
Strong stakeholder engagement and technical communication skills.
Deep hands-on experience with one or more enterprise observability platforms (e.g. Splunk, Datadog).
Proven experience designing and operating telemetry pipelines, ingestion controls, and observability cost management.
Proven experience designing signals (SLIs, SLOs, synthetic checks, alerts) and ops automation triggered from those signals.
Experience driving SLO/SLI practices across multiple teams.
Deep hands-on experience operating cloud infrastructure across at least two of AWS, Azure, GCP, or Alibaba Cloud.
Proven experience designing reusable IaC patterns and landing zone components across cloud providers.
Strong working knowledge of cloud networking, account management, identity primitives, and policy enforcement across providers.
Experience driving cloud platform engineering standards and governance across multiple teams.
Deep hands-on experience with identity platforms (e.g. Entra ID) and secrets management (e.g. HashiCorp Vault).
Proven experience designing OIDC, workload identity, and dynamic credential patterns.
Experience driving Zero Trust and least-privilege adoption across multiple teams.
Deep hands-on experience with security tooling embedded in CI/CD pipelines.
Proven experience designing policy-as-code controls and secure-by-default patterns.
Experience driving secure engineering adoption across multiple teams.
Deep hands-on experience with hybrid and cloud network architectures.
Proven experience designing automated network controls through IaC.
Experience driving Zero Trust segmentation and network observability adoption.

Preferred qualifications

Experience working within a federated, multi-cloud, or large enterprise environment.
Familiarity with containerisation (Docker) and orchestration (Kubernetes).
Experience with secrets management tooling (e.g. HashiCorp Vault).
Cloud certification at professional level.
Experience with policy-as-code tooling (e.g. OPA, Sentinel).
Experience contributing to engineering communities of practice.
Experience with AIOps, noise reduction, and event correlation.
Experience with event-driven ops automation platforms (e.g. ServiceNow, PagerDuty, custom workflows).
Ability to lead complex observability platform incidents and capacity reviews.
Experience with cloud FinOps, cost engineering, and chargeback tooling.
Hands-on experience with Alibaba Cloud platform architecture.
Experience with cloud policy-as-code tools (e.g. AWS Service Control Policies, Azure Policy, OPA).
Strong understanding of identity-related security risks and mitigations.
Strong understanding of common security risks and mitigations across the SDLC.
Strong understanding of network reliability, observability, and security patterns.

Who You'll Work With

Hybrid or on-site work model.
Operates as a senior individual contributor with mentorship and cross-team influence.
Expected to participate in on-call rotation and lead incident response.
Occasional travel may be required for team or stakeholder engagement.

Boston Consulting Group is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity / expression, national origin, disability, protected veteran status, or any other characteristic protected under national, provincial, or local law, where applicable, and those with criminal histories will be considered in a manner consistent with applicable state and local laws.
BCG is an E - Verify Employer. Click here for more information on E-Verify.

Let Employers Find YouUpload Your Resume