-
Prism Digital

Senior SRE Engineer

Prism Digital
United Kingdom · Full-time · Mid-Senior

Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services


  • Location: London (Hybrid, typically 3 days onsite)
  • Permanent, Full-time
  • Salary: £80k–£90k + bonus + benefits
  • Visa sponsorship: Not available


The Role

You’ll join as the first dedicated SRE hire, with responsibility for establishing SRE practices across a live Azure-based platform and a new strategic platform being brought into service.


The role is focused on reliability, observability, incident management, resilience, and automation. You’ll help define how services are measured and operated, introducing practical improvements around SLIs, SLOs, error budgets, monitoring, and service ownership.


This is a hands-on role for someone who has done this before and can bring structure, prioritise well, and build an SRE capability in a pragmatic way.


Non-Negotiables

  • Site Reliability Engineering in production environments
  • Azure cloud environments in enterprise-scale businesses
  • SLO / SLI / error budget design and implementation
  • Observability tooling (Prometheus, Grafana, OpenTelemetry or similar)
  • Incident leadership across Sev1 / Sev2 environments
  • Disaster recovery, resilience testing, RTO / RPO
  • Terraform infrastructure as code
  • CI/CD pipelines and engineering enablement
  • Strong scripting with PowerShell, Bash or Python
  • Experience improving reliability in hybrid estates (cloud + IaaS)
  • Ability to introduce new ways of working and build an SRE practice from scratch

They are looking for someone with a strong Azure background, but the priority is proven SRE capability and the ability to apply it effectively.


What You’ll Work With

  • Azure platform engineering
  • Azure Container Apps / cloud-native services
  • Terraform infrastructure as code
  • Prometheus monitoring
  • Grafana dashboards
  • OpenTelemetry tracing
  • Azure DevOps pipelines
  • GitHub Actions CI/CD
  • Windows Server and Linux estates
  • Service Bus, Event Hubs and Kafka
  • Incident management, runbooks, failover and resilience testing


Nice to Haves

  • Financial services or regulated environment experience
  • FCA / PRA operational resilience exposure
  • Payments or FX platform experience
  • Chaos engineering
  • FinOps or cloud cost awareness
  • Kubernetes exposure

Kubernetes knowledge is useful, but not essential.


Why Join / Projects

  • Establish the SRE capability from the ground up
  • Define and implement SLIs, SLOs and error budgets
  • Improve observability across platforms and services
  • Lead incident response and post-incident improvements
  • Drive resilience, failover and automation initiatives
  • Support the move toward a modern, reliability-first platform

You’ll play a key role in shaping how reliability is engineered across both the current platform and a new strategic platform being brought into production.


Employee Benefits

  • Pension
  • Private healthcare
  • Training and certification support


Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services

Key Skills

Ranked by relevance

cloud prometheus powershell grafana devops server linux bash
Login to Apply
Posted
Mar 20, 2026
Type
Full-time
Level
Mid-Senior
Location
London Area

Industries

Technology Information Media Software Development Financial Services

Categories

Engineering Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
Prism Digital
Related

DevOps Engineer

2026-05-15

Full-time
Mid-Senior
Ireland
Software Development
Engineering
View Job Details
Prism Digital
Related

Senior Network Engineer

2026-05-19

Full-time
Mid-Senior
Ireland
Software Development
Engineering
View Job Details
Prism Digital
Related

Product Manager

2026-03-19

Full-time
Mid-Senior
Ireland
Technology
Engineering