Site Reliability Engineer - 12-month rolling daily rate contract
Site Reliability Engineer – Observability Focus - 12-month daily rate contract (rolling)
Join a leading enterprise as a Site Reliability Engineer (SRE) with a strong focus on observability, operating at scale within a highly dynamic, cloud-native environment.
This role sits within a cutting-edge Cloud Operations team responsible for ensuring the performance, reliability, and scalability of critical applications across a rapidly expanding cloud estate. You’ll drive observability excellence through the full Helm-based Prometheus stack, including self-hosted Prometheus, Grafana, Loki, and Tempo.
You’ll design and implement advanced Grafana dashboard templating, alert routing, and OpenTelemetry Collector pipelines. Your work will directly influence the team’s trace sampling strategies (head-based, tail-based, exemplars) and ensure log forwarding from Kubernetes (via Fluent Bit, Fluentd, or OTel) is optimized for clarity and performance. A key part of your remit will be enforcing standards for metric generation, labeling, and cardinality control.
The role requires a deep understanding of SRE principles, SLIs/SLOs, MTTR, and error budgets, with a governance mindset over usage, cost control, and environment-aware monitoring setups.
If you have strong experience with observability tooling and Kubernetes management at scale, and you thrive in a culture of continuous improvement and automation, we want to hear from you.
What you’ll need:
- 5+ years in cloud infrastructure or SRE roles
- Deep expertise in Prometheus, Grafana, Loki, Tempo, OpenTelemetry
- Proficiency in Kubernetes, Helm, and Terraform
- Strong coding skills in Python, Go, or Bash
This is your chance to work in a high-impact role at the forefront of observability engineering in a major enterprise environment.
- Posted: Aug 05, 2025
- Type: Contract
- Level: Mid-Senior
- Location: Dublin
- Company: Hadfield Green