Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
A high-growth technology organisation is seeking a Cloud Infrastructure & Reliability Engineer to strengthen its internal platform capabilities. Reporting to the Platforms Lead, this role is responsible for designing, scaling, and operating the AWS environments that underpin engineering, data, and AI workloads across the business.
The position sits at the intersection of infrastructure engineering, automation, and developer enablement. The successful candidate will take ownership of infrastructure-as-code patterns, deployment pipelines, and reliability practices, while contributing to major cross-functional initiatives such as a globally distributed secure connectivity platform.
This role is well-suited to engineers who enjoy building durable systems, eliminating operational friction, and improving platform performance through measurable outcomes.
Key Responsibilities
Cloud Platform Engineering
- Design, deploy, and operate AWS infrastructure using Infrastructure-as-Code (Terraform).
- Develop and maintain reusable infrastructure modules and automation patterns across core AWS services.
- Improve availability, performance, and resilience across cloud-hosted workloads.
Reliability & Automation
- Strengthen platform reliability through monitoring, automation, and operational best practices.
- Reduce manual intervention by embedding guardrails, compliance checks, and self-service workflows.
- Drive improvements across deployment frequency, lead time, recovery time, and change reliability.
CI/CD & Developer Enablement
- Enhance CI/CD pipelines (GitLab preferred) and integrate infrastructure and compliance workflows.
- Improve developer experience by simplifying provisioning, deployment, and operational processes.
Global Infrastructure Initiatives
- Contribute to the design and operation of a globally distributed secure connectivity platform.
- Collaborate with cross-functional teams to ensure infrastructure supports international scale and reliability.
Skills & Experience
Required
- Strong hands-on experience operating AWS environments in production.
- Proficiency with Terraform and Infrastructure-as-Code practices.
- Deep understanding of AWS services including compute, storage, networking, and identity.
- Experience integrating CI/CD systems with cloud infrastructure.
- Programming or scripting capability in Go, Python, or TypeScript.
- Familiarity with Docker and Kubernetes-based workloads.
- Solid grounding in DevSecOps principles and secure-by-design infrastructure.
- Strong diagnostic and troubleshooting skills in distributed systems.
If you have any questions about this role or others we have available you can email our team directly via [email protected]
Key Skills
Ranked by relevanceReady to apply?
Join TheDriveGroup and take your career to the next level!
Application takes less than 5 minutes

