Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
⚙️ HIRING: Senior SRE / DevOps Lead | Avrioc | UAE 🇦🇪
We’re looking for a Seasoned DevOps & Site Reliability Engineer (SRE) Lead to design, scale, and enhance our cloud infrastructure and observability ecosystem.
If you’re passionate about automation, resilience, and reliability — this role is for you!
🔧 Key Responsibilities
- Architect and deploy scalable, highly available cloud infrastructure for production workloads.
- Lead and implement SRE best practices, ensuring system reliability, performance, and scalability.
- Oversee and optimize CI/CD pipelines (Jenkins, Argo CD or similar) for seamless deployments.
- Define and monitor SLOs & SLIs to ensure service reliability and uptime.
- Design and manage observability frameworks — monitoring, logging, and alerting (Elastic Stack, Prometheus, Grafana, Dynatrace, New Relic).
- Manage and optimize Kubernetes clusters and Helm charts for efficient orchestration and streamlined releases.
- Implement auto-healing and proactive monitoring systems to prevent outages.
- Drive fault injection testing & chaos engineering (Chaos Mesh, Litmus, AWS FIS) for resilience validation.
- Collaborate with engineering and product teams to embed reliability into every phase of development.
- Maintain clear documentation on infrastructure, incidents, and operational processes.
🧩 Requirements
- 8+ years of experience as a DevOps/SRE professional, leading enterprise SRE implementations.
- Hands-on with AWS, GCP, or Azure (EC2, S3, RDS, Lambda, etc.).
- Strong with IaC tools (Terraform, CloudFormation, Ansible).
- Proven experience in CI/CD automation, monitoring, and incident response.
- Skilled in observability tools — Elastic Stack, Grafana, Prometheus, Dynatrace, New Relic.
- Strong Kubernetes & Helm expertise for large-scale deployments.
- Experience with AWS managed & self-managed databases (MySQL, Cassandra, etc.).
- Skilled in Python, Bash, or Go scripting.
- Experience designing and testing BCP/DR strategies.
- Proactive in capacity planning, ensuring scalability and resilience across cloud environments.
- Excellent communication, documentation, and troubleshooting skills.
🛡️ Information Security Responsibilities
- Comply with Avrioc’s Information Security & Service Management policies.
- Maintain the confidentiality and integrity of all information assets.
- Attend mandatory information security trainings.
- Report any security incidents through official channels.
Key Skills
Ranked by relevanceReady to apply?
Join Avrioc Technologies and take your career to the next level!
Application takes less than 5 minutes

