-
TAT IT Technolgies
View all jobs
DevOps SRE Engineer - Observability & Automation
United Arab Emirates
· Contract
·
Associate
Urgent requirement for DevOps SRE Engineer - Observability & Automation is required for our banking clients in Abu Dhabi ,UAE
Strong experience in Terraform, IaC practice is MUST
Strong experience in Linux environments and performance troubleshooting is MUST
Strong experience in Banking is MUST
We’re looking for a talented Site Reliability Engineer (SRE) to keep our systems running smoothly, reliably, and at scale. Through smart automation, deep observability, and a calm head
in a crisis, you’ll help us balance speed, compliance, and stability, working alongside DevOps,Cloud, Quality Engineering, and Product teams to drive continuous improvements inperformance, security, and resilience..
ecosystems.
- Strong experience in Kafka, RabbitMQ, Redis, RDS/Aurora ---Must
- Strong experience in observability (metrics, logs, traces, dashboards, and alerts) is Must
Strong experience in Terraform, IaC practice is MUST
Strong experience in Linux environments and performance troubleshooting is MUST
Strong experience in Banking is MUST
We’re looking for a talented Site Reliability Engineer (SRE) to keep our systems running smoothly, reliably, and at scale. Through smart automation, deep observability, and a calm head
in a crisis, you’ll help us balance speed, compliance, and stability, working alongside DevOps,Cloud, Quality Engineering, and Product teams to drive continuous improvements inperformance, security, and resilience..
- Define and implement SLIs / SLOs and error budgets for business-critical digital banking
- Build actionable observability (metrics, logs, traces, dashboards, and alerts) using Dynatrace,
- Leverage AI-driven insights and anomaly detection (Dynatrace Davis AI or equivalent AIOps
- Lead incident management — from on-call triage and root-cause analysis to blameless
- Improve deployment safety with robust rollout / rollback strategies, canary and blue-green
- Support and optimize microservices-based architectures, ensuring service reliability,
- Conduct capacity planning, performance tuning, and resilience testing, optimizing for both
- Automate operational toil — from runbooks and remediation scripts to proactive health checks
- Collaborate with DevOps to embed reliability gates and validations into CI / CD pipelines
- Own and evolve the observability and AIOps stack, driving intelligent automation and predictive
- Maintain high-quality documentation, playbooks, and operational standards across
- Ensure operational compliance and security alignment with internal controls and regulatory
- Analyze system performance, availability, and cost data to continually optimize operations.
- Provide reliability support and escalation guidance for critical production systems during major
- 5+ years of experience in SRE or DevOps roles, building and managing large-scale,
ecosystems.
- Bachelor’s degree in Computer Science or equivalent technical experience.
- Strong experience with Linux environments and performance troubleshooting.
- Proven expertise in Terraform and Infrastructure as Code (IaC) methodologies.
- Proficiency with Kubernetes and container orchestration in microservices environments.
- Hands-on experience with AWS (preferred); exposure to Azure or GCP is an advantage.
- Deep knowledge of Dynatrace (AIOps, Davis AI), Prometheus, Grafana, and the ELK stack.
- Experience implementing AI / ML-driven reliability or automation solutions (AIOps, anomaly
- Practical understanding of CI / CD pipelines (GitHub Actions, Jenkins, GitLab CI / CD or Azure
- Experience with Kafka, RabbitMQ, Redis, Aurora, and RDS databases.
- Strong scripting or programming skills in Python, Bash, or Go.
Key Skills
Ranked by relevance
ai
microservices
kubernetes
gitlab ci
terraform
rabbitmq
jenkins
grafana
devops
gitlab
redis
kafka
linux
elk
infrastructure as code
prometheus
python
docker
bash
aws
gcp
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Senior Cloud DevSecOps Engineer with Mobile Apps Pipeline - Banking Domain
2026-03-25
Contract
Mid-Senior
United Arab Emirates
Technology
Engineering
View Job Details
Related
Senior AI Engineer / Agentic AI Engineer
2026-03-30
Contract
Mid-Senior
United Arab Emirates
Technology
Engineering
View Job Details
Related
Technical Analyst – Wealth & Brokerage Platform in Banking domain
2026-03-24
Contract
Mid-Senior
United Arab Emirates
Technology
Information Technology
Login to Apply
- Posted
- Mar 28, 2026
- Type
- Contract
- Level
- Associate
- Location
- Abu Dhabi
- Company
- TAT IT Technolgies
Industries
Technology
Information
Internet
Categories
Engineering
Information Technology
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Senior Cloud DevSecOps Engineer with Mobile Apps Pipeline - Banking Domain
2026-03-25
Contract
Mid-Senior
United Arab Emirates
Technology
Engineering
View Job Details
Related
Senior AI Engineer / Agentic AI Engineer
2026-03-30
Contract
Mid-Senior
United Arab Emirates
Technology
Engineering
View Job Details
Related
Technical Analyst – Wealth & Brokerage Platform in Banking domain
2026-03-24
Contract
Mid-Senior
United Arab Emirates
Technology
Information Technology