Ampstek
Site Reliability Engineer
Ampstek, United Arab Emirates
Contract, Information Technology

Job Title: Site Reliability Engineer (SRE)

Location: Abu Dhabi

Onsite Opportunity

Fixed-term contract

Salary: AED 10,000 per month


Job Description:

We are seeking an experienced Site Reliability Engineer (SRE) with 6+ years of hands-on experience in building, maintaining, and improving highly scalable and reliable systems. The ideal candidate will be responsible for ensuring the availability, performance, and stability of production environments while working closely with development, infrastructure, and operations teams.


Key Responsibilities:

  • Ensure high availability, reliability, and performance of production systems and services.
  • Monitor system health using observability tools and respond to alerts and incidents.
  • Manage incident response, perform root cause analysis (RCA), and implement preventive measures.
  • Automate routine operational tasks to improve efficiency and reduce manual intervention.
  • Collaborate with development teams to improve application performance and reliability.


Required Skills & Experience:

  • 6+ years of experience in Site Reliability Engineering, DevOps, or production support roles.
  • Strong experience with cloud platforms such as AWS, Azure, or GCP.
  • Proficiency in scripting or programming languages such as Python, Bash, or Go.
  • Hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK).
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or similar platforms.
  • Strong understanding of Linux/Unix systems and networking fundamentals.


Preferred Skills:

  • Experience with microservices architecture and distributed systems.
  • Familiarity with incident management and SRE best practices (SLI/SLO/SLA).
