Ampstek
Site Reliability Engineer
AmpstekUnited Arab Emirates14 hours ago
ContractInformation Technology

Job Title: Site Reliability Engineer (SRE)

Role Overview

We are seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems. The ideal candidate will bridge the gap between development and operations by applying software engineering principles to infrastructure and system management.

Key Responsibilities

  • Design, build, and maintain scalable, highly available, and resilient systems.
  • Monitor system performance and ensure uptime, reliability, and efficiency.
  • Develop and implement automation tools for deployment, monitoring, and operations.
  • Manage incident response, root cause analysis (RCA), and post-mortems.
  • Collaborate with development teams to improve system reliability and performance.
  • Implement CI/CD pipelines and improve release processes.
  • Optimize infrastructure costs and performance.
  • Ensure system security, compliance, and best practices.
  • Create and maintain documentation for systems, processes, and runbooks

Key Skills

Ranked by relevance