NorthBay Solutions
DevOps Support Engineer
NorthBay SolutionsUnited Arab Emirates1 day ago
Full-timeCustomer Service

DevOps Support Engineer

Job Type: Full-time
Work Mode: Onsite (Candidates outside UAE must be willing to relocate)

Role Overview
The DevOps Support Engineer provides operational support for cloud infrastructure, CI/CD pipelines, container platforms, and AI workloads. The role focuses on maintaining platform stability, troubleshooting deployments, monitoring environments, and responding to infrastructure incidents.

Key Responsibilities

• Support cloud infrastructure including subscriptions, networking, and access control
• Monitor GPU clusters, container environments, and AI runtime systems
• Troubleshoot deployment failures across Sandbox, Staging, and Production environments
• Monitor CI/CD pipelines and resolve build or deployment issues
• Support version control workflows and release rollouts
• Monitor GPU utilization, memory usage, and AI inference performance
• Troubleshoot API gateway routing, throttling, and authentication issues
• Support integrations with enterprise platforms such as Microsoft 365, SharePoint, Teams, Oracle, and Jira
• Monitor system metrics including CPU, GPU, memory, storage, and logs
• Act as first responder for infrastructure and platform incidents (P0–P3)
• Perform incident triage and escalate complex issues to engineering teams
• Support Kubernetes clusters and Docker container environments
• Maintain infrastructure runbooks, troubleshooting guides, and RCA documentation

Required Skills

• Experience with cloud platforms such as Microsoft Azure, Amazon Web Services, or Google Cloud Platform
• Experience supporting container environments using Kubernetes and Docker
• Familiarity with CI/CD tools such as Azure DevOps, GitHub Actions, or Jenkins
• Experience with monitoring tools including Azure Monitor, Dynatrace, or Grafana
• Understanding of networking, IAM, API gateways, and infrastructure monitoring
• Familiarity with Infrastructure-as-Code tools such as Terraform

Experience

• 4–7 years of experience in DevOps, Cloud Operations, Platform Support, or SRE roles
• Experience supporting containerized or AI workloads preferred
• Experience working in regulated or government environments is a plus
• Arabic language skills are an advantage

Key Skills

Ranked by relevance