The Glove
Senior Technology Engineer -DevOps, MLOps, Openshift, CI/CD
The GloveUnited Arab Emirates11 days ago
Full-timeInformation Technology

We are currently hiring for the position of Senior Technology Engineer for Dubai location. This role requires strong hands-on experience with DevOps, MLOps, Openshift, CI/CD tools.

Please find the detailed job description below. If you are interested, kindly share your updated profile at the earliest.


We are looking for candidates from INDIA and UAE.


Job Description:


Skills Required:

-CICD/OpenShift AI

-AI/ML (AIOps/MlOps/LLMOps)


Key Responsibilities:

Must have 7+ years of work experience.


CI/CD, DevOps & Platform Engineering

  • Design and implement end-to-end CI/CD pipelines using tools such as Tekton, ArgoCD, GitLab CI, or GitHub Actions.
  • Build automated deployment pipelines for ML models, API endpoints, and data pipelines.
  • Manage and optimize containerized applications using Docker and Kubernetes/OpenShift.
  • Implement IaC using Terraform / Ansible for infrastructure provisioning.
  • Integrate security best practices (DevSecOps, image scanning, secrets management, policy enforcement).


OpenShift AI / OpenShift Data Science

  • Deploy and manage OpenShift AI (RHODS) environments.
  • Configure JupyterHub, Workbenches, Model Serving, and Distributed Training environments.
  • Automate GPU/accelerator configurations and model-training workloads.
  • Implement monitoring, autoscaling, and performance optimization for AI workloads.


MLOps / LLMOps

  • Build and maintain ML/LLM model pipelines across training, evaluation, packaging, deployment, and monitoring.
  • Develop reproducible training workflows using tools like:
  • KServe, Seldon, BentoML, Ray, MLflow, Kubeflow, Airflow
  • Implement model versioning, registry management, and feature store integrations.
  • Optimize model serving, including LLM serving (vLLM, HuggingFace TGI, Triton, Inferentia, etc.).
  • Implement real-time model monitoring (drift detection, performance, logging, tracing).


AIOps

  • Use AI/ML techniques to enhance platform reliability—predictive scaling, anomaly detection, and intelligent alerting.
  • Leverage tools like Prometheus, Grafana, ELK/EFK, Dynatrace, Datadog, New Relic for observability automation.
  • Implement self-healing automation using event-driven workflows (Argo Events, Knative, Kafka).


Qualifications:

Education

· Degree, Post graduate in Computer Science or related field (or equivalent industry experience)


If this opportunity aligns with your experience and career goals, please share your updated resume at [email protected]

Key Skills

Ranked by relevance