Sundus
Senior AI/ML Engineer
Sundus, United Arab Emirates
Full-time; Engineering, Information Technology
Job Code: 5427

Position: Senior AI/ML Engineer

Experience: 4+ years

Location: Abu Dhabi / Dubai

Client: Government entity

Service Duration: 1-year contract, to be renewed annually

Role Summary

We are seeking Senior AI/ML Engineers to join our AI Services team. You will design, build, and operate AI-powered microservices that provide core business capabilities such as OCR/IDP, NLP, computer vision, embeddings, and LLM services. This role is hands-on and focused on delivering production-grade AI services from prototype to deployment, ensuring scalability, reliability, and cost efficiency.

Key Responsibilities

  • Design, implement, and maintain AI microservices (Python/FastAPI preferred) with strong APIs and backward-compatible versioning.
  • Productionize ML/AI models (training, fine-tuning, evaluation, optimization) using ONNX Runtime, TensorRT, Triton Inference Server, and vLLM.
  • Build and maintain MLOps/LLMOps pipelines with CI/CD, a model registry, data versioning (DVC/lakeFS), and canary/shadow deployments.
  • Develop services for OCR/IDP, NLP, CV, embeddings, and LLM inference with a focus on low latency and high throughput.
  • Implement observability and reliability: tracing, metrics, and logging (OpenTelemetry, Prometheus, Grafana), plus automated rollback.
  • Optimize infrastructure for performance and cost efficiency: autoscaling, batching, caching, GPU/CPU right-sizing.
  • Collaborate with the AI Lead, data engineers, and platform teams to integrate services into enterprise systems.
  • Write robust tests (unit, integration, performance) and clear technical documentation.

Must Have Qualifications

  • Bachelor’s or master’s degree in computer science, AI/ML, or a related field.
  • 4+ years of total software/ML engineering experience, including 2+ years building production AI services.
  • Strong proficiency in Python and microservice frameworks (FastAPI/Flask).
  • Experience with Docker, Kubernetes, Helm, GitHub Actions/Azure DevOps.
  • Hands-on experience with PyTorch, Hugging Face, OpenCV, and PaddleOCR/Tesseract.
  • Practical knowledge of inference optimization (ONNX, TensorRT, quantization, batching).
  • Familiarity with Kafka/RabbitMQ, Redis, PostgreSQL, and vector DBs (FAISS, Milvus, pgvector).
  • Cloud-native deployment experience (Azure preferred; AWS/GCP acceptable).
  • Strong software engineering fundamentals (API design, testing, CI/CD).

Nice to Have

  • Experience with C#/.NET for enterprise microservices and API integration.
  • Exposure to RAG architectures, retrieval evaluation, and safety/guardrail techniques.
  • Knowledge of event-driven architectures, API gateways, and service mesh (Istio/Linkerd).
  • Familiarity with data engineering tools (Airflow/Prefect, Delta/Lakehouse) and feature stores (Feast).
  • Experience with Arabic NLP and multilingual systems.
