Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
DeepLight is a pioneering AI company committed to pushing the boundaries of innovation in artificial intelligence. Our mission is to harness the power of data and machine learning to revolutionize industries and create a brighter future. With a dynamic team of experts and a culture of relentless innovation, we are at the forefront of AI research and development.
Position Overview:
DeepLight is seeking an experienced AWS Platform Engineer to join our cutting-edge technology team in the UAE. This role goes beyond design and development—you will be directly responsible for implementing, deploying, and operationalizing our cloud infrastructure on AWS to support real-time AI workloads and enterprise-grade solutions.
You will lead the end-to-end implementation of robust, scalable, and secure cloud architectures that enable DeepLight's data pipelines, machine learning platforms, and client-facing applications. Your work will ensure that our solutions move seamlessly from design into production, powering our mission-critical systems.
Requirements
Responsibilities:
- Cloud Infrastructure Management: Design, implement, and maintain secure, scalable, and highly available cloud infrastructure to support a range of applications and services, including data-intensive and AI workloads, following industry best practices
- Automation: Implement Infrastructure as Code (IaC) using tools such as Terraform or equivalent to automate the provisioning, deployment, monitoring, and management of cloud infrastructure across environments
- Security and Compliance: Enforce and implement security best practices and compliance measures, including identity and access management, data protection, encryption, and vulnerability scanning—especially for AI and analytics data pipelines
- Performance Optimization: Monitor, analyze, and actively improve system performance, ensuring reliability and operational efficiency across workloads, including those supporting machine learning model training and inference
- Continuous Integration/Continuous Deployment (CI/CD): Develop, implement, and manage CI/CD pipelines to streamline deployment processes, including integration with MLOps workflows for rapid and stable delivery of applications and AI services
- Collaboration: Work closely with software engineers, data teams, and operations groups to gather requirements, implement solutions, troubleshoot issues, and support the full deployment lifecycle for analytics and AI workloads
- Documentation: Create and maintain comprehensive documentation for system architecture, deployment workflows, configuration standards, and operational processes—including infrastructure components relevant to AI and machine learning
- Proven experience as a Platform or Infrastructure Engineer, or in a similar role, with a strong background in building and maintaining cloud-agnostic infrastructure environments supporting scalable applications and AI workloads
- Hands-on experience with a variety of cloud platforms and services, including compute, storage, networking, identity management, and container orchestration tools, across public cloud, hybrid, or on-prem environments
- Proficiency with containerization and orchestration technologies, such as Docker and Kubernetes, and familiarity with microservices-based architecture and distributed systems design
- Demonstrated experience in DevOps practices, including CI/CD pipeline creation and integration, configuration management, and system monitoring with tools like Jenkins, Git, Prometheus, or Grafana
- Experience supporting AI/ML infrastructure, including the provisioning of GPU-accelerated environments, managing ML training pipelines, and deploying models in production environments using infrastructure automation and MLOps tooling
- Strong analytical and problem-solving skills, with a track record of delivering production-ready infrastructure in cross-functional teams, including collaborations with data science and machine learning engineers
- Commitment to continuous learning and staying current with emerging technologies and best practices in cloud infrastructure, automation, and AI operations
Why Join DeepLight?
- Impact: Be part of a dynamic team that is shaping the future of AI and making a meaningful impact on industries and society
- Innovation: Work on cutting-edge projects at the intersection of AI, data engineering, and machine learning, leveraging the latest technologies and methodologies
- Collaboration: Collaborate with a diverse team of experts from various disciplines, fostering creativity, learning, and growth
- Opportunity: Enjoy ample opportunities for professional development, career advancement, and leadership roles in a rapidly growing company
- Culture: Join a culture of curiosity, excellence, and collaboration, where your ideas are valued, and your contributions are recognized and rewarded